H +(91)-977-67-43382
Surbhi Singh
B singh.surbhicse@gmail.com
SUMMAR y
AnITprofessionalwith4+Yearsofworkexperienceinsoftwaredevelopment, analytics
and big data domain, having experience in end to end application development with
addedhands- onexperienceinsolvingbigdataproblemsanddoananalysistoprovidea
better data driven solution out of it.
Agile - Team player - Passionate to learn - Adaptive person.
INTERESTS
Data Pipeline (End to End), Data Engineering, Text mining
SKILLS
Programming Python Scripting Shell Scripting
Database SQL, MySql, RDS Big Data Hadoop, PySpark, Hive, Sqoop, Pandas
Cloud AWS OS Windows, Linux
WebApp Flask Other Docker, Databricks, Terraform
EXPERIENCE
2019 Oct- Gartner Inc ,Role: Data Analytics Engineer
Present
2016 May- Accenture
Role: Application Development Analyst
2019Sep
ROLES & R ESPONSIBILITIES
- As a developer and data analytics engineer, currently taking responsibility in
designing/developing end-to-end data pipelines and orchestrate the entire workflow
foradatascience/dataengineering projects.
- Architecting and developing data driven solutions, tools and dashboards for
stakeholders in various domains including HR, Finance and Marketing.
- Worked for the design and development of a data analytics platform in Python that
includestoolsformultipledatasciencetaskslikesentiment analysis, recommendation
systems and NLP techniques bycollaborating with data scientist.
- Experience with end-to-end integration of models with databases and storage
systems like HDFS, Hive andSQL.
- Following agile methodologies like Scrum and maintaining detailed technical
documentations.
- Daily interaction with business stakeholders and meeting their expectations with the
right deliverable and demo presentation for everyspirit.
PrOJECTS
2019 Oct - Gartner Inc
Present
1. Early Risk Detection System Developed end-to-end data pipeline for Data Science
model to detect risk on membership using Sqoop, SQLAlchemy, Pyspark, Python on
AWS Platform(EC2, S3, EMR). Interacted with stake-holders and data scientists to
understand their requirement and built and implemented a solution around the
same.
Technology: Python, Pyspark, Sqoop, Hive, Shell Scripting, Pandas, AWS, Databricks
2. HR Biodata Worked closely with Data Scientists to build a data pipeline for Resume
Scoring Model. Worked on Flask API to get resume and candidate data from workday
and store the same on RDS and S3 and maintain metrics on Cloudwatch. Also built a
Docker image for deployment. Worked with NLP models like spacy, standford NLP.
Technology: Python, Flask, S3, ECS, Docker, RDS, Cloudwatch, Terraform
2015 September- Accenture
2019July
1. Client: Lloyd Banking Group. Worked on distcp and reconcillation scrpits.Worked on
Ingesting data into Hadoop data lake using file based and sqoop based ingestion.
Worked with Spark-Core scripts for ingesting data. Scheduled the jobs on Oozie
coordinator. Creation of Hive table with required schema. Worked on Shell scripting
for implementing spark-submit code for ingestion. Sqoop--Imported data into HDFS
via command line from edge. Check yarn logs for RCA in case on failure. Logged work
on JIRA, worked on GIT.
Technology:Hadoop, Hive, Sqoop , Spark(Basic), Shell Scripting, Putty, WinSCP , Unix
2. OSS - AMS . Developing API and UI components using JDA agile Business Process
Platform. Creating scheduling Jobs for running the APIs from the backend during
batch using IBM Autosys Platform. Preparing technical documents for a given CR.
Documentation of functional knowledge of the application to enable knowledge
sharing. Ensuring ProperCommunication with the clients and all stakeholders
Technology: JDA Agile business process platform, JDA Order Sequencing and Sloting, IBM
Autosys
A CADEMIC SCORE
May 2016 Bachelor of Technology in Computer Science, KIIT University, Odisha, CGPA: 7.65/10.
H +(91)-977-67-43382
Surbhi Singh
B singh.surbhicse@gmail.com
SUMMAR y
AnITprofessionalwith4+Yearsofworkexperienceinsoftwaredevelopment, analytics
and big data domain, having experience in end to end application development with
addedhands- onexperienceinsolvingbigdataproblemsanddoananalysistoprovidea
better data driven solution out of it.
Agile - Team player - Passionate to learn - Adaptive person.
INTERESTS
Data Pipeline (End to End), Data Engineering, Text mining
SKILLS
Programming Python Scripting Shell Scripting
Database SQL, MySql, RDS Big Data Hadoop, PySpark, Hive, Sqoop, Pandas
Cloud AWS OS Windows, Linux
WebApp Flask Other Docker, Databricks, Terraform
EXPERIENCE
2019 Oct- Gartner Inc ,Role: Data Analytics Engineer
Present
2016 May- Accenture
Role: Application Development Analyst
2019Sep
ROLES & R ESPONSIBILITIES
- As a developer and data analytics engineer, currently taking responsibility in
designing/developing end-to-end data pipelines and orchestrate the entire workflow
foradatascience/dataengineering projects.
- Architecting and developing data driven solutions, tools and dashboards for
stakeholders in various domains including HR, Finance and Marketing.
- Worked for the design and development of a data analytics platform in Python that
includestoolsformultipledatasciencetaskslikesentiment analysis, recommendation
systems and NLP techniques bycollaborating with data scientist.
- Experience with end-to-end integration of models with databases and storage
systems like HDFS, Hive andSQL.
- Following agile methodologies like Scrum and maintaining detailed technical
documentations.
- Daily interaction with business stakeholders and meeting their expectations with the
right deliverable and demo presentation for everyspirit.
PrOJECTS
2019 Oct - Gartner Inc
Present
1. Early Risk Detection System Developed end-to-end data pipeline for Data Science
model to detect risk on membership using Sqoop, SQLAlchemy, Pyspark, Python on
AWS Platform(EC2, S3, EMR). Interacted with stake-holders and data scientists to
understand their requirement and built and implemented a solution around the
same.
Technology: Python, Pyspark, Sqoop, Hive, Shell Scripting, Pandas, AWS, Databricks
2. HR Biodata Worked closely with Data Scientists to build a data pipeline for Resume
Scoring Model. Worked on Flask API to get resume and candidate data from workday
and store the same on RDS and S3 and maintain metrics on Cloudwatch. Also built a
Docker image for deployment. Worked with NLP models like spacy, standford NLP.
Technology: Python, Flask, S3, ECS, Docker, RDS, Cloudwatch, Terraform
2015 September- Accenture
2019July
1. Client: Lloyd Banking Group. Worked on distcp and reconcillation scrpits.Worked on
Ingesting data into Hadoop data lake using file based and sqoop based ingestion.
Worked with Spark-Core scripts for ingesting data. Scheduled the jobs on Oozie
coordinator. Creation of Hive table with required schema. Worked on Shell scripting
for implementing spark-submit code for ingestion. Sqoop--Imported data into HDFS
via command line from edge. Check yarn logs for RCA in case on failure. Logged work
on JIRA, worked on GIT.
Technology:Hadoop, Hive, Sqoop , Spark(Basic), Shell Scripting, Putty, WinSCP , Unix
2. OSS - AMS . Developing API and UI components using JDA agile Business Process
Platform. Creating scheduling Jobs for running the APIs from the backend during
batch using IBM Autosys Platform. Preparing technical documents for a given CR.
Documentation of functional knowledge of the application to enable knowledge
sharing. Ensuring ProperCommunication with the clients and all stakeholders
Technology: JDA Agile business process platform, JDA Order Sequencing and Sloting, IBM
Autosys
A CADEMIC SCORE
May 2016 Bachelor of Technology in Computer Science, KIIT University, Odisha, CGPA: 7.65/10.