ABOUT ME
Expert Data Engineer with 9+ years of Data Engineering experience. Spearheaded Hadoop-to-AWS cloud transformations, harnessing S3, EMR, Glue, and Redshift for cutting-edge data solutions; renowned for elevating analytics and operational efficiency.
● Transitioned Hadoop-based data ecosystems to AWS, using S3 for durable storage, EMR for Map Reduce and Spark processing, and Glue for data cataloging and ETL jobs.
● Deep expertise in big data architectures with a focus on AWS services like EMR, which provides a managed Hadoop framework, and leveraging YARN on EMR for resource management.
● Skilled in data migration using AWS DMS and SCT to efficiently move databases to AWS, replacing the traditional use of Scoop.
● Developed Lambda functions for automated data transfer and processing, serving as a more scalable alternative to Hadoop Map Reduce jobs.
● Proficient in data warehousing using AWS Redshift, ensuring optimized query performance and efficient data management.
● Expert in setting up AWS infrastructure for big data applications, ensuring high availability and disaster recovery using AWS services.
● Established secure and multi-tenant data environments on AWS, implementing IAM roles and policies for fine-grained access control.
● Utilized AWS Kinesis for real-time data streaming into AWS ecosystem, replacing traditional messaging systems.
● Familiar with AWS Step Functions and SWF to manage distributed applications, replacing the need for Hadoop Ozzie workflows.
● Monitored AWS environments using CloudWatch, ensuring high performance and reliability of big data applications.
● Executed real-time data analytics by harnessing the power of AWS Kinesis Analytics, providing insights from streaming data.
● Integrated AWS services with Databricks for enhanced data science and analytics capabilities.
● Embraced Agile and Scrum methodologies in cloud-based big data projects, ensuring continuous integration and delivery using AWS DevOps tools.
● Implemented data governance and classification on AWS, ensuring compliance with data privacy and protection standards.
● Adopted AWS Athena for serverless querying, enabling quick analysis of data stored in S3 without the need for complex ETL workflows.
● Integrated AWS CloudFormation and Terraform for infrastructure as code (IaC) practices, ensuring reproducible and maintainable cloud environments.
● Expertise in data orchestration using AWS Data Pipeline for regular data movement and processing tasks, enhancing operational efficiency.
● Implemented AWS security best practices, utilizing IAM for granular access control, KMS for encryption management, and VPC for network isolation.
EXPERIENCE
Platform Engineer/ Data Engineer
Deutsche Bank, New York, NY October2020 – August 2022
Sit amet luctussd fav venenatis, lectus magna fringilla inis urna, porttitor rhoncus dolor purus non enim praesent in elementum sahas facilisis leo, vel fringilla est ullamcorper eget nulla facilisi etisam dignissim diam quis enim lobortis viverra orci sagittis eu volutpat odio facilisis mauris sit.
Data Engineer
Health First, New York, NY
Sit amet luctussd fav venenatis, lectus magna fringilla inis urna, porttitor rhoncus dolor purus non enim praesent in elementum sahas facilisis leo, vel fringilla est ullamcorper eget nulla facilisi etisam dignissim diam quis enim lobortis viverra orci sagittis eu volutpat odio facilisis mauris sit.
EDUCATION
2001 - 2005
Bachelor of Science in Computer Science & Engineering (CSE)
2005, Southern University, Bangladesh
2005- 2007
Master Degree in Designing
University of Texas
SKILLS: SQL, Apache Hive, Apache Spark (Databricks, Jupiter Notebook), Anaconda, Python (Django, Pandas, Scikit- learn, Matplotlib) Time Series Forecasting, A/B testing, Bayesian methods, Tableau, Power BI, Java, Data Visualization, Analytical Skills, Cost Accounting, Corporate Finance Knowledge, Statistical Analysis.
85%
JavaScript
Non enim praesent
92%
Figma
Non enim praesent
81%
React
Non enim praesent
78%
Python
Non enim praesent
90%
WordPress
Non enim praesent
87%
Adobe XD
Non enim praesent
AWARDS
14 May 2020
Bluebase
Non enim praesent
26 June 2018
Demble
Non enim praesent