Skip to content
View aksingh4545's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report aksingh4545

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
aksingh4545/README.md

Hi πŸ‘‹, I'm Ankit Kumar Singh

Data Engineer | Cloud & Analytics | Building calm, reliable data systems

I enjoy designing data pipelines that are simple, scalable, and easy to reason about.


πŸ‘‹ A bit about me

I work at the intersection of data engineering and cloud platforms.
Most days, I am building pipelines, cleaning messy data, or learning how large systems behave at scale.

I like systems that are:

  • predictable
  • observable
  • easy to maintain

Silence helps me focus. Clean logs make me happy.


🧠 Skills & Technologies

Core stack

Web & scripting (used when needed)

Cloud & tools


πŸ”­ What I’m working on right now

  • Streamlit based data apps connected to AWS S3
  • Batch style pipelines using Python and SQL
  • Exploring Azure Databricks, PySpark, Kafka for distributed data processing

🌱 Currently learning

  • Data modeling for analytics workloads
  • Spark internals and performance tuning
  • Event-driven pipelines and message queues
  • Writing clearer documentation for data systems

🀝 Connect with me


πŸ“Š GitHub activity

Popular repositories Loading

  1. image_resize image_resize Public

    This project implements an event-driven, serverless image processing pipeline on AWS. Images uploaded to Amazon S3 are automatically resized using AWS Lambda and Pillow, stored in a destination buc…

    Python 4 1

  2. streamlit_s3_pipeline streamlit_s3_pipeline Public

    The system supports real-world resumes (PDF, DOCX, TXT), handles noisy formats, and follows industry-grade data engineering practices.

    Python 3 1

  3. practice2 practice2 Public

    JavaScript 2

  4. Login_Cognito Login_Cognito Public

    This repo about how to use AWS Congito fully managed services with streamlit application.

    Python 2 2

  5. Event_hub Event_hub Public

    Azure Event Hub -β†’ ADLS -β†’ Databricks -β†’ Delta -β†’ Cosmos DB . The goal of this system is to ingest real-time events, process them reliably using event-time semantics, and serve analytics-ready resu…

    2

  6. Minor_Project Minor_Project Public

    HTML 1