Architecting and Engineering Data at Scale
Now StreamingAround six years of architecting and building fault-tolerant, event-driven systems that process millions of transactions across global markets. Currently at Tiger Analytics, developing production solutions for Fortune 500 clients.

01 / The Architect
I build the systems that move data — reliably, at scale, under pressure.
My career started on a factory floor at Bosch, where I learned that data isn't abstract. It's a sensor on a manufacturing line, an alert at 3 AM, the difference between $150K saved or lost. That perspective never left.
Six years later, I'm architecting serverless event-driven pipelines that handle 8,000 concurrent executions with sub-second latency. I've migrated databases from PostgreSQL to DynamoDB without losing a transaction, built ML registries that compress 14-day deployments into 2, and engineered platforms that 50+ data scientists rely on daily.
The best infrastructure is the kind nobody notices — until it isn't there. I obsess over the boring, hard parts: idempotency, fault tolerance, auditability, the 99.9th percentile. That's where reliability lives. That's the work I love.
~ / siddhartha.kumar — zsh
02 / Impact
Numbers that ship.
Every metric below ran in production. None of them are estimates.
Concurrent Executions
Step Functions orchestrating real-time POS data
Message Reliability
Across 15+ international markets, sub-3-min SLA
Faster Deployments
ML model rollout: 14 days → 2 days
Annual Savings
Reduced MTBF by 20%, MTTR by 25% at Bosch
03 / Featured Systems
Production-grade infrastructure,
built for the long run.
A selection of systems I've architected, migrated, or rebuilt from scratch. Click any card for the full architecture breakdown.
Production Systems
Mission-critical infrastructure running in production today.
Internal Platforms
Reusable infrastructure that compounds across teams.
Foundations
Earlier work that shaped the systems thinking.
04 / Trajectory
The path so far.
2022 – Present
Data Engineer
Currently hereTiger Analytics India Consulting Pvt. Ltd.
Bengaluru, India
- Architected and built POS pipeline at 8,000+ concurrent executions
- Led PostgreSQL → DynamoDB migration (95% latency cut)
- Built ML Model Repository for pharma client (85% faster deployments)
- Engineered multi-tenant portal for 10+ enterprise clients
- Designed fault-tolerant Lambda functions with 99.9% reliability
- Automated marketing-mix workflows (90% manual intervention reduced)
2021 – 2022
Trainee Data Engineer
Futurense Technologies Pvt. Ltd.
Bengaluru, India
- Built distributed ETL POCs on AWS EMR + PySpark (500GB+ workloads)
- Created real-time Tableau dashboards (65% faster reporting)
2019 – 2020
Graduate Apprentice — Data Analytics
Bosch Limited (Robert Bosch India)
Jaipur, India
- Digitized executive KPI reporting (70% cycle reduction)
- IoT root cause analysis: 20% MTBF reduction, $150K+ saved annually
- Reduced MTTR by 25% via predictive maintenance insights
05 / Tech Arsenal
The tools I trust in production.
Everything below has shipped in real systems — not just listed for keywords.
Languages & Frameworks
Daily-use, production-tested
Python
Expert
SQL
Expert
PySpark
Expert
Pandas
Expert
NumPy
Advanced
Boto3
Expert
Flask
Advanced
REST APIs
Expert
Shell Scripting
Advanced
AWS Cloud
Primary cloud — production architecture
Lambda
Expert
Step Functions
Expert
DynamoDB
Expert
S3
Expert
Athena
Advanced
Glue
Advanced
EMR
Advanced
EC2
Advanced
API Gateway
Advanced
CloudWatch
Expert
EventBridge
Advanced
SNS / SQS
Advanced
IAM
Advanced
Data Platforms
Storage & query engines
Snowflake
Expert
PostgreSQL
Advanced
DynamoDB
Expert
Hive
Proficient
Data Lakes
Advanced
Data Warehouses
Advanced
Distributed Systems
Architecture patterns & frameworks
Apache Spark
Expert
Apache Airflow
Expert
Event-Driven Architecture
Expert
Microservices
Advanced
Message Queues
Advanced
Data Engineering
Patterns & methodologies
ETL / ELT Pipelines
Expert
Dimensional Modeling
Advanced
OLAP
Advanced
SCD Type 2
Advanced
Schema Design
Expert
Partitioning
Expert
Indexing
Advanced
Data Quality
Advanced
DevOps & Tooling
Build, ship, observe
Docker
Advanced
Git
Expert
Azure DevOps
Advanced
CI/CD
Advanced
Terraform
Proficient
Grafana
Advanced
JIRA
Expert
Agile / Scrum
Expert
Analytics & BI
Visualization & analytical platforms
Power BI
Advanced
Tableau
Advanced
Databricks
Expert
Dataiku DSS
Advanced
06 / Credentials
Validated, not just claimed.
Astronomer Champions Program
Astronomer
Recognized by Astronomer for expertise and contributions to the Apache Airflow community. One of a select group invited annually.
AWS Certified Cloud Practitioner
Amazon Web Services
Microsoft Azure Fundamentals (AZ-900)
Microsoft
Microsoft Azure Data Fundamentals (DP-900)
Microsoft
Microsoft Azure AI Fundamentals (AI-900)
Microsoft
Astronomer Airflow 3 Fundamentals & DAG Authoring
Astronomer
Databricks Generative AI Fundamentals
Databricks
Google Generative AI Fundamentals
07 / Open Source
Code in the open.
Live activity from my GitHub. The work I'm building, learning, and shipping in public.
5
Repos
0
Stars
0
Followers
Today
Last push
Top Languages
Featured Repositories
View allprofessional-portfolio
My portfolio
claude-model-proxy
Access all Ollama cloud models through Claude code desktop application
ML-with-Python
Machine learning experiments, notebooks, and exploratory analysis.
Applied-Data-Science-Capstone
Applied data science project — analysis, modeling, and insights.
08 / Direct Line
Let's build something
that scales.
Working on infrastructure that needs to be fast, fault-tolerant, and audited? That's the kind of problem I solve. Reach out — I read everything.