VENU BABU TELLAGORLA

Data Engineer | Cloud Data Architect
Remote/Flexible, US.

About

Highly accomplished Data Engineer and Cloud Data Architect with over two years of experience in designing, building, and optimizing scalable data pipelines, real-time analytics platforms, and AI/ML systems across GCP, AWS, and Azure. Proven ability to drive significant improvements in data delivery speed, cost reduction, and system performance, achieving up to 99.9% pipeline uptime and 80% performance gains. Adept at leveraging advanced analytics, big data engineering, and Agile methodologies to deliver measurable business results and enhance operational efficiency.

Work

Excel IT Corp
|

Data Engineer

Summary

Led data engineering initiatives for an AI-Powered Analytics Platform, focusing on scalable data pipelines, real-time analytics, and machine learning model integration to enhance financial intelligence.

Highlights

Engineered scalable data pipelines processing 5M+ daily financial records using Python, PySpark, Kafka, and BigQuery, boosting data delivery speed by 40% for real-time trading analytics and business intelligence.

Developed end-to-end machine learning models and workflows with TensorFlow, PyTorch, and Scikit-learn, improving model accuracy by 8% and training pipeline performance by 35%.

Implemented real-time data processing frameworks (Kafka, Pub/Sub, Apache Flink, Kinesis) achieving sub-second latency for critical trading alerts, compliance monitoring, and event streaming.

Architected cloud-native data lakehouses using Delta Lake, Apache Iceberg, and Databricks, enabling unified batch/streaming analytics with ACID transactions for financial audit requirements.

Automated CI/CD pipelines with GitHub Actions and Azure DevOps, achieving 99.9% pipeline uptime and 35% faster release cycles while reducing manual processing time by 40% for billing and revenue systems.

Cognizant Technology Solutions
|

ETL Developer

Summary

Designed and implemented robust ETL/ELT processes and data solutions for healthcare clients, optimizing data integration and ensuring HIPAA compliance across multi-cloud environments.

Highlights

Built hybrid ETL/ELT processes across AWS (S3, Glue, Lambda) and GCP (BigQuery, Dataflow) using Python, PySpark, and Apache Airflow, processing terabyte-scale datasets with HIPAA compliance.

Developed real-time clinical alert systems using Kafka event streams and AWS Lambda, reducing alert latency by 80% and supporting critical patient care workflows.

Implemented big data engineering solutions with Apache Spark and Flink for unstructured data processing, achieving 45% cost reduction through performance optimization.

Created comprehensive data warehouse using dimensional modeling (Kimball approach) with star and snowflake schemas, enabling 100+ analysts to perform advanced analytics.

Led Agile development initiatives as Scrum Master, delivering 15+ data projects with zero production incidents and establishing data governance practices for healthcare industry compliance.

Education

Indiana Wesleyan University

M.S.

Information Technology and Project Management

Certificates

Google Cloud Platform Certification

Issued By

Google Cloud

Microsoft Azure Data Engineer Associate

Issued By

Microsoft

Skills

Programming Languages

Python (Django, Flask, Pandas, NumPy), SQL, PySpark, Scala, R, Java, JavaScript, Bash.

Data Engineering & ETL

Apache Spark, Apache Kafka, Apache Flink, Apache Airflow, dbt, Informatica, DataStage, Apache Beam, Apache NiFi.

Cloud Platforms

Google Cloud Platform (BigQuery, Pub/Sub, Cloud Functions, Dataflow), AWS (S3, Lambda, Glue, RDS, Kinesis, EMR, SQS, SNS, QuickSight), Azure (Data Factory, Synapse, Databricks, Event Hubs, ML Studio).

Big Data & Streaming

Hadoop, Hive, Snowflake, Redshift, Presto, Delta Lake, Event Streaming, Real-time Processing.

Databases & Data Storage

BigQuery, Snowflake, Redshift, SQL Server, Oracle, Teradata, MySQL, PostgreSQL, MongoDB, Cosmos DB, DynamoDB, Cassandra.

Machine Learning & AI

TensorFlow, PyTorch, Scikit-learn, Azure ML Studio, MLflow, Kubeflow, Hugging Face, Microsoft Copilot, ChatGPT, Gemini.

Analytics & Visualization

Tableau, Power BI, Grafana, Splunk, OpenSearch, Excel, Data Visualization, Statistical Modeling.

Data Modeling

Kimball Methodology, Star Schema, Snowflake Schema, Dimensional Modeling, Data Warehouse Modeling, SCD Types.

Data Governance & Quality

Apache Atlas, Atlan, Azure Purview, Great Expectations, Monte Carlo, Data Lineage Tracking, GDPR, HIPAA, SOX, Data Privacy, Compliance Automation.

DevOps & Infrastructure

Docker, Kubernetes, Terraform, CI/CD, GitHub Actions, Jenkins, Azure DevOps, Infrastructure as Code.

Operating Systems

Windows, Linux.

Methodologies

Agile, Scrum, DevOps, DataOps, MLOps, Cross-functional Environment, SDLC, Data Mesh Architecture.