Girish M S

Machine Learning Engineer & Data Engineer

I build intelligent systems and scalable data pipelines that transform raw data into actionable insights.

About

I'm a passionate Machine Learning Engineer and Data Engineer with expertise in building intelligent AI systems and scalable data pipelines. My work focuses on transforming complex data into actionable insights and deploying production-ready ML solutions.

Currently, I specialize in Large Language Models (LLMs), Natural Language Processing, and MLOps. I have extensive experience with cloud platforms (AWS, Azure), building data engineering solutions with technologies like Apache Spark, Databricks, and Informatica.

My expertise spans the full ML lifecycle: from data engineering and model development to deployment and monitoring. I've worked on projects involving predictive modeling, computer vision, recommendation systems, and generative AI applications.

When I'm not coding, you can find me exploring the latest advancements in AI research, contributing to open-source projects, or sharing knowledge through technical writing and mentoring.

Experience

  1. 2023 — Present

    Lead the development and deployment of machine learning models for production systems. Build scalable ML pipelines using MLOps best practices. Mentor junior engineers and drive technical decisions for AI initiatives.

    • Python
    • TensorFlow
    • PyTorch
    • AWS SageMaker
    • MLflow
    • Kubernetes
    • Docker
  2. 2021 — 2023

    Designed and implemented large-scale data pipelines processing terabytes of data daily. Built ETL workflows using Apache Spark and Databricks. Optimized data warehouse performance and reduced query latency by 60%.

    • Apache Spark
    • Databricks
    • AWS
    • Python
    • SQL
    • Airflow
    • Informatica
  3. 2020 — 2021

    Developed computer vision models for object detection and image classification. Implemented NLP solutions for text analysis and sentiment classification. Collaborated with product team to integrate ML features into production applications.

    • Python
    • TensorFlow
    • OpenCV
    • NLTK
    • FastAPI
    • PostgreSQL
  4. 2019 — 2020

    Performed exploratory data analysis and created interactive dashboards for business intelligence. Built predictive models for customer churn and revenue forecasting. Automated reporting processes using Python and SQL.

    • Python
    • SQL
    • Tableau
    • Power BI
    • Pandas
    • Scikit-learn

Projects

  • AI-Powered Chatbot with RAG

    Built an intelligent conversational AI system using Retrieval-Augmented Generation (RAG) with LangChain and OpenAI. Implemented vector databases for efficient document retrieval and context-aware responses. Deployed on AWS with scalable infrastructure handling 1000+ daily queries.

    • Python
    • LangChain
    • OpenAI
    • Pinecone
    • FastAPI
    • Docker
    • AWS
    Featured Project
  • Real-time Data Pipeline

    Designed and implemented a real-time data processing pipeline using Apache Kafka and Spark Streaming. Processed millions of events per day with sub-second latency. Built monitoring dashboards and alerting systems for pipeline health.

    • Apache Kafka
    • Spark Streaming
    • AWS
    • Python
    • Scala
    • Grafana
    Featured Project
  • Computer Vision for Quality Control

    Developed a deep learning model for automated defect detection in manufacturing. Achieved 98% accuracy using custom CNN architecture. Reduced quality inspection time by 75% and deployed to edge devices for real-time processing.

    • Python
    • TensorFlow
    • OpenCV
    • ONNX
    • Flask
    • Raspberry Pi
    Featured Project
  • Recommendation Engine

    Built a hybrid recommendation system combining collaborative filtering and content-based approaches. Implemented using matrix factorization and neural networks. Increased user engagement by 40% and improved click-through rates.

    • Python
    • TensorFlow
    • Scikit-learn
    • Redis
    • PostgreSQL
    • Docker
    Featured Project
  • NLP Sentiment Analysis Platform

    Created a multi-language sentiment analysis system for social media monitoring. Fine-tuned BERT models for domain-specific classification. Processed 100K+ posts daily with real-time sentiment tracking and trend analysis.

    • Python
    • Transformers
    • BERT
    • FastAPI
    • MongoDB
    • Kubernetes
    Featured Project
  • MLOps Platform

    Established end-to-end MLOps infrastructure for model training, deployment, and monitoring. Implemented CI/CD pipelines for ML models with automated testing and validation. Reduced model deployment time from weeks to hours.

    • MLflow
    • Kubeflow
    • Jenkins
    • Docker
    • Kubernetes
    • AWS
    • Python
    Featured Project

Certifications & Awards

Certifications

  • AWS Certified AI Practitioner

    Amazon Web Services

    2024
  • Informatica CDI R41 Professional Certification

    Informatica

    2024
  • IBM Data Science Professional Certificate

    IBM

    2024
  • Applied Data Science Capstone Specialization

    Coursera

    2024
  • DataBricks Fundamentals

    Databricks

    2024
  • Intel AI Specialization

    Intel

    2024

Awards & Recognition

  • MVP Award

    Informatica

    2024
  • PC Admin Certification

    Informatica

    2024
  • Volvo Recognition Award

    Volvo

    2024
  • Texpo Certificate

    Tech Expo

    2024