😁
Open to Full-time roles in Machine Learning
s23deepak/README.md

Deepak Swaminathan | AI Engineer

AI Engineer with 3+ years of experience building proprietary middleware and agentic workflows to drive operational efficiency. Specialized in Generative AI fine-tuning (RLHF, GRPO, LoRA) and deploying high-throughput, cloud-native solutions on GCP/Vertex AI and AWS.


📊 GitHub Stats

Deepak's GitHub stats Top Langs


🛠 Technical Skills

  • AI & Agentic Systems: LangGraph (State Graphs), CrewAI, Prompt Engineering, RAG Pipelines (LangChain), Agentic Feedback Loops, GitHub Copilot
  • Fine-Tuning & Eval: RLHF, GRPO, LoRA/QLoRA, SFT, LLM Evaluation (BLEU, ROUGE)
  • Engineering & Ops: FastAPI, SQL, Streamlit, React, Docker, Git, GitHub Actions
  • Clinical & Compliance: HIPAA, GDPR/ENISA, RBAC, FHIR Standards, DICOM
  • Cloud & Hardware: GCP (Vertex AI), AWS, NVIDIA H100, AMD MI300X, vLLM, PySpark

🚀 Featured Projects

  • Built a multimodal assistant using MedGemma 4B for real-time medical image analysis (X-ray, MRI) and SOAP note generation.
  • Architected a HIPAA-compliant RAG pipeline for patient context retrieval using FHIR-compliant EHR data and encrypted DICOM images.
  • Developed a LangGraph-based state graph where multiple MedGemma 4B instances execute in parallel to compute independent differential diagnoses and inter-agent agreement.
  • Developed a fine-tuning framework at the intersection of SFT and GRPO using LoRA, Unsloth, and Hugging Face.
  • Optimized model inference using vLLM's PagedAttention and multi-GPU setups to ensure low-latency, real-time reasoning.
  • Developed a LangGraph workflow utilizing Chain-of-Thought (CoT) patterns and a RAG architecture to analyze complex soccer queries.
  • Used vLLM's PagedAttention and multi-GPU setups to ensure low-latency responses for real-time agentic reasoning.
  • Deployed a high-concurrency AlphaZero agent on Hugging Face Spaces.
  • Engineered the communication infrastructure to handle real-time state updates.
  • Optimized model inference for a web-based environment to serve complex AI models via serverless-style hosting.
  • Orchestrated a PySpark and BigQuery pipeline to process over 1 million images for real-time feature extraction.
  • Developed models for object detection, showcasing the ability to handle massive-scale image data pipelines.

💼 Professional Experience

Freelance | Machine Learning Specialist

  • Deployed a custom "Morning Briefing" multi-agent system using CrewAI to automate patient data synthesis and streamline clinical reporting.
  • Executed RLHF and GRPO fine-tuning on NVIDIA H100 and AMD MI300X GPUs to optimize model performance for clinical safety.

Sopra Steria | Machine Learning Engineer

  • Engineered a RAG pipeline using LangChain and ChromaDB to transform static GDPR/ENISA policy documents into interactive agentic interfaces.
  • Developed FastAPI-based middleware and BiLSTM models to automate the routing of 10,000+ monthly service tickets, reducing manual triage by 40%.

🎓 Education & Research

  • MS in Applied Data Analytics, Boston University (Jan 2024 - May 2025).
  • Research Assistant (Reinforcement Learning): Architected an open-source C++ package integrating LibTorch with OpenAI Gym for high-performance RL, including A2C and PPO.
  • Teaching Assistant (Big Data Analytics): Led sessions on deploying PySpark solutions across GCP and AWS for large-scale batch and streaming workloads.

📫 Connect with Me

Pinned

  1. s23deepak.github.io Public

    HTML

  2. MedGemma Public

    Multimodal Clinical Decision Support System powered by Google MedGemma 4B: artifact detection, clinical correlation, agentic tool routing, SOAP generation, and role-based clinical features

    Python 1

  3. Supervised-Reinforcement-Learning Public

    Implementation of Supervised Reinforcement Learning, a new fine-tuning method published by Google.

    Python 3 1

  4. AI-Football-Assister Public

    Just like a football assist leads to the perfect goal, AI-Football-Assister leads you to answers through intelligent research across trusted football databases.

    Jupyter Notebook

  5. AlphaZero-Parallelization Public

    AlphaZero with multiple implementations of different types of parallelization on a custom Gym environment.

    Jupyter Notebook

  6. Object-Detection-with-OpenImages Public

    Object detection on 1M images from Google's Open Images Dataset using Faster R-CNN and SGD

    Python