Md Shohanur Islam Sobuj
Machine Learning Engineer · MLOps · LLMs · Multimodal AI
Professional Summary
Machine Learning Engineer with 6+ years delivering production ML systems across MLOps, LLMs, RAG, multimodal AI, NLP, and computer vision. Currently at Anymate Me GmbH in Köln, building an agentic slide-to-video pipeline (PDF/PPTX → RAG agent → TTS → avatar lip-sync) with p95 latency under 3 minutes and 99.4% uptime. Published researcher with 12 peer-reviewed papers across Nature, IEEE, ACL, and NeurIPS venues — 225+ Google Scholar citations, h-index 8.
Work Experience
Machine Learning Engineer
Anymate Me GmbH (Current)
- Architected end-to-end agentic slide-to-video pipeline: PDF/PPTX → RAG content agent → TTS → avatar lip-sync, achieving p95 latency < 3 min for 10-slide decks
- Built PPTX content rewriting agent using LLM + RAG, enabling users to restructure and localise presentation content without manual editing
- Implemented groundedness validation (DeBERTa-v3 NLI) on generated scripts, maintaining a 96.2% auto-approval rate in production
- Designed MLOps pipelines on GCP — model versioning in MLflow, canary deploys, automated rollback on metric regression
- Reduced pipeline error rate to < 4% through stage-level observability (Grafana + Loki) and per-job retry logic
Machine Learning Engineer
Business Automation Ltd.
- Implemented a Change Data Capture (CDC) pipeline with MySQL, Debezium, Apache Kafka, and ZooKeeper, enabling real-time data sync across microservices
- Developed SmartRemarks NLP system for automated content analysis and sentiment classification, handling 10K+ requests/day in production
- Built OCR-based TIN certificate validation system using computer vision, achieving 95%+ extraction accuracy and reducing manual processing time by 80%
- Designed event-driven microservices architecture for real-time ML inference, standardising deployment workflows across environments
- Led migration of ML-backed services to containerised deployments (Docker + Kubernetes)
Machine Learning Engineer
Anchorblock Technology LLC
- Built distributed ML infrastructure handling millions of data points daily with auto-scaling on AWS
- Implemented CI/CD pipelines for ML model deployment using GitHub Actions — reduced deployment time by 60%
- Developed conversational AI systems and LLM-powered chatbots for enterprise clients using RAG architecture
- Designed and maintained REST APIs for seamless ML model integration into client products
- Led technical architecture decisions for distributed computing systems across 3 concurrent client projects
Machine Learning Engineer
Fiverr (Freelance)
- Completed 50+ ML projects for clients across Europe, North America, and Asia
- Specialised in NLP (text classification, sentiment analysis, named entity recognition) and computer vision (object detection, image classification)
- Built end-to-end data pipelines and predictive models for e-commerce, healthcare, and finance clients
Education
B. Sc. (Engineering) in Electrical and Electronic Engineering
Hajee Mohammad Danesh Science and Technology University (HSTU)
- Research focus: Natural Language Processing, Deep Learning, Bangla language models
- Published 4 research papers during undergraduate studies
Technical Skills
Publications & Research
Md Kowsher, Md Shohanur Islam Sobuj, Nusrat Jahan Prottasha, et al.
Nusrat Jahan Prottasha, ..., Md Shohanur Islam Sobuj, ..., Md Kowsher, et al.
Mohammad Majbah Uddin, Md Shohanur Islam Sobuj
Md Shohanur Islam Sobuj, et al.
Nusrat Jahan Prottasha, Asif Mahmud, Md Shohanur Islam Sobuj, et al.
Md Shohanur Islam Sobuj, Md Imran Hossen, Md Foysal Mahmud, Mahbub Ul Islam Khan
Md Kowsher, Md Shohanur Islam Sobuj, Asif Mahmud, Nusrat Jahan Prottasha, Prakash Bhat
Md Kowsher, Md Shohanur Islam Sobuj, Nusrat Jahan Prottasha, Mohammad Shamsul Arefin, Yasuhiko Morimoto
Md Kowsher, Md Shohanur Islam Sobuj, et al.
Md Ferdous Wahid, Md Fahim Shahriar, Md Shohanur Islam Sobuj
Md Shohanur Islam Sobuj, et al.
Md Shohanur Islam Sobuj, et al.