(Updated on: 26/07/2024) Hi there! I'm

Abhishek Kumar

Are you looking for a creative and focused person that loves to take initiative? Look no further

I am a Machine Learning Engineer with a strong background in Computer Science and Artificial Intelligence. I am proficient in developing and implementing deep learning algorithms for a variety of applications, including text and image classification, object detection, image segmentation, and generation using diffusion and generative models.

I am seeking new roles as an applied scientist to leverage my expertise in solving complex real-world problems.

I am a Senior Data Scientist with 4.5 years of experience in machine learning, data science, and software engineering with a Master’s degree in Computer Science & Engineering specializing in artificial intelligence (computer vision and NLP). I build most frequently with Python, C++, and PyTorch. I am currently building a conversational chatbot using generative AI that provides transactional insights and personalized financial advisory based on users’ spending and earning patterns. I am seeking new roles as an applied scientist to leverage my expertise in solving complex real-world problems Here are a few technologies I've been working with recently:
  • Python
  • C++
  • Pytorch
  • NumPy
  • OpenCV
  • Langchain
  • AWS
  • Docker
  • Git
  • Linux

Experience

Senior Data Scientist - Mobikwik
Feb 2024 - Present

Gurgaon, Haryana

  • Developed a financial advisory chatbot based on user transactional patterns using Langchain, MongoDB, Django, and LLMs.
  • Created a multiclass scope checker module that classifies questions and provides standard responses for out-of-scope questions.
  • Developed a RAG pipeline with a question2mongo query generation module that prompts LLM models with extracted information from question and parameter mappings to generate mongo query required to retrieve custom user data.
  • Utilized Weaviate DB as a caching system to cache templated queries and metadata to lower system latency and reduce API cost.
  • Added few shot prompting and user-level feedback to improve query generation accuracy.
  • Designed a text manipulator system that takes in users' transactional categories, subcategories, banks, payment modes, and merchants to create similar follow-up questions for user engagement.
  • Built a state machine system that routes user questions to different modules based on question intent and conversation context.
  • Integrated Grafana, New Relic, and Prometheus for logging and monitoring API requests and responses.
Computer Vision Researcher - A2IL
Sept 2022 - Feb 2024

Buffalo, New York

  • Created a text+pose guided image generation system using latent diffusion models (ControlNet with Stable diffusion weights).
  • Added noise and performed latent interpolation on best pair human face images to generate high-resolution morphs.
  • Generated images at different noise levels and selected the best candidate by computing CLIP similarity scores for fixed prompts.
  • Developed an image manipulation detection system to detect, localize, and label tampered parts in news articles.
  • Trained a CNN model with a Resnet-50 backbone to detect jpeg compression errors in tampered images with 93% accuracy.
  • Utilized NEDB-Net to extract noise and edge-based features to localize and get manipulation masks for the tampered regions.
  • Fine-tuned a custom Yolo-v8 model to detect objects in the localized regions and classify them into 18 categories.
  • Created analytic containers to detect image text inconsistencies in multimodal news articles.
  • Developed a text-transformer tool to perform controlled named entity and parts of speech replacements in texts using SpaCy.
  • Extracted text from images using Paddle OCR and developed contradiction and mismatch detection algorithms to detect inconsistencies between image text and news body.
Systems Engineer - Infosys
Feb 2018 - Aug 2020

Bhubaneswar, India

  • Trained random forest model to classify dependent and independent parts for refrigeration systems with 97% accuracy.
  • Developed a module to predict part prices using XGBoost regressor and integrated with CUMMINS part pricing system.
  • Built API service using Django REST Framework to parse delegation data from files received in hourly batches.
  • Developed employee task delegation system to manage production workflows and approvals using python.
  • Reduced batch data transfer failure rate by 20% through automation and monitoring of batch jobs using RPA and Appworx.
  • Managed deployment and maintenance pipelines for system availability and reliability using Jenkins and Git/Github.

Education

2021 - 2023
Masters in Computer Science (Machine Learning and Computer Vision)
University at Buffalo, New York
GPA: 3.72 out of 4.0
  • Worked as a Computer Vision Research Assistant in the IAD lab under Dr.David Doermann.
  • Built analytic components for SemaFor (DARPA) as part of the UB-SRI team.
  • Researched and wrote my thesis on : Forensic Systems to Detected Manipulated News Media.
  • Was a Teaching Assistant for graduate level courses (CSE 701/702), sports video analytics at University at Buffalo.
  • Guided students to develop deep learning projects for video analysis.
  • Courses : Deep Learing, Computer Vision and Image Processing, Machine Learning, Information Retrieval, Neurosymbolic AI (seminar), Distributed Systems, Analysis and Design of Algorithms.

Extracurricular Activities

  • Ranked 3rd in UB Hack (org. by MLH) for our Decentralized News Platform
  • Ranked 7th in Soccernet 2022 competition for our work on soccer player re-identification in broadcast videos.
  • Ranked 5th in Soccernet 2024 for our work on soccer Temporal Action Spotting.
2013 - 2017
Bachelor of Technology in Mechanical Engineering
SRM University, Chennai
GPA: 3.94 out of 4
  • Researched on carbon nano-tube infused phase change materials to create efficient cooling fluids.
  • Relevant Courses: Robotics, Probability and Statistics, Linear Algebra, Numerical Methods, Matlab, Advanced Calculus.

Extracurricular Activities

  • Won 2nd Prize in the annual tech fest (Aaruush'14) for our Infrared based line follower robot.
  • Active member of the photography club.

Projects

Temporal Action Spotting
Temporal Action Spotting
Implemented a transformer based model with multiscale flow and RGB features to classify actions and identify temporal boundaries for 17 action classes. Achieved 52 % mAP and ranked 5th in Soccernet competition.
Text-to-Image using Residual GANs
Text-to-Image using Residual GANs
Developed a Deep Residual GAN network to create images from latent noise. Used BIRD CUB-200 dataset to train the model.
Image Denoiser
Image Denoiser
Developed a CNN model in PyTorch that uses residual learning on deep CNNs to remove noise from iamges.
Player Re-Identification
Player Re-Identification
Designed a dual branch deep learning model to re-identify soccer players across multiple camera viewpoints depicting the same action during the game. Fused appearence and body part features from a subnet of OpenPose using Compact bilinear pooling to get the fused features.
Splice Detector
Splice Detector
A Deep Convolutional Neural Network model to detect manipulated images using JPEG compression analysis.

Get In Touch

Drop an email for any questions. My inbox is always open !