Rakesh Pal

Rakesh Pal

Data Scientist & ML/AI/Gen_AI Engineer

Open to Opportunities

Skilled professional specializing in Data Science, Machine Learning, Artificial Intelligence, Deep Learning, and Gen-AI

Professional Summary

4 Years Total Experience
Certified Data Scientist with 4 years of progressive experience in the IT industry, specializing in Data Science, AI/ML, Deep Learning, Gen-AI, Computer Vision,NLP, AWS and Statistics across government, Banking, Agriculture,ETV,Healthcare and Broadcasting sectors. Currently leading AI initiatives at IIT Madras Pravartak, developing enterprise-scale solutions including RAG-powered chatbots, TTS/ASR systems, and agentic AI frameworks.

Key Expertise: Designed and deployed 15+ production-grade AI systems including conversational AI agents using GPT, BERT, and Llama models etc.. with advanced RAG architectures. Built computer vision solutions achieving 80-85% accuracy in agricultural pest/disease/weed detection using YOLOv5/v8, ResNet, and U-Net. Implemented multi-lingual NLP systems with TTS/ASR capabilities for broadcasting and government applications. Proficient in end-to-end ML pipeline development from data collection and annotation to model deployment on AWS/GCP using FastAPI, Streamlit, and Docker.

Technical Skills

Languages & Core

Python (Advanced) SQL Django FastAPI Streamlit Flask

AI/ML & Deep Learning

TensorFlow PyTorch Keras Scikit-learn XGBoost OpenCV YOLOv5/v8 U-Net ResNet EfficientNet Transfer Learning

Gen-AI & NLP

LLMs: GPT-3/4, BERT, Llama 2, Mistral Architectures: Transformers, Attention mechanisms, Encoder-Decoder Frameworks: LangChain, LlamaIndex, Hugging Face Transformers, MCP Server RAG: Vector retrieval, Hybrid search, Reranking Embeddings: Sentence Transformers, OpenAI NLP: spaCy, NLTK, NER, Sentiment Analysis Conversational AI: RASA, Botkit Speech: Whisper, Wav2Vec 2.0, Tacotron, TTS/ASR Agents: MCP Server, Multi-agent systems, Tool calling

Data & Databases

Relational: SQL Server, PostgreSQL, MySQL Vector DB: Redis, ChromaDB, Pinecone NoSQL: MongoDB Big Data: Apache Spark, Hadoop, MapReduce, Hive ETL/ELT: Apache Airflow, Kafka, Luigi, Talend Processing: Pandas, NumPy, Spark SQL, MLlib Annotation: LabelImg, CVAT Delta Lake

Cloud & DevOps

AWS: EC2, S3, Lambda, SageMaker GCP Docker Git MLflow CI/CD: GitHub Actions, Jenkins

Specialized

OCR: Tesseract PDF Processing Web Scraping: BeautifulSoup, Selenium RESTful APIs Tableau Power BI GIS/Geospatial

Core Competencies

Analytics & Research

Predictive Analytics Data Mining Statistical Analysis Market Research

Business Intelligence

Business Intelligence Data Warehousing Decision Support Systems

Professional Experience

AI Engineer
IIT Madras Pravartak
Feb 2025 – Present
  • Developed and deployed conversational AI agents (chatbots, virtual assistants) using LLMs like GPT, BERT, and Llama models from Hugging Face
  • Implemented RAG pipelines by integrating vector databases (Redis, ChromaDB) with LLMs to enhance factual accuracy
  • Worked on diverse NLP projects including text summarization, translation, transcription, and sentiment analysis

Key Projects:

CAG Chatbot - Complete RAG Pipeline
Built end-to-end RAG pipeline with PDF parsing, chunking, embeddings, Redis vector DB, hybrid search, and LLM integration. Enables intelligent Q&A from high court documents with factual accuracy.
Llama Redis RAG FastAPI
CUB MSME Form Validation
OCR-based system for automated validation of MSME application forms. Integrated document processing, data extraction, and validation workflows with high accuracy for form processing.
OCR OpenCV Tesseract Python
NACIN Multi-lingual Chatbot
Multi-lingual tax and customs assistant supporting multiple Indian languages. Built with intent recognition, contextual dialogue, and personalized responses for complex tax queries.
BERT GPT LangChain Multi-lingual NLP
Neural TTS with Voice Cloning
Advanced text-to-speech system with neural voice cloning capabilities. Supports multiple voices, emotional tones, and real-time speech synthesis with human-like quality.
Tacotron WaveNet PyTorch Voice Cloning
Automatic Speech Recognition System
Real-time speech recognition system with noise cancellation and speaker diarization. Supports multiple languages and accents with high accuracy transcription.
Whisper Wav2Vec TensorFlow Noise Cancellation
Agentic AI with ReAct Framework
Autonomous AI agent implementing ReAct (Reasoning + Acting) framework. Capable of complex problem-solving, tool usage, and multi-step decision making.
ReAct Autonomous Agents LangChain GPT-4
ETV Broadcasting TTS Integration
Broadcasting system integration with TTS for automated news reading and announcements. Real-time text processing and speech synthesis for media broadcasting.
Broadcasting TTS Real-time Media Integration
Parliament Proceedings ASR
Automatic speech recognition system for parliament proceedings. Speaker identification, multi-lingual support, and real-time transcription for parliamentary sessions.
ASR Speaker Diarization Multi-lingual Real-time
Production-Grade RAG System
Scalable RAG implementation with caching, load balancing, and monitoring. Production-ready system supporting millions of documents with sub-second response times.
RAG Redis FastAPI AWS
Assistant Professor
ITM University, Raipur
Nov 2024 – Jan 2025
  • Lectured on Data Science, Machine Learning, and AI curriculum for undergraduate and graduate programs
  • Developed course materials and supervised student projects in Computer Vision, NLP, and Generative AI
  • Conducted practical lab sessions on Python, TensorFlow, PyTorch, and modern AI frameworks
  • Provided career guidance and placement preparation for students pursuing AI/ML roles
AI Engineer
NIC Raipur
Sept 2022 – Oct 2024
  • Contributed to object detection and classification models to optimize agricultural processes and improve crop yield
  • Worked on object detection, segmentation, and classification models for agriculture using advanced deep learning techniques
  • Managed SQL Server database administration and optimization with security measures
  • Led agricultural implement management platform development

Key Projects:

Pest Detection in Paddy
YOLOv8-based real-time pest detection system for paddy fields. Detects multiple pest types with high accuracy, enabling early intervention and reducing crop damage.
YOLOv8 Real-time OpenCV PyTorch
Disease Detection in Paddy
Multi-class disease classification using ResNet/EfficientNet models. Identifies 15+ paddy diseases from leaf images with 95%+ accuracy, aiding farmers in timely treatment.
ResNet EfficientNet TensorFlow Image Classification
Weed Detection in Paddy
Hybrid YOLOv5 + U-Net segmentation system for precise weed detection and localization. Enables targeted weed removal and reduces herbicide usage by 60%.
YOLOv5 U-Net Segmentation Computer Vision
PostgreSQL-Powered Farmer Assistant
Rule-based chatbot integrated with agricultural database. Provides instant guidance on crop management, weather, market prices, and government schemes.
PostgreSQL Rule-based AI WebSocket Django
Facial Recognition Attendance System
Employee attendance system with facial recognition and anti-spoofing measures. Real-time face detection, liveness detection, and accurate attendance logging.
Face Recognition Anti-spoofing Real-time OpenCV
Agricultural Implement Rental Platform
Full-stack platform for farmers to rent/exchange agricultural machinery. Features include booking system, payment integration, GPS tracking, and maintenance scheduling.
Django PostgreSQL REST API Payment Gateway
Application Developer
Augtech NextWealth IT Service Pvt Ltd, Bhilai
Feb 2022 – July 2022
  • Developed websites using Python with Django framework
  • Implemented web scraping solutions using BeautifulSoup and Selenium
  • Created chatbots for multiple platforms including Facebook, LinkedIn, and WhatsApp using APIs
  • Tested APIs through Postman and integrated them into code nodes
  • Built chatbots using RASA and Botkit frameworks

Academic Projects

Prediction of Breast Cancer Using Machine Learning
122 Days

Developed multiple classifiers including logistic regression for forecasting breast cancer with insights predictions. Explored various data mining approaches utilizing classification to produce in-depth predictions on breast cancer data. Identified optimal model with high efficiency by accessing datasets across diverse classifiers. Utilized UCI machine learning repository with 699 instances and 11 attributes from breast cancer dataset.

HR Management System
426 Days

Developed comprehensive system for maintaining employee records and performing all CRUD operations for efficient human resource management.

Smart Automatic Toilet Cleaning System
120 Days

Created fully automated toilet cleaning system for railway and residential use. Implemented cleaning of toilet seat, walls, and floor using pressurized water and cleaning fluid with automatic drying. Integrated human presence detection to initiate cleaning process automatically.

Achievements

National eGovernance Award 2023 (Gold)

View Certificate

Research Paper - Breast Cancer

View Certificate

BIT Research - Breast Cancer

View Certificate

Mapathon

View Certificate

ML Board Game Predictor

View Certificate

Paper Presentation - Wave Optics

View Certificate

NSS Certificate Type B

View Certificate

Certifications

Alison - Diploma in ML

View Certificate

360DigiTMG - Data Science Python

View Certificate

Python for Data Science

View Certificate

Oracle SQL

View Certificate

Udemy - ML with Python

View Certificate

Python Django

View Certificate

Coursera Python

View Certificate

Machine Learning with Python

View Certificate

Machine Learning & Data Science

View Certificate

ISRO - Remote Sensing & Global Navigation

View Certificate

Geospatial Master Plan

View Certificate

Remote Sensing in Agriculture

View Certificate

ISRO - Coastal Ocean Process

View Certificate

NPTEL English

View Certificate

Education

M.Tech. in Data Science
BITS Pilani, Hyderabad
2024
B.Tech. in Computer Science
Bhilai Institute of Technology, Raipur
2021
XII (Senior Secondary)
Gandhi Memorial Senior Secondary School
2017
X (High School)
Gandhi Memorial Senior Secondary School
2015