Hello! I'm Shubhang Dhawan
-
24-year-old, recent graduate with a Master's in Data Science (2024) from Chandigarh University and a Bachelor's in Science from CSJMU.
-
Over a year of experience in AI/ML through internships at Frontera Health and Times Network.
-
Frontera Health Internship: Worked on a healthcare chatbot using RAG, Mistral AI, OpenAI API, MongoDB Atlas, and Google Cloud.
-
Times Network Internship: Developed AI-powered article generation, news video translation, and a chatbot using Azure GPT-4o, Meta API, vector search, and NLP techniques.
-
Expertise in RAG, LlamaIndex, LangChain, vector databases, semantic search, and frontend development with Node.js.
-
Published research papers in IEEE and UGC.
-
Passionate about AI and data-driven solutions, eager to contribute to innovative projects.
• Looking forward to working in AI/ML, software development, and related fields
EXPERIENCE

COMPANY NAME

Designation-:AI/ML Engineer
Duration -: Nov 2024 – Feb 2025
AI-Powered Article Generation Based on Editor's Persona:
• Implemented Azure GPT-4o, Azure Cosmos DB, and Azure AI Search with vector-embeddings.
• Generated articles aligning the editor's persona following the 5W1H approach for structured content.
AI-Driven News Video Translation & Hindi AI Voiceover:
• Utilized OpenAI API, GenAI voiceover, and transcription models to enhance news video accessibility.
• Enabled automated translation, transcription, & Hindi AI- voiceover for multilingual content delivery.
Car Specifications Chatbot on WhatsApp:
• Developed using Meta API, GPT-4o Mini, and RAG for efficient response generation.
• Implemented vector-based similarity search to fetch accurate car specifications from Times Drive data.
Internship

COMPANY NAME
Designation -: LLM Engineer
Duration -: March 2024 – October 2024
Internship
GenAI Chatbot for Health & Wellness Sector:
-
Collaborated with the team on developing a GenAI chatbot, focusing on creating an engaging and efficient tool for the health and wellness space.
-
Worked on building the chatbot using RAG, semantic search, Llama-Index, and Mistral AI to deliver accurate and helpful responses.
-
Utilized MongoDB Atlas and Llama-Index to store embeddings and vectors, which improved how the chatbot handled search queries and managed data efficiently.
-
Enhanced chatbot responses by integrating Mistral AI 7B Instruct and GPT-3.5 Turbo, ensuring more precise and relevant outputs.
-
Teamed up on the frontend development using Node.js to create a smooth and user-friendly interface for the chatbot.
-
Deployed and optimized the chatbot on Google Cloud Platform (GCP) to boost performance and ensure scalability.
EDUCATION
2022-2024
Chandigarh University
Master's in Data Science
2019-2022
Chhatrapati Shahu Ji Maharaj University,
Bachelor's in Science
SKILLS

Python
Deep-Learning
Machine Learning
MySQL
Data-Analytics
Natural Language Processing
Google Cloud Platform
Microsoft Azure
MongoDB Atlas
Object-Oriented Programming
Large Language Models
Vector Database
Research - Publications
An Enhancement Of Online Drug Recommendation System Using"BGFT-DBi LSTM"&"PRFFC"Approaches
• My research paper introduces a new online drug recommendation system to address individual side
effects and improve personalization.
• It uses advanced techniques like BFGT-DBI(Bayes Functional with Gaussian Tanh-based Deep
Bidirectional)-LSTM for better accuracy.
• The system processes user comments from social media, clusters users by age, and extracts key
features to recommend safe drugs.
• This approach aims to enhance the reliability of online drug recommendations for diverse health issues.​
Ethical Implications of AI in Healthcare
• This Research paper explores the Ethical considerations of using Artificial Intelligence (AI) in healthcare.
• It discusses how AI can transform how we diagnose illnesses, treat patients, and care for them.
• The focus is on the challenges AI brings, like keeping patient information private, ensuring data is secure, and avoiding biases in AI algorithms.
• Overall, it looks at how we can use AI effectively while still protecting patients' rights.​
Projects

1
An Ai-Healthcare-Chatbot-master project designed to provide intelligent and accessible healthcare information through natural language processing and conversation.
2
News-Feed Harmony
News Harvest is a Python project designed to streamline the extraction, categorization, and storage of news articles from diverse RSS feeds. Python script parses RSS feeds, extracts articles, from xml files categorizes based on sentiment/keywords, stores in MySQL database, and exports to CSV
4
Corono-Metrics Dashboard
In response to the global COVID-19 pandemic. This project was all about digging into COVID-19 data and turning it into easy-to-understand visuals. ,used SQL to sift through heaps of information on the pandemic, like how many people got sick, where it happened the most, and how testing and vaccinations were going.
5
Dynamic Insights Hub from
48-Laws of Power Book
A cutting-edge Intelligent Analysis and Retrieval System tailored for the profound "48 Laws of Power" book. My role involved crafting a dynamic platform that dissects, interprets, and distills insights from this iconic work, empowering users with unparalleled mastery over strategic principles.
6
This task involves analyzing reviews to identify and classify sentiments related to specific aspects. For example, in a garage review, subthemes like "incorrect tyres sent" (negative), "garage service" (positive), and "wait time" (negative) are identified. The goal is to develop a method to extract these subthemes and their sentiments from text.








