textify-VQA
textify-VQA is a Vision-Question-Answering (VQA) project focused on answering product-related questions using both visual and textual information from product images.
I'm an AI & Data Science enthusiast, Completed my B.Tech in Data Science & AI at IIT Bhilai. Passionate about Machine Learning, Deep Learning, and AI applications, I have worked on several projects, including document AI, medical AI, NLP, and computer vision.
I've worked on a variety of projects, from simple websites to complex web applications. And many of them are open-source. Here are a few of my favorites.
MultiTaskNLP is a deep learning project designed to solve multiple natural language processing tasks — Named Entity Recognition (NER), Sentiment Classification, and Emotion Detection — simultaneously using a shared encoder architecture.
I've written something about AI, programming and life.
textify-VQA is a Vision-Question-Answering (VQA) project focused on answering product-related questions using both visual and textual information from product images.
Meta-learning approaches (Black-box and MAML) evaluated on the Omniglot dataset for few-shot image classification.
A concept-based explainability framework applied to large multimodal models (LMMs) using the VQAv2-small dataset.
The Career Crafters Interview Analysis System is an AI-powered platform designed to simulate interviews, assess both verbal and non-verbal communication skills, and provide detailed feedback to improve performance.
View my Resume
Snippets of Cool AI Stuff