As a Research Engineer at MedAi, I specialize in leveraging Data Science and NLP within healthcare. My work centers on making digital health solutions more accessible, with particular expertise in information retrieval and human-computer interaction. I played a key role in pioneering the first AI-powered health platform designed specifically for Bengali speakers, emphasizing accessibility and inclusivity. The platform incorporates advanced features including symptom assessment and clinical decision support capabilities. My research interests lie in the convergence of digital healthcare, social computing, mental health, and wellbeing. I'm particularly passionate about human-centered computing, focusing on developing innovative technologies that deliver practical, meaningful impact in people's lives
Preprint
Automatic Speech Recognition for Biomedical Data in Bengali Language.Shariar Kabir*1; Nazmun Nahar*1; Mamunur Rashid*2; Shyamasree Saha*1. arXiv preprint arXiv:2406.12931 (2024)(pdf)
Working on
AmarDoctor: First Multilingual Digital Platform For AI-Driven Primary Care Triage And Patient Management System For Bengali Speakers. Nazmun Nahar*1; Shariar Kabir*1; Sumaiya Tasnia Khan*4; Suparna Das*3; Shyamasree Saha*1; Mamunur Rashid*2. (pdf)
Being a major contributor to this project since its inception, seeing it flourish makes me feel proud. Therefore, this achievement is very personal to me.
Thesis Project: Bengali Text Recognition Using Deep Learning, under the supervision of Professor Dr. Md. Monirul Islam. For this project, I created a word image dataset from printed documents, annotated it, then trained deep neural networks on it using a variety of methods, including CNN, RNN, LRU, and others.
Coursework: Artificial intelligence, Structured programming language, Object oriented programming language, Data Structures, Algorithms, Database, Computer architecture, Software engineering and information system design, Software development, Basic graph theory and others.
I secured a position in a government bank through a rigorous and highly competitive selection process. However, I found that the role lacked the technical challenges I seek in my career. Consequently, I made a bold decision to leave the position in pursuit of opportunities that allow me to continuously learn and apply my technical skills to real-world problems.
This symptom checker module serves as an intelligent tool that facilitates the symptom selection process by suggesting relevant symptoms based on the input provided by the patient.
Once the patient has completed entering their symptoms, the module prompts additional questions tailored to the patient's responses (yes, no, or don't know). Subsequently, it presents provisional diagnoses along with pertinent specialization recommendations, guiding the patient towards the appropriate healthcare professional.
The medical assistant chatbot discerns the user's mood and offers general illness options if the mood is suboptimal. It then tailors additional questions based on the identified intent of the chosen option and ultimately suggests whether the user should consult a physical health specialist or a mental health specialist.
To enhance user experience and provide more visually appealing symptom icons, I fine-tuned a Stable Diffusion model to generate custom vector arts for image data. This process involved preparing images, generating preliminary captions with image-to-text model, curating the captions, and training a Lora model.
While there are variations of the same symptoms in both Bengali and English, it's impractical to encompass all potential variations. Therefore, this module aligns user-provided symptoms with our existing database.
We employ a BERT model for sentence embeddings, specifically leveraging Bio-BERT for superior performance with medical data. To ensure accuracy, we utilize a combination of sentence similarity scores and organ matching. This prevents symptoms specified for one organ from being erroneously matched to another organ solely based on similarities in sentence structure.
Created a client management system allowing authorized clients to access our AI-powered services. Implemented JWT-based authentication, enabling clients to purchase services for customized durations according to available packages.
Due to the limited availability of conversational medical data in Bengali, our goal is to gather medical data, annotate it, and fine-tune a Large Language Model (LLM) for Named Entity Recognition (NER) of entities in the medical domain.
This project is currently in progress, with the objective of extracting medical data from conversations between doctors and patients. The data includes vital signs such as blood pressure, pulse rate, and temperature, as well as existing diseases, current symptoms, and diagnostic test information.