Hello, I'm

Kazi Mahathir Rahman

My journey in tech began during my BSc in Computer Science. I’m passionate about research in Reinforcement Learning, AGI, LLM, and Computer Vision, Whether it’s building smarter systems or exploring new frontiers in AI, I always love learning, experimenting new ideas.

Where did I work?

Experience

A timeline of my professional journey.

AI Engineer & Consultant

Anton Rx [Jan 2025 – September 2025]

Full Time

I'm an independent Machine Learning consultant with a focus on healthcare data. I work with a U.S.-based company called Anton RX, where I build predictive models and agentic AI. I have experience working on healthcare projects like claims & rebate systems, while also ensuring everything follows industry regulations.

Researcher

CVIS Research LAB [Sept 2023 – Jan 2025]

Part-time

I began working at the CVIS Research Lab under the guidance of Dr. Md. Ashraful Alam, where I contributed to various projects and research initiatives. As part of my responsibilities, I developed the CVIS website using WordPress. Additionally, I conducted three research projects focused on Computer Vision and Natural Language Processing, collaborating closely with lab researchers and fellow team members to achieve significant outcomes.

What do I know?

Skills

PyTorch
TensorFlow
Hugging Face
Selenium
C
C++
Scikit-learn
WordPress
python
Power BI
React
Django
MongoDB
HTML
PHP
CSS
MySQL
What I've done?

Projects

VesselDetector

In our project, we tackled the challenge of detecting ships from Synthetic Aperture Radar (SAR) images using the YOLOv7 model. SAR images are often plagued by significant noise. To mitigate this, we applied noise reduction techniques such as Denoising Diffusion Probabilistic Models (DDPM) and anisotropic diffusion, which significantly enhanced image quality. Our model was trained on SSDD, DS SDD, and FUSAR datasets and demonstrated high accuracy in ship detection.

Visit
SwinFaceRec

This project investigates facial expression recognition using Swin Transformer models (Swin-S, Swin-M, Swin-B). The system initially employs Haar Cascade for face detection, followed by Swin Transformer-based classification on datasets like RAF-DB, CK+, and FER2013.

Visit
PyParallel Library

PyParallel is a PyTorch-based library designed to simplify deep learning on multiple devices with features for model and data parallelism, allowing efficient training across CPUs, GPUs, and TPUs. It includes optimization tools like gradient checkpointing, mixed precision, automatic loss scaling, gradient clipping, and gradient accumulation to improve performance and memory usage. Additionally, PyParallel offers real-time hardware monitoring, providing CPU, GPU, and TPU memory tracking through an interactive dashboard, making it easy to manage resources during training.

Visit
FIFA22 Position Predictor

This project aims to predict player positions in FIFA 23 by employing various data preprocessing techniques such as normalization, scaling, feature engineering, and handling missing values on a Kaggle dataset. Several machine learning models including Random Forest, Support Vector Machines, and Neural Networks were compared to optimize player position prediction accuracy.

Visit
BRACU Helper ChatBo

BRACU Helper is a university chatbot built to assist BRAC University students by answering their common queries. Developed using Django for the backend and React for the frontend, the chatbot utilizes a fine-tuned Sentence Transformer model to generate accurate and relevant responses. The project includes features like user authentication, a responsive chat interface, and data storage for chat logs, making it a useful tool for enhancing the student experience.

Visit
EssenceAI

EssenceAI is a text summarizer website developed using Django and React, designed to help users generate concise summaries of large bodies of text. The site leverages a custom-trained T5 model, fine-tuned with the Gigaword dataset, ensuring high-quality and contextually relevant summaries. The interface is built using the Bootstrap CSS framework, offering a modern and responsive design.

Visit
Islamic Assistance BOT

This project is an Islamic Knowledge Assistance Chatbot built using Django, HTML, and CSS, designed to provide accurate answers to questions related to Islam. The chatbot employs a retrieval-based approach utilizing TF-IDF vectorization and cosine similarity, along with natural language processing techniques like tokenization and stopword removal, to match user queries with the most relevant answers from a custom dataset of Islamic questions and answers.

Visit
Tweet Sentiment Analysis

This project focuses on sentiment analysis of a dataset using both machine learning and deep learning models, including Random Forest, RNN, LSTM, and GRU. Despite employing techniques like dropout, regularization, and hyperparameter tuning, all models exhibited overfitting. Improvements in data preprocessing and acquiring a more diverse dataset are necessary generalization to unseen data.

Visit
Hotel Review Data Analysis

This project aimed to investigate the relationship between TripAdvisor reviews and corresponding ratings. By employing a combination of Natural Language Processing (NLP) techniques and deep learning models, the study sought to understand the extent to which review text could predict or explain rating variations.

Visit
Daraz Scraper

This project is a web scraping bot designed to extract product details from the Daraz online shopping platform. By providing a search keyword and specifying the page number, the bot uses Selenium to navigate Daraz's website, collect data such as product name, brand, price, and seller information, and then saves this data into a CSV file.

Visit
Bikroy.com Scraper

This project involved web scraping mobile phone data from Bikroy.com using Python. Key information such as mobile name, model, price, reviews, features, and other details were extracted from the first five pages of the website due to hardware constraints. The Python libraries Beautiful Soup 4, requests and selenium were primarily used for this task.

Visit
ResNet-34 (Build From Scratch)

This project explores the development of object detection models using ResNet-34, ApexNet, InceptionNet, and GoogleNet architectures build from scratch on a specified image dataset utilizing TensorFlow.

Visit
Breast Cancer Surviva Prediction

This project investigated the potential of machine learning for breast cancer analysis and prediction. By exploring various models like SVM, Logistic Regression, KNN, and Random Forest on a preprocessed dataset, the project aimed to identify the most effective approach. While all models achieved decent accuracy, SVM stood out with a superior F1 score, indicating its better ability to balance precision and recall.

Visit
Protein Sequence Classification

This research investigates the prediction and classification of protein sequences using a dataset encompassing diverse protein information. A comprehensive approach was employed, incorporating various preprocessing techniques to prepare the data effectively. Multiple machine learning models were then applied and compared to determine the most suitable method for accurately predicting and categorizing protein sequences based on their underlying characteristics.

Visit
Stock Prediction

This project explores the potential of machine learning for stock price prediction using the S&P 300 index as a dataset. Random Forest, Logistic Regression, and Support Vector Machine models were implemented and compared to forecast future stock prices. The project encompasses data collection, preprocessing, feature engineering, model training, evaluation, and visualization.

Visit
Fitness Tracking and Household Service Website

This platform lets users track fitness goals and book household services like cleaning, repairs, and gardening. Built with PHP, MySQL, HTML, and CSS, it offers easy management for health and home tasks in one place.

Visit Visit
Github Portfolio Website

I've constructed a dynamic online portfolio showcasing my web development abilities. Leveraging HTML, CSS, JavaScript, and PHP, I've crafted a responsive and interactive platform hosted on GitHub. This digital showcase effectively highlights my skills and projects, providing a comprehensive overview of my professional capabilities.

Visit
What are my innovation?

Research

Explore my research contributions, publications, and innovative

TextDiffuser-RL: Efficient and Robust Text Layout Optimization for Text-to-Image Synthesis

TextDiffuser-RL is a lightweight pipeline for text-embedded image generation, combining reinforcement learning for layout and diffusion-based synthesis. It runs efficiently on CPUs and GPUs, matches TextDiffuser-2 in quality, and is 45.67x faster with minimal memory use.

Read More
Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation

We introduce a pipeline that converts spoken English into smooth 3D ASL animations. It uses Whisper for speech-to-text, MarianMT for ASL gloss translation, and a keypoint-based motion system trained on Sign3D-WLASL. Grammar rules and interpolation ensure accurate and natural sign language animation.

Read More
How I am?

About Me

About Me Portrait

I am an experienced AI Engineer specializing in the U.S. healthcare sector, with expertise in machine learning and MLOps. My work focuses on developing predictive models, agentic AI, and automating decision systems for healthcare applications. This always ensuring regulatory compliance and optimizing clinical workflows.

My research interests include reinforcement learning, computer vision, natural language processing, and large language models. I am passionate about bridging the gap between cutting-edge AI research and real-world impact, contributing to innovative projects that advance both technology and healthcare outcomes.

You can see my resume for formal details.

Resume/CV

Let's connect!

Get In Touch

Interested in connecting or collaborating? You can reach me directly at
mahathirmahim73@gmail.com or kazi.mahathir.rahman@g.bracu.ac.bd.
Find me on social media: