Projects with this topic
-
A modular Clinical NLP Pipeline built to process and analyze unstructured medical text using both traditional machine learning and transformer-based approaches.
The project combines multiple components including OCR, text preprocessing, feature engineering, classification, named entity recognition, and visualization into a single end-to-end pipeline. It supports extracting clinical insights from raw documents and predicting medical categories using both TF-IDF + SVM and BERT-based models.
The system was designed and implemented as a structured Python project, with each stage separated into independent modules for scalability and maintainability.
Key Highlights
Built an end-to-end NLP pipeline for clinical text processing. Implemented SVM (≈51% accuracy) and BERT (≈77% accuracy) models. Integrated OCR for extracting text from medical documents. Performed Named Entity Recognition (NER) on clinical data. Designed modular architecture (src/) for clean code organization. Exported outputs for visualization and dashboard integration.Updated -
Repository for the RGrid ML recruitment test!
Updated -
Propaganda Detection in Arabic Social Media Text
Updated -
A natural language processing implementation on movie genre classification by synopsis.
Updated -
Text vectorizer using Facebook's LASER multi-lingual encoder
Updated -
Simple text classification project on 20news data, using word embedding methods for text representation and supervised machine learning algorithms to classify.
Updated -
A Natural Language Processing Project Given any aptitude Question it should be able to solve. The very first phase is to classify the he given Dataset into 10 categories like Probability, Ages etc.
Updated