Note: You can select more than one with ctrl + click
A scalable NLP solution for analyzing customer reviews, designed to improve satisfaction and product offerings. The project leverages NLP, MLOps, and Azure ML with a scalable and reusable architecture.
Highlights:
Impact: Empowered data-driven business decisions by building production-grade machine learning systems on cloud infrastructure to transform unstructured feedback into actionable insights.
Tools: Python, Git, MLflow, Azure (Azure Machine Learning, Azure Blob Storage), Docker, sklearn, pandas, numpy, venv
A robust RAG-based chatbot designed to provide accurate answers to complex computer science and data science questions, utilizing a knowledge-based question-answering approach. The system includes real-time evaluation and monitoring, ensuring high accuracy, relevance, and performance.
Highlights:
Impact: Enhanced learning and interview preparation by providing an intelligent, real-time assistant capable of answering complex questions with precision and minimal hallucinations, while continuously monitoring and optimizing performance metrics to deliver reliable and relevant results.
Tools: Azure OpenAI, Ollama, SentenceTransformers, ElasticSearch, Grafana and PostGres DB,Streamlit, Python, Docker and docker-compose or podman, Conda, Git and Github
A comprehensive MLOps pipeline for predicting student grades, exemplifying a scalable and effective approach to deploying and maintaining machine learning models in production.
Highlights:
Impact: Demonstrated the end-to-end development of a production-ready MLOps pipeline, enabling accurate and scalable predictions while ensuring reliability, maintainability, and efficient infrastructure management.
Tools: Python, Shell Script, Git, GitHub Actions, MLflow, AWS (Lambda with ECR, Kinesis, S3, IAM, CloudWatch), Terraform, Mage, Docker, docker-compose, Evidently, Grafana, LocalStack, Makefile, black, pytest, isort, pylint, sklearn, pandas, numpy, pyproject.toml, Pre-commit, venv
A robust data pipeline to analyze GitHub’s top contributors and activity types, providing actionable insights into user behavior on the platform through data engineering skills.
Highlights:
Impact: Enabled detailed analysis of GitHub user actions through a scalable and automated pipeline, supporting data-driven decisions for platform activity insights.
Tools: Python, Git, Pyspark/Apache Spark 3, Google Cloud(Cloud Run, Google Cloud Storage, Google BigQuery), Terraform, Mage Orchestrator, venv, Looker Studio
A refactored version of a strategy game I made, enhanced using Object-Oriented Programming (OOP) principles and the MVP (Model-View-Presenter) pattern to improve performance and flexibility for future updates.
Highlights:
Impact: Revitalized a legacy project by applying a modular software development approach, improving maintainability and scalability.
Tools: Python, git, venv
A robust RAG-based chatbot designed to provide accurate answers to complex computer science and data science questions, utilizing a knowledge-based question-answering approach. The system includes real-time evaluation and monitoring, ensuring high accuracy, relevance, and performance.
Highlights:
Impact: Enhanced learning and interview preparation by providing an intelligent, real-time assistant capable of answering complex questions with precision and minimal hallucinations, while continuously monitoring and optimizing performance metrics to deliver reliable and relevant results.
Tools: Azure OpenAI, Ollama, SentenceTransformers, ElasticSearch, Grafana and PostGres DB,Streamlit, Python, Docker and docker-compose or podman, Conda, Git and Github
A scalable NLP solution for analyzing customer reviews, designed to improve satisfaction and product offerings. The project leverages NLP, MLOps, and Azure ML with a scalable and reusable architecture.
Highlights:
Impact: Empowered data-driven business decisions by building production-grade machine learning systems on cloud infrastructure to transform unstructured feedback into actionable insights.
Tools: Python, Git, MLflow, Azure (Azure Machine Learning, Azure Blob Storage), Docker, sklearn, pandas, numpy, venv
A comprehensive MLOps pipeline for predicting student grades, exemplifying a scalable and effective approach to deploying and maintaining machine learning models in production.
Highlights:
Impact: Demonstrated the end-to-end development of a production-ready MLOps pipeline, enabling accurate and scalable predictions while ensuring reliability, maintainability, and efficient infrastructure management.
Tools: Python, Shell Script, Git, GitHub Actions, MLflow, AWS (Lambda with ECR, Kinesis, S3, IAM, CloudWatch), Terraform, Mage, Docker, docker-compose, Evidently, Grafana, LocalStack, Makefile, black, pytest, isort, pylint, sklearn, pandas, numpy, pyproject.toml, Pre-commit, venv
A robust data pipeline to analyze GitHub’s top contributors and activity types, providing actionable insights into user behavior on the platform through data engineering skills.
Highlights:
Impact: Enabled detailed analysis of GitHub user actions through a scalable and automated pipeline, supporting data-driven decisions for platform activity insights.
Tools: Python, Git, Pyspark/Apache Spark 3, Google Cloud(Cloud Run, Google Cloud Storage, Google BigQuery), Terraform, Mage Orchestrator, venv, Looker Studio
A refactored version of a strategy game I made, enhanced using Object-Oriented Programming (OOP) principles and the MVP (Model-View-Presenter) pattern to improve performance and flexibility for future updates.
Highlights:
Impact: Revitalized a legacy project by applying a modular software development approach, improving maintainability and scalability.
Tools: Python, git, venv