Projects

My projects makes use of vast variety of latest technology tools. My best experience is to create Data Science projects and deploy them to web applications using cloud infrastructure.

Reddit ETL

pythonDockerAirflowAWS AthenaTableau

Finding top posts on Reddit using Apache Airflow and Docker

ETL on ATM transactions

AWSpythonSqoopPySparkAmazon Redshift

Analysis on Denmarks top Financial company using Amazon Redshift and Big Data technologies.

Analysis on Customer Purchasing Behavior

pythonnumpyscipyseabornexcel

Modelling regression models on count data to accurately predict customer purchasing behavior.

Lead Scoring Model

pythonpandasnumpySeabornStatistics

A logistic regression machine learning model to predict if the customer is a potential lead or not by assigning a score.

Fire Risk and Prioritization of Fire Inspections in Collin County

pythonpandasnumpySeabornStatistics

Geospatial data analytics on over 1 million fire inspection records ML Training.