My projects use a wide variety of current tools and technologies. My strongest experience is building data science projects and deploying them as web applications on cloud infrastructure.
Reddit ETL
Python, Docker, Airflow, AWS Athena, Tableau
Finding the top posts on Reddit using Apache Airflow and Docker.
ETL on ATM transactions
AWS, Python, Sqoop, PySpark, Amazon Redshift
Analysis of ATM transactions at Denmark's top financial company using Amazon Redshift and big data technologies.
Analysis of Customer Purchasing Behavior
Python, NumPy, SciPy, Seaborn, Excel
Fitting regression models to count data to predict customer purchasing behavior.
Lead Scoring Model
Python, pandas, NumPy, Seaborn, Statistics
A logistic regression model that assigns each customer a score to predict whether they are a potential lead.
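A minimal sketch of how a logistic regression output can be turned into a lead score. The feature names, weights, and bias below are hypothetical placeholders for illustration, not the values from the actual project, and the 0 to 100 scaling is just one common convention.

```python
import math


def sigmoid(z: float) -> float:
    """Numerically stable logistic function mapping any real z into (0, 1)."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def lead_score(features, weights, bias):
    """Return a 0-100 score from the model's predicted conversion probability."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return round(100 * sigmoid(z))


# Hypothetical coefficients for two made-up features:
# [visited_pricing_page (0 or 1), emails_opened (count)]
WEIGHTS = [1.5, 0.4]
BIAS = -2.0

if __name__ == "__main__":
    # A customer who visited the pricing page and opened three emails.
    print(lead_score([1, 3], WEIGHTS, BIAS))
```

A score threshold (for example, treating customers above some cutoff as potential leads) would then be chosen from validation data rather than fixed in advance.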
Fire Risk and Prioritization of Fire Inspections in Collin County
Python, pandas, NumPy, Seaborn, Statistics
Geospatial data analytics and ML training on over 1 million fire inspection records.