Hello, I'm Yash Rao
Data Scientist & Analyst

Transforming complex data into actionable insights and elegant visualizations

About Me

I'm a data scientist and analyst passionate about transforming data into valuable insights. With expertise in statistical analysis, machine learning, and data visualization, I help organizations make data-driven decisions.

My approach combines technical expertise with strong communication skills to translate complex findings into clear, actionable recommendations. I'm constantly exploring new technologies and methodologies to enhance my analytical capabilities.

3 Years Experience
4 Projects Completed

Skills & Expertise

  • Statistical Analysis: Expert
  • Data Cleaning & Preprocessing: Advanced
  • Exploratory Data Analysis: Expert
  • A/B Testing: Proficient
  • Dashboarding and Reporting: Expert
  • Supervised Learning: Expert
  • Unsupervised Learning: Expert
  • Deep Learning: Advanced
  • Image Processing: Expert
  • Natural Language Processing: Advanced
  • Docker and Containerization: Proficient
  • CI/CD with GitHub Actions: Proficient
  • Cloud Monitoring: Advanced
  • Cloud Resource Management: Advanced

Tools & Technologies

Python
SQL
R
Power BI
TensorFlow
Google Cloud
Airflow
Spark

Featured Projects

Machine Learning Project

News Notifier

Detects and visualizes trending news topics by clustering semantically similar articles. Uses MiniLM for embeddings, Facebook BART for summarization, and UMAP for dimensionality reduction.

Python · Transformers and Clustering (UMAP) · Airflow
Data Visualization Project

HR Attrition Dashboard

Built an interactive Power BI dashboard to analyze employee attrition across roles, age groups, education, salary, and experience. Identified key drivers of attrition, including low salary (under $5K), the 26–35 age group, and job roles such as Laboratory Technician and Sales Executive. Enabled HR teams to develop data-driven retention strategies.

Power BI · SQL · Data Visualization
Data Analysis Project

Personalized Cafe And Restaurant Recommendation System

Performed association rule mining on transaction data to identify product affinities and optimize store layouts.

Google BigQuery · RAG · ChromaDB and OpenAI API
Machine Learning Project

Asthma Predictor

Developed a machine learning pipeline to predict asthma risk based on patient demographic and clinical data. Performed extensive preprocessing, feature selection (Boruta, LDA, Random Forest), and balanced the dataset using undersampling techniques. Evaluated models like XGBoost, SVM, and AdaBoost using AUC and MCC for reliable classification.

R · Data Preprocessing · Machine Learning
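
As a rough illustration of why MCC is a useful companion to AUC when classes are imbalanced, here is a minimal Python sketch. The project itself was built in R; the function name `mcc` and the confusion-matrix counts below are hypothetical and are not taken from the project's data or results.

```python
import math


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns a value in [-1, 1]; 0.0 is returned when any marginal total
    is zero and the coefficient is undefined (a common convention).
    """
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return numerator / denominator if denominator else 0.0


def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Plain accuracy, shown only for comparison with MCC."""
    return (tp + tn) / (tp + tn + fp + fn)


# Hypothetical imbalanced case: the model predicts "no asthma" for everyone.
all_negative = dict(tp=0, tn=90, fp=0, fn=10)
print(accuracy(**all_negative))  # high accuracy despite a useless model
print(mcc(**all_negative))       # MCC exposes the lack of predictive signal
```

Accuracy rewards the trivial majority-class predictor here, while MCC does not, which is one reason to report it next to AUC.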
Data Visualization Project

Predicting ESG Risks Using Flood Detection

Published a report highlighting the ESG risks of floods in flood-prone areas, using satellite imagery, geospatial analysis, and deep learning.

Deep Learning · Python · Geospatial Analysis
Computer Vision Project

Wound Detection And Segmentation

Built an end-to-end wound analysis system using a fine-tuned DeepLabv3 model for segmentation. Integrated real-time image processing via Apache Kafka and a Flask API. Supports RGB input and outputs segmentation masks for clinical assessment.

Image Processing · Google DeepLabv3 · Apache Kafka

Experience & Education

Graduate Research Assistant

Boston University

Feb 2024 – May 2024
  • Built ChromaDB vector DB for melanoma research.
  • Integrated Llama2 RAG with 90% accuracy.
  • Used Query Translation + Step-Back Querying.

M.S. in Applied Data Analytics

Boston University

Sep 2023 – Jan 2025

GPA: 3.93 / 4.00

Research Intern

IIT Roorkee

Sep 2022 – Feb 2023
  • Developed 1D–2D ResNet50 for hyperspectral imagery.
  • Achieved 91% accuracy and ROC-AUC of 0.995.

Data Analyst

Cutso Foods LLP

Jun 2020 – Jul 2022
  • Built REST APIs, improved SQL by 35%.
  • Created dashboards & automated reports.
  • Reduced latency by 30% with multithreading.

B.Tech in Computer Science

VIT, India

Jul 2019 – Jun 2023

GPA: 8.90 / 10

Certifications

Microsoft Power BI Data Analyst (PL-300)

Issued by Microsoft · April 2024

IBM Data Science Professional Certificate

Issued by IBM · July 2023

AWS Certified Cloud Practitioner

Issued by Amazon Web Services · Aug 2024

Get In Touch

Interested in working together or have questions about my projects? Feel free to reach out!

Location

Boston, MA