Portfolio

Explore My Work

View the full presentations and in-depth articles that comprehensively discuss each project. Dive into detailed insights, methodologies, and real-world applications in Data Science, Business Intelligence, Data Analytics, and Data Engineering.

Final Projects

Optimization Strategy for Sales and Customer Loyalty at Kalbe Nutritionals

This project analyzes Sales, RFM, Cohort, Churn, and CLV using SQL, Python, and Tableau to help Kalbe Nutritionals understand sales trends, customer segmentation, and retention strategies...

Data Engineering Project: Building a Telco Churn Analysis Pipeline

End-to-end Telco Churn Data Engineering Project using Docker, Python, Apache Airflow, PostgreSQL, and Spark. This project automates data ingestion, processing, and storage through a scalable pipeline...

Home Credit Default Risk Prediction: Data Science Final Project

Best Group and Best Student Awarded for Data Science Final Project predicting Home Credit customer default risk. Involves data preprocessing, EDA, feature engineering, and ML with XGBoost &...

Business Intelligence

Leveraging RFM & CLV for a Data-Driven Marketing Strategy at All-U-Need Mart

Optimizing All-U-Need Mart’s marketing strategy with RFM & CLV analysis using Python, SQL, and Tableau. This project enhances customer retention, reduces churn, and refines discount strategies through...

Olist E-Commerce Dashboard

An interactive Tableau dashboard providing both executive and operational insights into Olist’s sales trends, customer satisfaction, order fulfillment, and strategic performance for data-driven...

Funnel & Cohort Analysis for Business Growth

Funnel and Cohort Analysis for Business Growth: An end-to-end project using Python, SQL, and Tableau to uncover customer behavior, retention trends, and sales insights. Includes interactive dashboards...

Superstore RFM Analysis: Unlocking Customer Insights for Business Growth

Analyze customer data from the Superstore dataset through RFM segmentation and visualization. Perform EDA to identify key problems, calculate RFM scores, and derive actionable insights. Present...

Driving Growth with Uplift Modeling:Analyzing Marketing Promotions for Optimized Conversions

Uplift Modeling: Discount vs. BOGO (Control: No Offer) | This project analyzes the impact of Discount and BOGO offers compared to No Offer (Control) using S-Learner & Uplift Random Forest. It includes...

Automating School Data ETL from API with Apache Airflow

Business Intelligence project: showcasing an Airflow ETL pipeline to automate the extraction of school data from an API, apply transformations, and load the data into a PostgreSQL database. Includes...

Automating ETL with Stored Procedures in PostgreSQL

Automate your ETL process using PostgreSQL stored procedures! This project showcases the implementation of dwh.generate_sales(), a robust ETL solution that extracts, transforms, and loads sales data...

Designing a Data Warehouse for Employee Performance Analytics

Explore a comprehensive Data Warehouse design for employee performance analytics, leveraging the star schema approach. Discover insights on employee performance by department, education level, and...

Sales Data Warehouse Design: A Coffee Shop Case Study

Optimized Sales Data Warehouse Design for a coffee shop business using a Star Schema approach. Includes ERD, data marts, SQL queries, and insightful business analysis to improve store performance...

Data Science

Bank Customer Churn Prediction and Customer Lifetime Value (CLV) Optimization

Analyze bank customer churn and optimize Customer Lifetime Value using 11 machine learning models (e.g., Logistic Regression, Random Forest, XGBoost). Includes EDA, churn prediction, CLV analysis, and...

Netflix Recommendation System: Analysis and Machine Learning Implementation

A Netflix Recommendation System using machine learning models (KNN, Decision Tree, Random Forest, Logistic Regression, Naive Bayes, K-Means) evaluated for accuracy and clustering, with insights into...

A/B Testing of Spotify’s Playlist Recommendation Algorithm

This project analyzes the impact of a new playlist recommendation algorithm on engagement, retention, and conversion rates using t-tests and chi-square tests. Includes randomized test design, feature...

Predicting Car Selling Prices with Machine Learning

This data science project predicts car selling prices using machine learning based on features like engine size, fuel type, and ownership history. It includes data preprocessing, EDA, feature...

Predicting Airbnb Prices in Sydney

In this case study, I explore how to predict Airbnb prices in Sydney using various features such as location, property attributes, and host characteristics. By building a machine learning model, I aim...

YouTube Supervised Learning

Predicting YouTube video views using regression models with data analysis and preprocessing on the YouTube Statistics dataset. Involves feature engineering, model selection (Linear Regression, Random...

Data Analytics

Kimia Farma CLV & Sales Analysis

Proyek ini bertujuan untuk menganalisis performa bisnis Kimia Farma tahun 2020-2023, dengan fokus pada: Customer Lifetime Value (CLV) untuk memahami pola loyalitas pelanggan Tren penjualan dan...

Exploratory Data Analysis (EDA) of Bank Marketing Campaign

Business Intelligence Project: Analyze marketing campaign impact on term deposits using EDA, correlation, chi-square tests, and linear regression...

Exploratory Data Analysis (EDA) for House Prices

Analyzing housing data using Python with EDA, data preprocessing, and statistical methods. Includes Pearson Correlation, Chi-Square Test, Linear Regression, T-Test, and ANOVA to explore trends...

SQL Movie Database Analysis: Genres, Ratings & Actors

This SQL project explores a movie database using queries to find characters, analyze genres, and determine actor involvement. Key features include identifying the most popular genre, highest and...

Unlocking Retail Success with Data Analytics — Insights into Sales Performance and Customer Behavior

In this case study, I performed a detailed analysis of retail product sales data to extract meaningful insights using Python. This article walks through the process, the techniques employed, and the...

Data Engineering

Data Warehouse & ETL Optimization for Financial Services at ID/X Partners

A Scalable Data Engineering Solution By Hijir Della Wirasti Project Overview This project was conducted as part of a Data Engineering initiative with ID/X Partners, focusing on Data Warehouse...

Bus Travel Data Warehouse

Data Engineering project focused on designing a Star Schema for a bus travel agent dataset using Kimball's methodology. Includes ERD creation, identification of slowly changing dimensions (SCD)...

Automated ETL Workflow with Airflow – MySQL to PostgreSQL

For this Data Engineer project, I implemented a testing workflow for Airflow DAGs. The project involves creating more than 5 tables in MySQL, extracting data to PostgreSQL, and managing the pipeline...

ETL Pipeline with Apache Airflow: Extract, Transform, and Load Data into SQLite

This Data Engineer project creates an ETL pipeline using Airflow. Data is extracted from CSV sources, saved as CSV in the `data/` folder, and loaded into SQLite. The `BranchOperator` selects the...

Streaming Data Pipeline

This Data Engineering project involves collecting streaming data using Apache Kafka, performing real-time ETL with Apache Spark Streaming, and validating data. The project includes creating a...

Kafka

This project sets up a Kafka cluster using Docker with one broker and Kafka UI (Bitnami or Confluent). A producer generates random events every 5 seconds, and a consumer processes them. The project...

Web Scrapping

A web scraping program to extract data from the website https://toscrape.com/ and save the scraped data into a file...