- Project Overview
- Data Source
- Tools
- Data Cleaning
- Exploratory Data Analysis
- Results
- Recommendations
- Limitations
- References
The goal of this project is to analyze BlinkIT’s grocery sales data to uncover insights on sales trends, item popularity, and performance across different store types and locations. By examining metrics such as item sales, outlet performance, and customer ratings, we aim to provide actionable recommendations for inventory management, promotional strategies, and store operations.
This analysis is based on data from the "BlinkIT Grocery Analysis" Excel file, containing detailed information about grocery items, store attributes, and sales metrics.
- Python (Pandas): For data cleaning and preparation
- Microsoft Excel: For raw data exploration
- Markdown: For report documentation
The following steps were taken to clean and prepare the data for analysis:
- Missing Values: Checked for missing values in key columns and handled them as follows:
- Used median values for numerical columns (e.g.,
Item Weight
) where missing. - Imputed categorical columns with the most frequent category.
- Used median values for numerical columns (e.g.,
- Data Formatting: Standardized formats in categorical columns (e.g.,
Item Fat Content
). - Outliers: Detected and managed outliers in
Sales
andItem Visibility
columns to ensure accurate analysis. - Date Parsing: Parsed
Outlet Establishment Year
to calculate the age of each outlet.
The analysis explored the following key questions:
- What are the top-performing item categories in terms of sales?
- How does sales performance vary across outlet types and locations?
- Are there specific trends in customer ratings based on item type or outlet type?
- Which stores have the highest item visibility, and does visibility correlate with sales?
- Top-Performing Categories: Fruits, vegetables, and beverages emerged as top-sellers.
- Sales Trends by Outlet Type: Supermarkets of Type 1 generally had higher average sales, especially in Tier 1 locations.
- Rating and Sales Correlation: Higher-rated items saw a small increase in sales, suggesting that quality impacts customer choice.
- Outlet Age and Performance: Older outlets had slightly higher sales, possibly due to brand establishment and customer loyalty.
Based on the findings, the following actions are recommended:
- Promotional Strategies: Increase promotions for high-visibility items in Tier 1 locations.
- Inventory Focus: Prioritize stocking popular categories (e.g., fresh produce, beverages) in high-demand outlets.
- Customer Feedback: Use ratings to adjust inventory for items with consistently high ratings and low returns.
- Outlet Expansion: Consider opening additional stores similar to high-performing locations, particularly in Tier 1 areas.
- Data Completeness: Some item weight data was missing, requiring imputation, which may slightly affect accuracy.
- Visibility Metrics: The
Item Visibility
metric was difficult to interpret without additional context on store layout.