In this project, I conducted an in-depth analysis of the Brazilian E-Commerce company, which includes detailed records of 100,000 orders made between 2016 and 2018 across various online marketplaces in Brazil. This analysis provides a comprehensive view of the Brazilian e-commerce landscape, capturing a wide range of variables such as order status, pricing, payment methods, freight logistics, customer demographics, product characteristics, and customer reviews.
This analysis aims to extract actionable insights that can help understand customer behavior, optimize logistics, and improve overall e-commerce performance!
Order Volume:
- Total Orders
- Orders per Month/Quarter
Revenue Metrics:
- Total Revenue
- Average Order Value (AOV)
- Revenue by Product Category
Customer Metrics:
- Customer Lifetime Value (CLV)
- Repeat Purchase Rate
- Customer Acquisition Cost (CAC)
Payment Performance:
- Payment Method Distribution: The percentage of transactions made using different payment methods (e.g., credit card, debit card, voucher).
- Payment Approval Rate
- Average Payment Value
Logistics and Delivery Metrics:
- Average Delivery Time
- Freight Cost per Order
- Delivery Success Rate
- Late Delivery Rate
Customer Satisfaction:
- Average Review Score
- Percentage of 5-Star Reviews
- Return Rate
Geographical Metrics:
- Sales by Region
- Order Distribution by Region
- Freight Cost by Region
Product Performance:
- Top-Selling Products
- Inventory Turnover
- Product Return Rate
- Project Overview
- Tools Used
- Data cleaning
- Exploratory data analysis
- Findings
- Recommendations
- Limitations
- Tableau visualization
By analyzing various aspects of the sales data, we seek to identify trends, make data-driven recommendations, and gain a deeper understanding of the company´s performance.
- Excel: Primary data cleaning
- SQL: Google BigQuery for data cleaning and transformation
- Panda: for data cleaning, inspection, and analysis
- Tableau: Visualization
- Python, PySpark
In the initial data preparation phase I performed the following tasks:
- Data loading and inspection
- Handling missing values and duplicates
- Data cleaning and formatting
In this analysis, I explored key questions to gain insights into e-commerce performance and customer behavior:
- How did order volume and revenue evolve over time? Is there any seasonality? What are the peak sales periods?
- What are the patterns in payment methods and their impact on sales?
- What are the main drivers of delivery performance?
- Which product categories and attributes are the most popular?
- What are the key customer demographics and behavior patterns?
- How do customer reviews correlate with product performance?
- What are the trends in customer retention and repeat purchases?
- Did delivery performance or customer behavior vary by region?
- What is the overall sales trend?
- Which products are top sellers?
- Which products are sold in high quantities?
- Which seller sold the most?
- Which products received low reviews?
- Which products received high reviews?
- Which products generated the highest revenue?
Based on the findings the following insight was generated:
-
Sales Trends and Seasonality: Holiday Peaks: The significant sales peak during the holiday season highlights the importance of seasonal promotions and inventory planning. The reduction in sales towards the end of 2018 may indicate a need for improved post-holiday strategies or adjustments in inventory management.
-
Revenue by Category: Health and Beauty Dominance: The Health and Beauty category's contribution of 14.59% to total revenue underscores its popularity. This category’s performance suggests that investing in marketing and promotions for these products could further boost sales.
-
Gift Watches: With 13.20% of the total revenue, Gift Watches also play a significant role. Consider exploring opportunities for expanding this category or introducing related products to capitalize on its success.
-
Delivery Performance: Ahead of Expectations: The company’s ability to deliver 54.51% ahead of expected delivery times is impressive and could be a key competitive advantage. Emphasizing this performance in marketing materials could enhance customer satisfaction and attract new customers.
-
Top Seller Performance: Seller ID 399,322,315: This seller's significant sales volume suggests they are a major contributor to overall revenue. It may be beneficial to analyze their strategies and practices to identify best practices that could be replicated across other sellers.
-
Top Rated Sellers: Sao Paulo Excellence: Four out of the top ten five-star rated sellers located in Sao Paulo indicate a strong market presence in this region. There may be regional factors contributing to their high ratings that could be leveraged in other regions.
Based on the analysis I recommend the following actions:
-
Optimize Inventory Management: Stock High-Demand Products: Ensure top-selling products, especially those in high-demand categories like Health and Beauty and Gift Watches, are adequately stocked, particularly during peak seasons such as the holidays. Implementing advanced forecasting models can help anticipate demand more accurately.
-
Review and Communicate Delivery Times: Adjust Expected Delivery Times: Given the company’s strong delivery performance, consider updating expected delivery times to more accurately reflect actual delivery capabilities. This adjustment can enhance customer trust and reduce dissatisfaction with delivery expectations.
-
Recognize and Reward Top Sellers: Incentivize Performance: Develop recognition programs or incentives for top-performing sellers, like Seller ID 399,322,315. Acknowledging and rewarding their efforts can boost motivation and set a benchmark for other sellers.
-
Address Negative Reviews: Investigate Low Ratings: For sellers with high numbers of one-star reviews, conduct a thorough review of their processes and customer service practices. Identify common issues and implement corrective actions to improve overall customer satisfaction.
-
Leverage Regional Strengths: Explore Sao Paulo Success: Investigate what makes sellers in Sao Paulo excel. It may involve factors like regional marketing strategies, customer service practices, or logistical efficiencies. Apply these insights to improve performance in other regions.
- Customer Behavior Insights: Limited Customer Data: The dataset might not include detailed customer profiles or purchasing history beyond what is recorded, limiting the depth of behavioral analysis.
- Data Privacy and Ethical Considerations: Anonymization: The dataset is anonymized, which means some context about individual transactions or customer behavior may be lost, potentially limiting the depth of analysis.
- I had to remove all null values from the revenue column because they would have affected the accuracy of my conclusion.
- In addition to that, I faced 601 products without any category. Because of limitations in the data provided, I wasn't able to add them to their respective categories.
Dashboard
Click here to Access the Interactive Tableau Dashboard