ecommercetools  by practical-data-science

Python toolkit for ecommerce, marketing science, and technical SEO

created 4 years ago
255 stars

Top 99.2% on sourcepulse

GitHubView on GitHub
Project Summary

EcommerceTools is a Python data science toolkit designed for professionals in e-commerce, marketing science, and technical SEO. It provides a comprehensive suite of functions for data analysis and model building, streamlining workflows within Jupyter notebooks or standalone Python projects.

How It Works

The toolkit leverages Pandas DataFrames for data manipulation and offers specialized modules for various e-commerce tasks. It includes utilities for data loading and formatting, customer segmentation (RFM, ABC), cohort analysis, purchase latency, and predictive modeling (CLV, AOV). The SEO module integrates with Google APIs for sitemap discovery, Core Web Vitals, Knowledge Graph data, and Search Console queries.

Quick Start & Requirements

  • Installation: pip3 install ecommercetools
  • Prerequisites: Python, Pandas. Some SEO functions require Google PageSpeed Insights API key or Google Search Console API access. NLP summarization requires PyTorch and a large model (1.2 GB+).
  • Data: The package includes a load_sample_data() function for immediate use with the Online Retail dataset.

Highlighted Details

  • Offers advanced customer analytics including RFM segmentation with heterogeneity (H) and predictive CLV/AOV using BG/NBD and Gamma-Gamma models.
  • Integrates with Google APIs for comprehensive SEO analysis, including Core Web Vitals and Search Console data.
  • Provides NLP capabilities for text summarization using Huggingface Transformers.
  • Includes functions for generating paid search ad copy and keywords.

Maintenance & Community

Created by Matt Clarke. Further community or maintenance information is not detailed in the README.

Licensing & Compatibility

The README does not specify a license. Compatibility for commercial use or closed-source linking is not detailed.

Limitations & Caveats

The NLP summarization feature requires a substantial model download and can be time-consuming. The get_serps() function is not intended for large-scale scraping and may be blocked. Some SEO functions require API keys, and fetching all data from Google Search Console may hit quota limits. The license is not specified, which may impact commercial adoption.

Health Check
Last commit

1 year ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
9 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.