Skip to content

boubker98/customer-analytics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

8 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“Š Customer Analytics

Customer Analytics is a scalable, chunked data processing pipeline written in Python. It efficiently ingests, validates, transforms, and analyzes large customer purchase datasets β€” with support for schema enforcement, aggregation, and logging.


πŸš€ Features

  • βœ… Chunked processing of large CSVs
  • βœ… Data validation with pandera
  • βœ… Automatic logging of data size and memory usage
  • βœ… Cleaning and normalization of customer data
  • βœ… Aggregations: revenue, basket size, and customer counts
  • βœ… ISO 3166-1 alpha-2 country code validation with pycountry
  • βœ… Testable with pytest

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages