GitHub - Eric-Xu/nydssg_pipelines: Presentation at the NYC Data Science Study Group on how to streamline your cross-validation and classification workflow using scikit-learn's Pipeline and FeatureUnion classes.

As a prediction model grows in complexity, scikit-learn's Pipeline and FeatureUnion classes offer a convenient way to organize all of our data extraction, transformation, normalization, and training steps. By chaining transformers and estimators together, we can wrap the extraction of each feature in a single pipeline. The feature pipelines can then be reordered and combined using FeatureUnion. This saves time and keeps our code better organized while we look for the best combination of techniques for a modeling task.
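
The chaining and combining described above can be sketched as follows. This is a minimal illustration, not code from the presentation's notebook: the iris dataset, the choice of transformers (`StandardScaler`, `PCA`, `SelectKBest`), the `LogisticRegression` classifier, and all step names are assumptions made for the example.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectKBest
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import FeatureUnion, Pipeline
from sklearn.preprocessing import StandardScaler

# Assumed example data: the iris dataset that ships with scikit-learn.
X, y = load_iris(return_X_y=True)

# One feature pipeline: scale the inputs, then keep two principal components.
pca_features = Pipeline([
    ("scale", StandardScaler()),
    ("pca", PCA(n_components=2)),
])

# A second feature extractor: the single best raw column by univariate score.
best_raw_feature = SelectKBest(k=1)

# FeatureUnion runs each extractor on the same input and
# concatenates their outputs column-wise.
combined = FeatureUnion([
    ("pca", pca_features),
    ("kbest", best_raw_feature),
])

# The full model: combined features followed by a classifier.
model = Pipeline([
    ("features", combined),
    ("clf", LogisticRegression(max_iter=1000)),
])

# Cross-validation refits every step on each training fold, so the scaler,
# PCA, and feature selector never see the held-out fold.
scores = cross_val_score(model, X, y, cv=5)

# 2 PCA components + 1 selected column = 3 combined feature columns.
n_combined_columns = combined.fit_transform(X, y).shape[1]
```

Because the whole workflow is a single estimator, swapping a step (for example, a different classifier under `"clf"`) or reordering the extractors in the `FeatureUnion` list changes one line rather than the surrounding cross-validation code.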

Folders and files (6 commits):
data
image
.gitignore
README.md
pipelines.ipynb
tox.ini
