hooqu Public
hooqu is a library built on top of Pandas-like DataFrames for defining "unit tests for data". It is a spiritual port of Apache Deequ to Python.
doom-emacs Public
Forked from doomemacs/doomemacs
An Emacs framework for the stubborn martian hacker
Emacs Lisp MIT License Updated Aug 12, 2020
pydeequ Public
Forked from margitaii/pydeequ
Python API for Deequ
Python Apache License 2.0 Updated Jun 6, 2020
data-science-interviews Public
Forked from alexeygrigorev/data-science-interviews
Data science interview questions and answers
presentation-resources Public
Some links, tools and resources for creating presentations, particularly targeted at non-artsy persons.
Updated Jan 30, 2020
dirty_cat Public
Forked from skrub-data/skrub
Encoding methods for dirty categorical variables
Python BSD 3-Clause "New" or "Revised" License Updated Jan 16, 2020
luigi Public
Forked from spotify/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Python Apache License 2.0 Updated Dec 27, 2019
cd4ml-workshop Public
Forked from ThoughtWorksInc/cd4ml-workshop
Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshops
A unified approach to explain the output of any machine learning model.
neural-style-pt Public
Forked from ProGamerGov/neural-style-pt
PyTorch implementation of neural style transfer algorithm
scikit-learn Public
Forked from scikit-learn/scikit-learn
scikit-learn: machine learning in Python
ichbineinberliner Public
http://mfcabrera.com/blog/2015/1/17/ichbineinberliner.html
arrow Public
Forked from apache/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for effic…
grip Public
Forked from joeyespo/grip
Preview GitHub Markdown files like Readme locally before committing them.
catboost Public
Forked from catboost/catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports comp…
MLAlgorithms Public
Forked from rushter/MLAlgorithms
Minimal and clean examples of machine learning algorithm implementations
conda-auto-env Public
Forked from drorata/conda-auto-env
Automatically activate a conda environment when entering a folder with a `*_env` directory
docopt Public
Forked from docopt/docopt
Pythonic command line arguments parser, that will make you smile
bump2version Public
Forked from c4urself/bump2version
Version-bump your software with a single command
ml-training-advanced Public
Forked from amueller/ml-training-advanced
Materials for the "Advanced Scikit-learn" class in the afternoon