Python library for creating data pipelines with chain functional programming

#

# datascience

## Repositories 700

a curated list of R tutorials for Data Science, NLP and Machine Learning

R
Updated Apr 18, 2018

Modin: Speed up your Pandas workflows by changing a single line of code

Code, Notebooks and Examples from Practical Business Python

Jupyter Notebook
Updated Oct 7, 2018

A series of notebooks that help teach kids principles of programming, python and maths.

datascience
numerical-computation
jupyter-notebook
kids-learn
prime-numbers
algorithm
learn-to-code
learning-python
learning-by-doing
introduction-to-python
introduction-to-data-science
introduction-to-algorithms

Jupyter Notebook
Updated Sep 13, 2018

Python Library for Model Interpretation/Explanations

An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana

The missing MatPlotLib for Scala + Spark

datascience
machinelearning
deeplearning
neural-network
natural-language-processing
artificial-intelligence

Updated Oct 26, 2018

Collection of functions to enhance ggplot2 plots with results from statistical tests.

ggplot-extension
statistical-tests
dataviz
r
statistical-analysis
statistical-inference
data
visualization
datascience
violin-plot
vignette
badge
parametric
robust
plot

R
Updated Nov 14, 2018

Jupyter Notebooks from the old UnsupervisedLearning.com (RIP) machine learning and statistics blog

machine-learning
machine-learning-algorithms
machinelearning
statistics
python
jupyter-notebook
jupiter-notebook
ipython-notebook
ipynb
ipynb-notebook
ipynb-jupyter-notebook
data-science
datascience

Jupyter Notebook
Updated Jul 23, 2018

krangl is a {K}otlin DSL for data w{rangl}ing

Kotlin
Updated Nov 7, 2018

Curated list of my reads, implementations and core concepts of Artificial Intelligence, Deep Learning, Machine Learni…

artificial-intelligence
deeplearning
python
datascience
tensorflow
pytorch
blogs
mathamatics
awesome-list
deep-learning
neural-network
awesome
list
machine-learning
algorithms

Updated Feb 22, 2018

A javascript library providing a new data structure for datascientists and developpers

data-frame
manipulation
sql
groupby
javascript
functional
data
datascience
datastructures
sql-syntax
dataframe
matrix

JavaScript
Updated May 30, 2018

R's and Pandas DataFrame in modern C++ using native types, continuous memory storage, and no virtual functions for da…

statistics
data-frame
heterogeneous-containers
data-science
data-structures
numerical-analysis
dataframe
dataframes
machine-learning
heterogeneous
heterogeneous-lists
cpp-library
header-only
data-analysis
datastructures
datascience
containers
template-metaprogramming
cpp17
cpp

C++
Updated Nov 9, 2018

The foundational library of the Morpheus data science framework

datascience
data-analysis
data-analytics
regression
regression-models
principal-component-analysis
finance
quantitative-finance
dataframe
dataframe-library
statistics
statistical-analysis

Java
Updated Jun 22, 2018

A central repository for all my projects

Jupyter Notebook
Updated Sep 8, 2017

Repo that contains the supporting material for O'Reilly Webinar "An Intro to Predictive Modeling for Customer Lifetim…

ltv
clv
cltv
o-reilly
webinars
python
notebook
stan
pystan
customer-lifetime
customer-lifetime-value
datascience
active

Jupyter Notebook
Updated May 25, 2017

JavaScript
Updated Jan 10, 2018

A fast xgboost feature selection algorithm

machine-learning
machine-learning-algorithms
feature-selection
xgboost-algorithm
xgboost
dimension-reduction
algorithm
boruta
data-science
datascientist
datascience
machinelearning

Python
Updated Jan 22, 2018

An OCaml kernel for Jupyter (IPython) notebook

ocaml
functional-programming
jupyter-kernels
machine-learning
datascience
dataanalysis
jupyter-notebook
jupyter
ocaml-kernel
ocaml-repl

Jupyter Notebook
Updated Nov 6, 2018

Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications

Jupyter Notebook
Updated Oct 5, 2018

Set of notes with links to help those who are Data Science Beginners

Updated Oct 19, 2018

Tool for encapsulating, running, and reproducing data science projects

Python
Updated Oct 31, 2018

Materials for DataScience.com LTV and Neural Nets Talks at PyData Seattle

Jupyter Notebook
Updated Sep 6, 2018

knyfe is a python utility for rapid exploration of datasets.

Python
Updated Apr 3, 2015

RENKU (連句) is a software platform designed to foster multidisciplinary (data) science collaboration.

A day to day plan for this challenge. Covers both theoritical and practical aspects

machine-learning
python
eda
vizualization
100daysofmlcode
datascience
tutorials
siraj-raval-challenge
machine-learning-algorithms
infographics
banner
100-days-of-code
implementation
regression-algorithms

Jupyter Notebook
Updated Sep 29, 2018

A {K}otlin g{ra}mmar for data {vis}ualization

Kotlin
Updated Nov 7, 2018

Say "ni" to data of any size

Perl
Updated Nov 13, 2018