Awesome H2O
Below is a curated list of all the awesome projects, applications,
research, tutorials, courses and books that use
H2O, an open source,
distributed machine learning platform. H2O offers parallelized
implementations of many supervised and unsupervised machine learning
algorithms such as Generalized Linear Models, Gradient Boosting Machines
(including XGBoost), Random Forests, Deep Neural Networks (Deep Learning),
Stacked Ensembles, Naive Bayes, Cox Proportional Hazards, K-means, PCA,
Word2Vec, as well as a fully automatic machine learning algorithm
(AutoML).
H2O.ai produces many
tutorials,
blog posts,
presentations and
videos about H2O, but
the list below is comprised of awesome content produced by the greater H2O
user community.
We are just getting started with this list, so pull requests are very much
appreciated! đ Please review the
contribution guidelines before making a pull
request. If youâre not a GitHub user and want to make a contribution,
please send an email to community@h2o.ai.
If you think H2O is awesome too, please â the
H2O GitHub repository.
Contents
Blog Posts & Tutorials
-
Using H2O AutoML to simplify training process (and also predict wine
quality)
Aug 4, 2020
-
Visualizing ML Models with LIME
-
Parallel Grid Search in H2O
Jan 17, 2020
-
Importing, Inspecting and Scoring with MOJO models inside H2O
Dec 10, 2019
-
Artificial Intelligence Made Easy with H2O.ai: A Comprehensive Guide
to Modeling with H2O.ai and AutoML in Python
June 12, 2019
-
Anomaly Detection With Isolation Forests Using H2O
Dec 03, 2018
-
Predicting residential property prices in Bratislava using recipes -
H2O Machine learning
Nov 25, 2018
-
Inspecting Decision Trees in H2O
Nov 07, 2018
-
Gentle Introduction to AutoML from H2O.ai
Sep 13, 2018
-
Machine Learning With H2O â Hands-On Guide for Data Scientists
Jun 27, 2018
-
Using machine learning with LIME to understand employee churn
June 25, 2018
-
Analytics at Scale: h2o, Apache Spark and R on AWS EMR
June 21, 2018
-
Automated and unmysterious machine learning in cancer detection
Nov 7, 2017
-
Time series machine learning with h2o+timetk
Oct 28, 2017
-
Sales Analytics: How to use machine learning to predict and optimize
product backorders
Oct 16, 2017
-
HR Analytics: Using machine learning to predict employee turnover
Sep 18, 2017
-
Autoencoders and anomaly detection with machine learning in fraud
analytics
May 1, 2017
-
Building deep neural nets with h2o and rsparkling that predict
arrhythmia of the heart
Feb 27, 2017
-
Predicting food preferences with sparklyr (machine learning)
Feb 19, 2017
-
Moving largish data from R to H2O - spam detection with Enron
emails
Feb 18, 2016
-
Deep learning & parameter tuning with mxnet, h2o package in R
Jan 30, 2017
Books
-
Hands on Time Series with R
Rami Krispin. (2019)
-
Mastering Machine Learning with Spark 2.x
Alex Tellez, Max Pumperla, Michal Malohlava. (2017)
-
Machine Learning Using R
Karthik Ramasubramanian, Abhishek Singh. (2016)
-
Practical Machine Learning with H2O: Powerful, Scalable Techniques
for Deep Learning and AI
Darren Cook. (2016)
-
Disruptive Analytics
Thomas Dinsmore. (2016)
-
Computer Age Statistical Inference: Algorithms, Evidence, and Data
Science
Bradley Efron, Trevor Hastie. (2016)
-
R Deep Learning Essentials
Joshua F. Wiley. (2016)
-
Spark in Action
Petar ZeÄeviÄ, Marko BonaÄi. (2016)
-
Handbook of Big Data
Peter BĂŒhlmann, Petros Drineas, Michael Kane, Mark J. van der Laan
(2015)
Research Papers
-
Maturity of gray matter structures and white matter connectomes, and
their relationship with psychiatric symptoms in youth
Alex Luna, Joel Bernanke, Kakyeong Kim, Natalie Aw, Jordan D. Dworkin,
Jiook Cha, Jonathan Posner (2021).
-
Appendectomy during the COVID-19 pandemic in Italy: a multicenter
ambispective cohort study by the Italian Society of Endoscopic Surgery
and new technologies (the CRAC study)
Alberto Sartori, Mauro Podda, Emanuele Botteri, Roberto Passera,
Ferdinando Agresta, Alberto Arezzo. (2021)
-
Forecasting Canadian GDP Growth with Machine Learning
Shafiullah Qureshi, Ba Chu, Fanny S. Demers. (2021)
-
Morphological traits of reef corals predict extinction risk but not
conservation status
NussaĂŻbah B. Raja, Andreas Lauchstedt, John M. Pandolfi, Sun W. Kim, Ann
F. Budd, Wolfgang Kiessling. (2021)
-
Machine Learning as a Tool for Improved Housing Price Prediction
Henrik I W. Wolstad and Didrik Dewan. (2020)
-
Citizen Science Data Show Temperature-Driven Declines in Riverine
Sentinel Invertebrates
Timothy J. Maguire, Scott O. C. Mundle. (2020)
-
Predicting Risk of Delays in Postal Deliveries with Neural Networks
and Gradient Boosting Machines
Matilda Söderholm. (2020)
-
Stock Market Analysis using Stacked Ensemble Learning Method
Malkar Takle. (2020)
-
H2O AutoML: Scalable Automatic Machine Learning. Erin LeDell, Sebastien Poirier. (2020)
-
Single-cell mass cytometry on peripheral blood identifies immune cell
subsets associated with primary biliary cholangitis
Jin Sung Jang, Brian D. Juran, Kevin Y. Cunningham, Vinod K. Gupta,
Young Min Son, Ju Dong Yang, Ahmad H. Ali, Elizabeth Ann L. Enninga,
Jaeyun Sung & Konstantinos N. Lazaridis. (2020)
-
Prediction of the functional impact of missense variants in BRCA1 and
BRCA2 with BRCA-ML
Steven N. Hart, Eric C. Polley, Hermella Shimelis, Siddhartha Yadav,
Fergus J. Couch. (2020)
-
Innovative deep learning artificial intelligence applications for
predicting relationships between individual tree height and diameter
at breast height
İlker Ercanlı. (2020)
-
An Open Source AutoML Benchmark
Peter Gijsbers, Erin LeDell, Sebastien Poirier, Janek Thomas, Berndt
Bischl, Joaquin Vanschoren. (2019)
-
Machine Learning in Python: Main developments and technology trends
in data science, machine learning, and artificial intelligence
Sebastian Raschka, Joshua Patterson, Corey Nolet. (2019)
-
Human actions recognition in video scenes from multiple camera
viewpoints
Fernando Itano, Ricardo Pires, Miguel Angelo de Abreu de Sousa, Emilio
Del-Moral-Hernandeza. (2019)
-
Extending MLP ANN hyper-parameters Optimization by using Genetic
Algorithm
Fernando Itano, Miguel Angelo de Abreu de Sousa, Emilio
Del-Moral-Hernandez. (2018)
-
askMUSIC: Leveraging a Clinical Registry to Develop a New Machine
Learning Model to Inform Patients of Prostate Cancer Treatments Chosen
by Similar Men
Gregory B. Auffenberg, Khurshid R. Ghani, Shreyas Ramani, Etiowo Usoro,
Brian Denton, Craig Rogers, Benjamin Stockton, David C. Miller,
Karandeep Singh. (2018)
-
Machine Learning Methods to Perform Pricing Optimization. A
Comparison with Standard GLMs
Giorgio Alfredo Spedicato, Christophe Dutang, and Leonardo Petrini.
(2018)
-
Comparative Performance Analysis of Neural Networks Architectures on
H2O Platform for Various Activation Functions
Yuriy Kochura, Sergii Stirenko, Yuri Gordienko. (2017)
-
Algorithmic trading using deep neural networks on high frequency
data
Andrés Arévalo, Jaime Niño, German Hernandez, Javier Sandoval, Diego
LeĂłn, Arbey AragĂłn. (2017)
-
Generic online animal activity recognition on collar tags
Jacob W. Kamminga, Helena C. Bisby, Duc V. Le, Nirvana Meratnia, Paul J.
M. Havinga. (2017)
-
Soil nutrient maps of Sub-Saharan Africa: assessment of soil nutrient
content at 250 m spatial resolution using machine learning
Tomislav Hengl, Johan G. B. Leenaars, Keith D. Shepherd, Markus G.
Walsh, Gerard B. M. Heuvelink, Tekalign Mamo, Helina Tilahun, Ezra
Berkhout, Matthew Cooper, Eric Fegraus, Ichsani Wheeler, Nketia A.
Kwabena. (2017)
-
Robust and flexible estimation of data-dependent stochastic mediation
effects: a proposed method and example in a randomized trial
setting
Kara E. Rudolph, Oleg Sofrygin, Wenjing Zheng, and Mark J. van der Laan.
(2017)
-
Automated versus do-it-yourself methods for causal inference: Lessons
learned from a data analysis competition
Vincent Dorie, Jennifer Hill, Uri Shalit, Marc Scott, Dan Cervone.
(2017)
-
Using deep learning to predict the mortality of leukemia patients
Reena Shaw Muthalaly. (2017)
-
Use of a machine learning framework to predict substance use disorder
treatment success
Laura Acion, Diana Kelmansky, Mark van der Laan, Ethan Sahker, DeShauna
Jones, Stephan Arnd. (2017)
-
Ultra-wideband antenna-induced error prediction using deep learning
on channel response data
Janis Tiemann, Johannes Pillmann, Christian Wietfeld. (2017)
-
Inferring passenger types from commuter eigentravel matrices
Erika Fille T. Legara, Christopher P. Monterola. (2017)
-
Deep neural networks, gradient-boosted trees, random forests:
Statistical arbitrage on the S&P 500
Christopher Krauss, Xuan Anh Doa, Nicolas Huckb. (2016)
-
Identifying IT purchases anomalies in the Brazilian government
procurement system using deep learning
Silvio L. Domingos, Rommel N. Carvalho, Ricardo S. Carvalho, Guilherme
N. Ramos. (2016)
-
Predicting recovery of credit operations on a Brazilian bank
Rogério G. Lopes, Rommel N. Carvalho, Marcelo Ladeira, Ricardo S.
Carvalho. (2016)
-
Deep learning anomaly detection as support fraud investigation in
Brazilian exports and anti-money laundering
Ebberth L. Paula, Marcelo Ladeira, Rommel N. Carvalho, Thiago MarzagĂŁo.
(2016)
-
Deep learning and association rule mining for predicting drug
response in cancer
Konstantinos N. Vougas, Thomas Jackson, Alexander Polyzos, Michael
Liontos, Elizabeth O. Johnson, Vassilis Georgoulias, Paul Townsend, Jiri
Bartek, Vassilis G. Gorgoulis. (2016)
-
The value of points of interest information in predicting
cost-effective charging infrastructure locations
Stéphanie Florence Visser. (2016)
-
Adaptive modelling of spatial diversification of soil classification
units. Journal of Water and Land Development
Krzysztof UrbaĆski, StanisĆaw GruszczyĆsk. (2016)
-
Scalable ensemble learning and computationally efficient variance
estimation
Erin LeDell. (2015)
-
Superchords: decoding EEG signals in the millisecond range
Rogerio Normand, Hugo Alexandre Ferreira. (2015)
-
Understanding random forests: from theory to practice
Gilles Louppe. (2014)
Benchmarks
Presentations
Courses
Software
License
To the extent possible under law, H2O.ai has
waived all copyright and related or neighboring rights to this work.