Return to page

H2O.ai Blog

Filter By:

79 results Category: Year:
H2O Release 3.46
by Wendy Wong, Adam Valenta | April 15, 2024 H2O Release , H2O-3

We are excited to announce the release of H2O-3 3.46.0.1! Some of the highlights of this major release are that we added custom metric support for XGBoost, allowed grid search models to be sorted with custom metrics, and we enabled H2O MOJO and POJO to work with MLFlow. Several improvements were also made to the Uplift model (like MLI ...

Read more
Open-Weight AI Models: A Path to Responsible Innovation
by Sri Ambati | April 04, 2024 H2O-3 , Responsible AI , h2oGPT

The recent Request for Comments (RFC) issued by the National Telecommunications and Information Administration (NTIA) on open-weight AI models has sparked an important conversation about the future of AI. As we consider the potential benefits and risks associated with making AI model weights more accessible and transparent, it is clear ...

Read more
H2O Release 3.44
by Marek Novotny, Wendy Wong | October 20, 2023 H2O Release , H2O-3

We are excited to announce the release of H2O-3 3.44.0.1! We have added and improved many items. A few of our highlights are the implementation of AdaBoost, Shapley values support, Python 3.10 and 3.11 support, and added custom metric support for Deep Learning, Uplift Distributed Random Forest (DRF), Stacked Ensemble, and AutoML. Please r...

Read more
Building a Fraud Detection Model with H2O AI Cloud

In a previous article [1], we discussed how machine learning could be harnessed to mitigate fraud. This time, we’ll delve into a step-by-step guide on leveraging H2O AI Cloud to construct efficient fraud detection models. We’ll tackle this process in three critical stages: build, operate, and detect. First, we’ll utilize Driverless AI in ...

Read more
A Look at the UniformRobust Method for Histogram Type
by Hannah Tillman, Megan Kurka | July 25, 2023 GBM , H2O-3

Tree-based algorithms, especially Gradient Boosting Machines (GBM’s), are one of the most popular algorithms used. They often out-perform linear models and neural networks for tabular data since they used a boosted approach where each tree built works to fix the error of the previous tree. As the model trains, it is continuously self-corr...

Read more
Reducing False Positives in Financial Transactions with AutoML

In an increasingly digital world, combating financial fraud is a high-stakes game. However, the systems we deploy to safeguard ourselves are raising too many false alarms, with over 90% of fraud alerts being false positives. These false positives, not only frustrating for consumers but also costly for financial institutions, can eclipse t...

Read more
H2O.ai and Snowflake Enable Developers to Train, Deploy, and Score Containerized Software Without Compromising Data Security

H2O.ai today announced its participation as a launch partner for Snowflake’s Snowpark Container Services (available in private preview), which provides our joint customers with the flexibility to train, deploy, and score models all within their Snowflake account. This further expands the ease of use for data science teams to create machin...

Read more
H2O Releases 3.40.0.1 and 3.42.0.1
by Marek Novotny, Wendy Wong | June 23, 2023 GBM , GLM , H2O Release , H2O-3 , XGBoost

Our new major releases of H2O are packed with new features and fixes! Some of the major highlights of these releases are the new Decision Tree algorithm, the added ability to grid over Infogram, an upgrade to the version of XGBoost and an improvement to its speed, the completion of the maximum likelihood dispersion parameter and its expan...

Read more
10 Consejos para Convertirte en un Científico de Datos Exitoso
by Favio Vazquez | January 19, 2023 AutoML , Beginners , Data Science

La ciencia de datos llegó para quedarse. Los científicos de datos utilizan sus habilidades para ayudar a las empresas a tomar mejores decisiones sobre sus productos, servicios, a optimizar procesos, ahorrar y mejorar rentabilidad. Convertirse en un científico de datos de éxito implica muchos aspectos y el estudio continuo, ya que es un...

Read more
Explaining models built in H2O-3 — Part 1

Machine Learning explainability refers to understanding and interpreting the decisions and predictions made by a machine learning model. Explainability is crucial for ensuring the trustworthiness and transparency of machine learning models, particularly in high-stakes situations where the consequences of incorrect predictions can be signi...

Read more
New in Wave 0.24.0
by Martin Turoci | November 21, 2022 H2O Hydrogen Torch , H2O Release , H2O Wave

Another Wave release has arrived with quite a few exciting new features. Let’s quickly go over the biggest ones.Wave init CLI​How many times you wanted to build a Wave app fast, but then you realized you need to start from scratch, copy over the skeleton of your app and work up from there? For these exact reasons, we introduced a new wave...

Read more
Bias and Debiasing
by Kim Montgomery | April 15, 2022 Explainable AI , H2O-3

An important aspect of practicing machine learning in a responsible manner is understanding how models perform differently for different groups of people, for instance with different races, ages, or genders. Protected groups frequently have fewer instances in a training set, contributing to larger error rates for those groups. Some models...

Read more
Data Science with H2O.ai: An Introduction to Machine Learning and Predictive Modeling

Our own Jonathan Farland recently recorded a talk about machine learning and predictive modeling. In his talk, Jon also gave an overview of open source H2O and H2O AI Cloud . This video is a great resource for getting up to speed with the latest technology from H2O in half an hour. Some of you may prefer to go through the slides while l...

Read more
H2O Release 3.36 (Zorn)
by Michal Kurka | January 07, 2022 AutoML , H2O Release , H2O-3

There’s a new major release of H2O, and it’s packed with new features and fixes! Among the big new features in this release are Distributed Uplift Random Forest, an algorithm typically used in marketing and medicine to model uplift, and Infogram, a new research direction in machine learning that focuses on interpretability and fairness in...

Read more
H2O.ai Tools for a Beginner

Note : this is a community blog post by Shamil Dilshan Prematunga . It was first published on Medium .Hey, this is not a deep technical blog. I’d like to share the experience I had with H2O tools when I was studying Machine Learning. As a Research Engineer, I am currently working on an area based on Telecommunication. Day by day with my e...

Read more
New Features Now Available with the Latest Release of the H2O AI Cloud 21.10
by H2O.ai Team | October 18, 2021 H2O AI Cloud , H2O Release

The Makers here at H2O.ai have been busy building new features and enhancing capabilities across our AI platform . Designed to support our core mission of democratizing AI, these additions to our platform simplify the ability to make AI you can trust, operate it efficiently and innovate with ready-made AI applications.Launched in January ...

Read more
Introducing DatatableTon - Python Datatable Tutorials & Exercises
by Rohan Rao | September 20, 2021 Datatable , H2O-3 , Python , Tutorials

Datatable is a python library for manipulating tabular data. It supports out-of-memory datasets, multi-threaded data processing and has a flexible API.If this reminds you of R’s data.table , you are spot on because Python’s datatable package is closely related to and inspired by the R library.The release of v1.0.0 was done on 1st July,...

Read more
H2O Release 3.34 (Zizler)
by Michal Kurka | September 15, 2021 H2O Release , H2O-3

There’s a new major release of H2O, and it’s packed with new features and fixes! Among the big new features in this release, we’ve added Extended Isolation Forest for improved results on anomaly detection problems, and we’ve implemented the Type III SS test (ANOVAGLM) and the MAXR method to GLM. For existing algorithms, we improved the pe...

Read more
Visualizing Large Datasets with H2O-3
by Parul Pandey | September 09, 2021 H2O-3 , Tutorials

Exploratory data analysis is one of the essential parts of any data processing pipeline. However, when the magnitude of data is high, these visualizations become vague. If we were to plot millions of data points, it would become impossible to discern individual data points from each other. The visualized output in such a case is pleasing ...

Read more
AI-Driven Predictive Maintenance with H2O AI Cloud
by Parul Pandey, Asghar Ghorbani | August 02, 2021 AutoML , H2O AI Cloud , Machine Learning Interpretability , Manufacturing

According to a study conducted by Wall Street Journal , unplanned downtime costs industrial manufacturers an estimated $50 billion annually. Forty-two percent of this unplanned downtime can be attributed to equipment failure alone. These downtimes can cause unnecessary delays and, as a result, affect the business. A better and superior al...

Read more
The Emergence of Automated Machine Learning in Industry
by Parul Pandey | June 30, 2021 AutoML , Company

This post was originally published by K-Tech, Centre of Excellence for Data Science and AI, powered by NASSCOM. The link of the post can be found here. The concept of Automated Machine Learning has gained much traction recently. Automated Machine Le...

Read more
How Much is My Property Worth?

Note : this is a guest blog post by Jaafar Almusaad .How Much is My Property Worth?This is the million-dollar question – both figuratively and literally. Traditionally, qualified property valuers are tasked to answer this question. It’s a lengthy and costly process, but more critically, it’s inconsistent and largely subjective. Mind you, ...

Read more
Shapley summary plots: the latest addition to the H2O.ai’s Explainability arsenal

It is impossible to deploy successful AI models without taking into account or analyzing the risk element involved. Model overfitting, perpetuating historical human bias, and data drift are some of the concerns that need to be taken care of before putting the models into production. At H2O.ai, explainability is an integral part of our ML ...

Read more
Safer Sailing with AI
by Ana Visneski, Jo-Fai Chow, Kim Montgomery | April 01, 2021 Customers , Data Science , H2O Hydrogen Torch , H2O-3 , Machine Learning Interpretability

In the last week, the world watched as responders tried to free a cargo ship that had gone aground in the Suez Canal. This incident blocked traffic through a waterway that is critical for commerce. While the location was an unusual one, ship collisions, allisions , and groundings are not uncommon. With all the technology that mariners hav...

Read more
H2O AI Cloud: Democratizing AI for Every Person and Every Organization

Harnessing AI’s true potential by enabling every employee, customer, and citizen with sophisticated AI technology and easy-to-use AI applications. Democratization is an essential step in the development of AI, and AutoML technologies lie at the heart of it. AutoML tools have played a pivotal role in transforming the way we consume an...

Read more
H2O-3 Improvements from Two University Projects
by Veronika Maurerova | February 08, 2021 Academic Program , H2O-3

In September 2019 H2O.ai became a silver partner of the Faculty of Informatics at Czech Technical University in Prague. The main goal of this partnership is to make connections between students and companies to prepare an environment where students can use their knowledge in practice and gain real-work experiences. In general, within th...

Read more
New Improvements in H2O 3.32.0.2
by Veronika Maurerova | December 17, 2020 H2O Release , XGBoost

There is a new minor release of H2O that introduces two useful improvements to our XGBoost integration: interaction constraints and feature interactions.Interaction ConstraintsFeature interaction constraints allow users to decide which variables are allowed to interact and which are not.Potential benefits: Better predictive performance...

Read more
Introducing H2O Wave
by Jo-Fai Chow, Benjamin Cox | December 15, 2020 H2O Hydrogen Torch , H2O-3 , Product Updates , Python

For almost a decade, H2O.ai has worked to build open source and commercial products that are on the leading edge of innovation in machine learning, from AutoML to Explainable AI . We are thrilled to announce the release of what we believe to be the future of AI Applications: H2O Wave . Wave is an open source, lightweight Python developmen...

Read more
Mitos e verdades sobre o AutoML
by Alan Silva, Bruna Smith | November 10, 2020 AutoML , Beginners , Business , Community , Machine Learning

Todas as revoluções que tivemos até hoje, tanto as tecnológicas quanto industriais, possuem uma semelhança: elas estão ligadas à forma como os seres humanos lidam com as máquinas. Antes, os processos eram feitos de forma muito manual e, com o tempo, acabaram sofrendo uma evolução natural voltada para a automação. Com o aprendizado de máqu...

Read more
H2O on Kubernetes using Helm
by H2O.ai Team | October 16, 2020 H2O-3 , Kubernetes , Technical

Deploying real-world applications using bare YAML files to Kubernetes is a rather complex task, and H2O is no exception. As demonstrated in one of the previous blog posts . Greatly simplified, a cluster of H2O open source machine learning nodes is brought up in the following manner: A headless service to make initial node discovery and ...

Read more
H2O Release 3.32 (Zermelo)
by Michal Kurka | October 14, 2020 H2O Release , H2O-3

There’s a new major release of H2O, and it’s packed with new features and fixes! Among the big new features in this release, we’ve added RuleFit — an interpretable machine learning algorithm , introduced a new toolbox for model explainability, made Target Encoding work for all classes of problems, and integrated it in our AutoML framewor...

Read more
Combining the power of KNIME and H2O.ai in a single integrated workflow
by Rafael Coss, Stefan Pacinda | October 14, 2020 AutoML , Community , H2O Driverless AI , Partners , Technical , Tutorials

KNIME and H2O.ai , the two data science pioneers known for their open source platforms, have partnered to further democratize AI. Our approaches are about being open, transparent, and pushing the leading edge of AI. We believe strongly that AI is not for the select few but for everyone. We are taking another step in democratizing AI by ...

Read more
The Challenges and Benefits of AutoML
by Eve-Anne Trehin | October 14, 2020 AutoML , H2O Driverless AI , Machine Learning , Responsible AI

Machine Learning and Artificial Intelligence have revolutionized how organizations are utilizing their data. AutoML or Automatic Machine Learning automates and improves the end-to-end data science process. This includes everything from cleaning the data, engineering features, tuning the model, explaining the model, and deploying it into p...

Read more
The Benefits of Budget Allocation with AI-driven Marketing Mix Models
by Michael Proksch | September 17, 2020 AutoML , Business , Customers , GBM , GLM , Machine Learning , Solutions

Excerpt of the white paper: “The Latest in AI Technologies Reinvent Media and Marketing Analytics @ Allergan” Authors: Akhil Sood, Associate Director @ Marketing Sciences, Allergan Dr. Michael Proksch, Senior Director @ H2o.ai Vijay Raghavan, Associate Vice President @ Marketing Sciences, AllerganIntroductionThe call for accountability in...

Read more
Exploring the Next Frontier of Automatic Machine Learning with H2O Driverless AI
by Jo-Fai Chow | July 28, 2020 AutoML , H2O Driverless AI

At H2O.ai, it is our goal to democratize AI by bridging the gap between the State-of-the-Art (SOTA) in machine learning and a user-friendly, enterprise-ready platform. We have been working tirelessly to bring the SOTA from Kaggle competitions to our enterprise platform Driverless AI since its very first release. The growing list of Driver...

Read more
Sparkling Water 3.30.0.3 is out
by Jakub Hava | June 04, 2020 H2O-3 , Sparkling Water

Sparkling Water is about making machine learning simple, speedy, and scalable with Apache Spark. This blog provides an overview of the following new features: No H2O Client on Spark Driver Speedups Automatic String conversion to Categoricals No H2O Client on Spark DriverPreviously, Sparkling Water always started worker nodes eith...

Read more
Running H2O cluster on a Kubernetes cluster
by H2O.ai Team | April 14, 2020 H2O-3 , Kubernetes

H2O is an open-source, in-memory platform for distributed, scalable machine learning. A perfect match for deployment on a Kubernetes cluster, the very modern way of deploying, serving & scaling applications. With the major release 3.30.0.1, released in Q1 2020, H2O obtained first class Kubernetes support .This article explains how t...

Read more
H2O Release 3.30 (Zahradnik)
by Michal Kurka | April 07, 2020 H2O Release

There’s a new major release of H2O, and it’s packed with new features and fixes! Among the big new features in this release, we’ve introduced support for Generalized Additive Models, added an option to build many models in parallel on segments of your dataset, improved support for deploying on Kubernetes, upgraded XGBoost with newly added...

Read more
Insights From the New 2020 Gartner Magic Quadrant For Cloud AI Developer Services

We are excited to be named a Visionary in the new Gartner Magic Quadrant for Cloud AI Developer Services (Feb 2020), and have been recognized for both our completeness of vision and ability to execute in the emerging market for cloud-hosted artificial intelligence (AI) services for application developers. This is the second Gartner MQ tha...

Read more
AI & ML Platforms: My Fresh Look at H2O.ai Technology

2020: A new year, a new decade, and with that, I’m taking a new and deeper look at the technology H2O.ai offers for building AI and machine learning systems. I’ve been interested in H2O.ai since its early days as a company (it was 0xdata back then) in 2014. My involvement had been only peripheral, but now I’ve begun to work with this comp...

Read more
Key Takeaways from the 2020 Gartner Magic Quadrant for Data Science and Machine Learning

We are named a Visionary in the Gartner Magic Quadrant for Data Science and Machine Learning Platforms (Feb 2020). We have been positioned furthest to the right for completeness of vision among all the vendors evaluated in the quadrant. So let’s walk you through the key strengths of our machine learning platforms. Automatic Machine Learn...

Read more
Parallel Grid Search in H2O

H2O-3 is, at its core, a platform for distributed, in-memory computing. On top of the distributed computation platform, the machine learning algorithms are implemented. At H2O.ai, we design every operation, be it data transformation, training of machine learning models or even parsing to utilize the distributed computation model. In ord...

Read more
The Super Bowl and Data Science: Changing the NFL with the Power of Machine Learning
by Rafael Coss | January 31, 2020 Data Science , H2O-3 , Kaggle , Machine Learning

Super Bowl LIV came and went. The San Francisco 49ers vs the Kansas City Chiefs. Personally, being from the The Bay, I was rooting for the 49ers, but you can’t always get what you want. Whoever came out on top, though, we were all looking forward to a great game full of fantastic plays and the kind of gridiron tenacity where players lay i...

Read more
Grandmaster Series: How a Passion for Numbers Turned This Mechanical Engineer into a Kaggle Grandmaster

In conversation with Sudalai Rajkumar: A Kaggle Double Grandmaster and a Data Scientist at H2O.aiIt is rightly said that one should never seek praise. Instead, let the effort speak for itself. One of the essential traits of successful people is to never brag about their success but instead keep learning along the way. In the data science ...

Read more
H2O Release 3.28 (Yu)
by Michal Kurka | December 20, 2019 H2O Release

There’s a new major release of H2O, and it’s packed with new features and fixes! Among the big new features in this release, we’ve introduced support for Hierarchical GLM, added an option to parallelize Grid Search, upgraded XGBoost with newly added features, and improved our AutoML framework. The release is named after Bin Yu .Hierarchi...

Read more
Scalable AutoML in H2O
by Sanyam Bhutani | November 27, 2019 AutoML , H2O World , Machine Learning , Technical

Note: I’m grateful to Dr. Erin LeDell for the suggestions, corrections with the writeup. All of the images used here are from the talks’ slides. Erin Ledell’s talk was aimed at AutoML : Automated Machine Learning , broadly speaking, followed by an overview of H2O’s Open Source Project and the library. H2O AutoML provides an easy-to-use ...

Read more
Importing, Inspecting, and Scoring With MOJO Models Inside H2O
by H2O.ai Team | November 08, 2019 H2O-3 , Technical

Machine-learning models created with H2O may be exported in two basic ways: Binary format, Model Object, Optimized (MOJO). An H2 O model can be saved in a binary format, which is tied to the very specific version of H2 O it has been created with. There are multiple reasons for such a restriction. One of the important reasons is that...

Read more
A Deep Dive into H2O’s AutoML
by Parul Pandey | October 16, 2019 AutoML , H2O-3 , Technical

The demand for machine learning systems has soared over the past few years. This is majorly due to the success of Machine Learning techniques in a wide range of applications. AutoML is fundamentally changing the face of ML-based solutions today by enabling people from diverse backgrounds to use machine learning models to address complex ...

Read more
Make your own AI — Add Your Game to Auto-ML Models
by Karthik Guruswamy | October 15, 2019 AutoML , H2O Driverless AI , Machine Learning , Technical

When Features and Algorithms compete, your Business Use Case(s) wins! H2O Driverless AI is an Automatic Feature Engineering /Machine Learning platform to build AI/ML models on tabular data. Driverless AI can build supervised learning models for Time Series forecasts, Regression , Classification , etc. It supports a myriad of built-i...

Read more
New Innovations in Driverless AI

What’s new in Driverless AIWe’re super excited to announce the latest release of H2O Driverless AI . This is a major release with a ton of new features and functionality. Let’s quickly dig into all of that: Make Your Own AI with Recipes for Every Use Case: In the last year, Driverless AI introduced time-series and NLP recipes to meet the...

Read more
Mitigating Bias in AI/ML Models with Disparate Impact Analysis

Everyone understands that the biggest plus of using AI/ML models is a better automation of day-to-day business decisions, personalized customer service, enhanced user experience, waste elimination, better ROI, etc. The common question that comes up often though is — How can we be sure that the AI/ML decisions are free from bias/discrimina...

Read more
H2O Release 3.26 (Yau)
by Michal Kurka | July 30, 2019 H2O Release

There’s a new major release of H2O, and it’s packed with new features and fixes! Among the big new features in this release, we’ve introduced the ability to define a Custom Loss Function in our GBM implementation, and we’ve extended the portfolio of our machine learning algorithms with the implementation of the SVM algorithm. The release...

Read more
Custom Machine Learning Recipes: The ingredients for success

Last updated: 07/23/19Machine learning is akin to cooking in several ways. A perfect dish originates from a tried-and-tested recipe, has the right combination of ingredients, and is baked at just the right temperature. Successful AI solutions work on the same principle. One needs fresh and right quality ingredients in the form of data, ...

Read more
Getting started with H2O using Flow
by Parul Pandey | July 16, 2019 Flow , H2O-3 , Technical

This blog was originally published on towardsdatascience: https://towardsdatascience.com/getting-started-with-h2o-using-flow-b560b5d969b8A look into H2O’s open-source UI for combining code execution, text, plots, and rich media in a single document. Data collection is easy. Decision making is hard. Today, we have access to a humungous...

Read more
Toward AutoML for Regulated Industry with H2O Driverless AI

Predictive models in financial services must comply with a complex regime of regulations including the Equal Credit Opportunity Act (ECOA), the Fair Credit Reporting Act (FCRA), and the Federal Reserve’s S.R. 11-7 Guidance on Model Risk Management. Among many other requirements, these and other applicable regulations stipulate predictive ...

Read more
An Overview of Python’s Datatable package

This blog originally appeared on Towardsdatascience.com “There were 5 Exabytes of information created between the dawn of civilization through 2003, but that much information is now created every 2 days”: Eric Schmidt If you are an R user, chances are that you have already been using the data.ta...

Read more
H2O-3, Sparkling Water and Enterprise Steam Updates
by Venkatesh Yadav | April 10, 2019 Community , Data Science , H2O Release , Technical

We are excited to announce the new release of H2O Core, Sparkling Water and Enterprise Steam.Below are some of the new features we have added:H2O-3 Yates (3.24.0.1) – 3/31/2019Download at: http://h2o-release.s3.amazonaws.com/h2o/rel-yates/1/index.html Bug [PUBDEV-6159] – The AutoMLTest.java test suite now runs correctly on a local mach...

Read more
H2O Release 3.24 (Yates)
by Michal Kurka | April 02, 2019 H2O Release

There’s a new major release of H2O, and it’s packed with new features and fixes! Among the big new features in this release, we’ve introduced cross-version support for model import, added new features for model interpretation, provided much-improved support for reading data from Apache Hive, and included various algorithm and AutoML impr...

Read more
Boosting your ROI with AutoML & Automatic Feature Engineering
by Karthik Guruswamy | February 25, 2019 AutoML , Machine Learning

If your business has started using AI/ML tools or just started to think about it, this blog is for you. Whether you are a data scientist, VP of data science or a line of a business owner, you are probably wondering how AI will impact your organization in various ways or why your current strategies are not working somehow. If you are not ...

Read more
Key Takeaways from the Gartner Magic Quadrant For Data Science & Machine Learning

The Gartner Magic Quadrant for Data Science and Machine Learning Platforms (Jan 2019) is out and H2O.ai has been named a Visionary. The Gartner MQ evaluates platforms that enable expert data scientists, citizen data scientists and application developers to create, deploy and manage their own advanced analytic models.H2O.ai Key Highlights...

Read more
H2O New Year releases
by H2O.ai Team | January 18, 2019 H2O Release , H2O-3 , Python , R

There were two releases shortly after each other. First, on December 21st, there was a minor (fix) release 3.22.0.3 . Immediately followed by a more major release (but still on 3.22 branch) codename Xu, named after mathematician Jinchao Xu , whose work is focused on deep neural networks, besides many other fields of research.Of course, th...

Read more
New features in H2O 3.22
by Erin LeDell, Michal Kurka | November 12, 2018 H2O Release

Xia Release (H2O 3.22)There’s a new major release of H2O and it’s packed with new features and fixes! Among the big new features in this release, we introduce Isolation Forest to our portfolio of machine learning algorithms and integrates the XGBoost algorithm into our AutoML framework. The release is named after Zhihong Xia .Isolation ...

Read more
Anomaly Detection with Isolation Forests using H2O
by Martin Barus | November 06, 2018 Data Science , H2O-3

IntroductionAnomaly detection is a common data science problem where the goal is to identify odd or suspicious observations, events, or items in our data that might be indicative of some issues in our data collection process (such as broken sensors, typos in collected forms, etc.) or unexpected events like security breaches, server failu...

Read more
Launching the Academic Program … OR ... What Made My First Four Weeks at H2O.ai so Special!

We just launched the H2O.ai Academic Program at our sold-out H2O World London. With nearly 1000 people in attendance, we received the first online sign-up forms submitted by professors and students alike. This program will massively democratize AI in academia, increasing the number of AI-skilled graduates – with both technical and busine...

Read more
Welcome H2O.ai's Driverless AI Community!
by H2O.ai Team | October 30, 2018 Beginners , Community , H2O Driverless AI , H2O-3

I am very excited to announce the formation of the inaugural community for H2O Driverless AI users. The Driverless AI Community is open for anyone looking to engage with other users as well as experts from H2O.ai’s Driverless AI, Driverless AI is an award-winning automatic machine learning platform that does “AI to do AI” to solve re...

Read more
H2O for Inexperienced Users
by H2O.ai Team | August 24, 2018 Beginners , Data Science , H2O-3 , Machine Learning

Some background: I am a rising senior in highschool, and the summer of 2018, I interned at H2O.ai. With no ML experience beyond Andrew Ng’s Introduction to Machine Learning course on Coursera and a couple of his deep learning courses, I initially found myself slightly overwhelmed by the variety of new algorithms H2O has to offer in both ...

Read more
The different flavors of AutoML
by Erin LeDell | August 15, 2018 AutoML , Data Science , H2O Driverless AI , H2O-3

In recent years, the demand for machine learning experts has outpaced the supply, despite the surge of people entering the field. To address this gap, there have been big strides in the development of user-friendly machine learning software (e.g. H2O , scikit-learn , keras ). Although these tools have made it easy to train and evaluate ma...

Read more
H2O’s AutoML in Spark
by Jakub Hava | July 23, 2018 AutoML , Sparkling Water , Technical , Tutorials

This blog post demonstrates how H2O’s powerful automatic machine learning can be used together with the Spark in Sparkling Water.We show the benefits of Spark & H2O integration, use Spark for data munging tasks and H2O for the modelling phase, where all these steps are wrapped inside a Spark Pipeline. The integration between Spark and...

Read more
H2O-3 on FfDL: Bringing deep learning and machine learning closer together
by Vinod Iyengar | June 25, 2018 Community , Deep Learning , H2O-3 , Technical

This post originally appeared in the IBM Developer blog here. This post is co-authored by Animesh Singh, Nicholas Png, Tommy Li, and Vinod Iyengar. Deep learning frameworks like TensorFlow, PyTorch, Caffe, MXNet, and Chainer have reduced the effort and skills needed to train and use deep learning models. But for AI developers and data ...

Read more
H2O + Kubeflow/Kubernetes How-To
by H2O.ai Team | March 29, 2018 H2O-3

Today, we are introducing a walkthrough on how to deploy H2O 3 on Kubeflow. Kubeflow is an open source project led by Google that sits on top of the Kubernetes engine. It is designed to alleviate some of the more tedious tasks associated with machine learning. Kubeflow helps orchestrate deployment of apps through the full cycle of devel...

Read more
Sparkling Water 2.2.10 is now available!
by H2O.ai Team | March 22, 2018 AutoML , Sparkling Water

Hi Makers! There are several new features in the latest Sparkling Water. The major new addition is that we now publish Sparkling Water documentation as a website which is available here . This link is for Spark 2.2. We have also documented and fixed a few issues with LDAP on Sparkling Water. Exact steps are provided in the documentation...

Read more
Congratulations - H2O is a leader in the Gartner Magic Quadrant for Data Science and Machine Learning Platforms
by H2O.ai Team | February 25, 2018 Community , Customers , Gartner , H2O-3

Congratulations – Thanks to the support of our customer community over the past years, H2O.ai is a leader and one with the most completeness of vision in Gartner Magic Quadrant for Data Science and Machine Learning Platforms. It is an ecosystem we dedicated a good part of this decade to open up and spring. This is testimony to the incr...

Read more
New features in H2O 3.18
by H2O.ai Team | February 22, 2018 AutoML , Ensembles , H2O Release , XGBoost

Wolpert Release (H2O 3.18)There’s a new major release of H2O and it’s packed with new features and fixes! We named this release after David Wolpert , who is famous for inventing Stacking (aka Stacked Ensembles ). Stacking is a central component in H2O AutoML , so we’re very grateful for his contributions to machine learning! He is also fa...

Read more
New versions of H2O-3 and Sparkling Water available
by H2O.ai Team | December 02, 2017 H2O Release , Sparkling Water

Dear H2O Community, #H2OWorld is on Monday and we can’t wait to see you there! We’ll also be live streaming the event starting at 9:25am PST. Explore the agenda here . Today we’re excited to share that new versions of H2O-3 and Sparkling Water are available. We invite you to download them here: http://www.h2o.ai/download/ H2O-3.16 – MO...

Read more
Driverless AI Blog
by H2O.ai Team | July 13, 2017 AutoML , GPU , H2O Driverless AI

In today’s market, there aren’t enough data scientists to satisfy the growing demand for people in the field. With many companies moving towards automating processes across their businesses (everything from HR to Marketing), companies are forced to compete for the best data science talent to meet their needs. A report by McKinsey says th...

Read more
Scalable Automatic Machine Learning: Introducing H2O's AutoML
by H2O.ai Team | June 21, 2017 AutoML , Ensembles , H2O Release , Technical

Prepared by: Erin LeDell, Navdeep Gill & Ray Peck In recent years, the demand for machine learning experts has outpaced the supply, despite the surge of people entering the field. To address this gap, there have been big strides in the development of user-friendly machine learning software that can be used by non-experts and experts...

Read more
Stacked Ensembles and Word2Vec now available in H2O!

Prepared by: Erin LeDell and Navdeep Gill MathJax.Hub.Config({ tex2jax: {inlineMath: [['$','$'], ['\\(','\\)']]} }); Stacked Ensembles ensemble <- h2o.stackedEnsemble(x = x, y = y, training_frame = train, base_models = my_models) Python:ensemble = H2OStackedEnsembleEstimator(base_models=my_models) ensemble.train(x=x, y=y, training...

Read more
What is new in Sparkling Water 2.0.3 Release?
by H2O.ai Team | January 05, 2017 Community , H2O Release , Sparkling Water

This release has H2O core – 3.10.1.2Important Feature:This architectural change allows to connect to existing h2o cluster from sparkling water. This has a benefit that we are no longer affected by Spark killing it’s executors thus we should have more stable solution in environment with lots of h2o/spark node. We are working on article on ...

Read more
What is new in H2O latest release 3.10.2.1 (Tutte) ?
by H2O.ai Team | December 23, 2016 Community , H2O Release

Today we released H2O version 3.10.2.1 (Tutte). It’s available on our Downloads page, and release notes can be found here . Photo Credit: https://en.wikipedia.org/wiki/W._T._Tutte Top enhancements in this release: GLM MOJO Support: GLM now supports our smaller, faster, more efficient MOJO (Model ObJect, Optimized) format for model pu...

Read more

ERROR