Dec 4, 2019 9 min read

In mod(els) we trust

What’s inside?

Imagine you just finished building the most amazing box in the world. You can ask it any question, and it will give you the answer. It comes in any color – so long as it’s black. How much do you think you could sell it for? Squillions right? Okay, there is a catch – two, if I’m honest: first, it won’t explain the reasoning behind its answers; and second, it can only give you a yes/no answers. Still worth squillions?

As it happens, amazing black boxes like these do actually exist: today’s improvements in machine learning and deep learning are tremendous; the sky’s the limit in terms of what we can achieve using the clever algorithms they contain, and which are constantly improving. Indeed, every year, and even every month, there are new additions to – and versions of – these algorithms that enable users to handle problems in new ways, employing seemingly infinite amounts of data, as well as their own, personal experiences.

But we’re only human. If we’re to use the outputs of a model/algorithm/machine to support our decisions, we need to trust it. If it recommends where to eat, that’s fairly easy to trust (fake reviews notwithstanding) as the stakes (or steaks) aren’t that high. If the algorithms support us in our day-to-day work, influencing the way we process information to make decisions that may impact security, safety and the business itself, then we’re going to need a bigger boat of trust. Much bigger. Our job, as the data scientists standing behind these models, is creating this trust within our data science toolbox. Allow me to explain.

Explainability

A huge part of trust in an answer is understanding the WHY behind it. Recently, there’s been a surge of interest in a new field of research and implementation called Explainability. The need to know why, to unravel the reasons behind the black box’s answers, is common around the world and across all industries. What’s more, it’s not only for end users; data scientists themselves need tools to understand the model, and debug it. The trouble is, models are complex beasts; understanding why they do what they do isn’t simple.

Back in the olden days (all of 10 years ago and beyond) models were mostly linear: each feature contributed a specific weight to the final answer of the model. So it was simple to understand that if, say, a ship’s age had a heavy weighting, and the ship was old, age would be a key factor in the model’s final decision on this ship. Such models were easily explained, but their power was limited because they didn’t account for the interaction between multiple features.

Decision Trees

These days, many machine learning models are based on so-called “decision trees”: a concoction of several features whose interaction produces a specific risk, or answer. Take, for example, the combination of the following three features:

had a detention (yes/no)
average speed
time spent in bad weather.

It may be that if the ship was detained, had an average speed of 15-20 knots, and spent more than 30% of its time in bad weather it would be considered high risk. This is like walking (or sailing) along a tree, where each decision about the value of the feature branches out into different paths (see image).

Decision Tree2 — Example of a Decision Tree

These are just three branches of a tree. Within which there could be several different walks, each of which lead to different outcomes. Confused? A full model can comprise around 100 such decision trees. Each ship will have a specific walk in each of the trees, and the final score, or decision of the model, will be a formula derived from all the scores it received in all the trees.

Final Score

Now let’s imagine that we want to explain the final risk score of a ship. The same feature can have a different impact on different trees. For example, the average speed of a vessel might mean everything to a tanker routinely operating in one area, and absolutely nothing for the same tanker if it were operating elsewhere. In other words, risk can be driven by different factors depending on the uniqueness of a situation, and there is no consistency in which an average speed of x knots adds 14.2 risk points. Doing so is practically impossible.

Luckily for us, it’s a global problem. Vast sums are being invested in seeking a solution. One answer found inspiration in Game Theory. Called SHAP, it stands for SHapley Additive exPlanations. I think it’s amazing!

The idea behind SHAP is that we would like to know how much each feature contributed to the final score, or more precisely, how much a feature contributed in moving the score of the ship from the average score of all ships. We treat ship scoring as a game, where the players are the features that participate in the model. (To learn more, read this).

Image 04-12-2019 at 09 56 — Screenshot of Windward Select, explaining the features affecting a vessel’s risk of having an accident

Explainability in and of itself, though, isn’t enough. Models can’t replace humans. Even if we choose to accept a model’s output and make a decision based on it, it is always our choice. Trust is achieved when the model and the user work in harmony. For that to happen, though, we need to provide the tools to understand the data behind the models. To be sure, models aren’t always right. But by empowering the user to make the final decision, he or she can verify the model’s output and complement it with their own expertise to help make the best possible decisions.

If you want to hear more about trust in machine learning models, and about how we use machine learning – and even deep learning – to predict the risk of a ship having an accident, come along to my next talk, at the MLConference, Berlin.

Yonit Hoffman is a Senior Data Scientist at Windward

Explore more

Windward

January 11, 2024

The Journey to Success: Celebrating Windward’s 200th Customer Milestone

Windward is proud to announce a significant milestone in our journey: reaching 200 customers. The success we have achieved for and with our ocean freight, trading and shipping, and government customers stems from our ability to provide innovative technology and services that produce AI-powered, actionable insights. Customers choose Windward and then stay with us for...

Navigating deception the impact of Russia invasion on the maritime ecosystem

Windward

March 9, 2023

Webinar: Russia’s Invasion Impact on Maritime Deception and Ecosystem

Windward’s webinar brought together a diverse group of experts to cut through the noise around the impact of Russia’s war on the diverse maritime ecosystem. Exclusive data from Windward’s Maritime AI™ platform, along with insights from the industry-expert guests, enabled the sharing of unique perspectives. The webinar was filled with actionable tactics and ideas that...

Windward

March 9, 2023

Navigating deception: the impact of Russia’s invasion on the maritime ecosystem

Risks & Compliance

February 12, 2023

Not all that transmit are legit!

Looking to stay up to date with the latest deceptive shipping practices?

Windward

January 8, 2023

2022: the year of maritime insights, 2023…

By Irit Singer, Chief Marketing Officer, Windward 2022 was the year that the maritime ecosystem collectively realized that there is simply too much raw data for manual parsing and the landscape changes too rapidly. The world seemed to go crazy: Coronavirus and closures…war, sanctions, and a price cap…new deceptive shipping practices…plummeting ocean freight prices and...

In mod(els) we trust

What’s inside?

Explainability

Decision Trees

Final Score

Yonit Hoffman is a Senior Data Scientist at Windward

Trending

Explore more

The Journey to Success: Celebrating Windward’s 200th Customer Milestone

Webinar: Russia’s Invasion Impact on Maritime Deception and Ecosystem

Navigating deception: the impact of Russia’s invasion on the maritime ecosystem

Not all that transmit are legit!

2022: the year of maritime insights, 2023…

Sign up to get the latest maritime news

In mod(els) we trust

What’s inside?

Explainability

Decision Trees

Final Score

Yonit Hoffman is a Senior Data Scientist at Windward

Everything you need to know about Maritime AI™ direct to your inbox

Trending

The Journey to Success: Celebrating Windward’s 200th Customer Milestone

Webinar: Russia’s Invasion Impact on Maritime Deception and Ecosystem

Navigating deception: the impact of Russia’s invasion on the maritime ecosystem

Not all that transmit are legit!

2022: the year of maritime insights, 2023…