May 1st, 2023 by inflectra
As we deploy AI and ML-based systems and applications is: how do we test them? With traditional software systems, humans write requirements, develop the system, and then have other humans (aided by computers) test them to ensure the results match. With AI/ML-developed systems, there often are no discrete requirements. Instead, there are large data sets, models, and feedback mechanisms. In this article, originally published on AI Business, we discuss the risks of letting machines check themselves and provide some potential solutions.
We are surrounded by risks every day, in every organization and with every activity we undertake. The field of risk management is a discipline devoted to trying to identify, quantify and manage these risks so that we as humans can live safe, secure and happy lives. Every time you fly in a plane, drive a car, swallow a pill or eat a salad, you are relying on the tireless work done every day by risk managers to make that activity safe whilst still economic.
Risk management is the understanding of events that might happen, regardless of how unlikely or remote. For example, when driving a car, there is the risk that a tire might blow out, the probability is low, but the impact if it happens, is quite high. Conversely, the risk that a headlight will stop working is relatively high, but the impact if that happens is moderate, but not zero (which is why you have two headlights). When assessing risks, you need to consider the combination of impact/severity and probability/likelihood, this is called the risk exposure and determines how much effort should be put into mitigating (reducing either its severity or probability).
Traditionally, the first step in risk management is to identify all the risks, and then quantity their probability and severity. This is done by humans brainstorming possible events that can happen, and using their previous industry knowledge to make an educated guess about the probability and severity. Now one of the major issues in this process is the fact that humans (especially in teams) are affected by biases such as confirmation bias, hindsight bias, and availability bias which mean that we tend to overestimate previously known risks and either ignore or downplay obscure edge cases that might be really serious.
Using Artificial Intelligence (AI) in the field of risk management offers the potential to address these issues in several ways. Firstly, when you have large datasets of real-word evidence, AI applications and Machine Learning (ML) models both can uncover hitherto unknown risks, so called “zero day” risks. For example, an AI system that is monitoring unrelated data feeds for weather, hospital stays and traffic could identify a risk in a new automotive system (e.g., a new autopilot mode) that has been obscured from view. In addition to identifying new risks from large uncorrelated data sets, an AI system could take previously identified risks and be able to systematically quantify them for impact and probability using external environment data vs. human intuition. This would take a lot of the guesswork and variability out of classic risk management models.
Conversely, using AI and ML for safety-critical systems such as automotive or aerospace autopilots can itself introduce new risk into the system. When you build an AI/ML model, unlike a traditional algorithmic computer system, you are not defining the requirements and the coding steps to fulfill those requirements. Instead, you are providing a data set, some known rules and boundary conditions, and letting the model infer its own algorithms from the provided data.
That lets AI/ML systems create new methods that are not previously known to humans, and find correlations that are unique and breakthrough. However, these new insights are unproven, and may be only as good as the limited data-set they were based on. The risk is that you start using these models for production systems and they behave in unexpected and unpredictable ways. This is a classic example of a problem that needs independent risk analysis and risk management, either from a separate AI/ML model that can “check” the primary one, or from human experts who can validate the possible risks that could occur in the AI system.
So, introducing AI and ML into complex safety or mission-critical systems brings both benefits and challenges from a risk management perspective, a sensible framework for adopting AI/ML techniques recognizes these factors and attempts to maximize the benefits whilst mitigating the challenges.
As you begin to implement AI/ML models, making sure that you have a clear grasp of the business requirements, use cases, and boundary conditions (or constraints) is critical. By defining the limits of the data sets employed and the specific use cases that the model was trained on, will ensure that the model is only used to support activities that its original data set were representative for. In addition, having humans independent check the results predicted by the models is critical. This independent verification and validation (IV&V) will allow humans to ensure the AI models are being used appropriately, without introducing unnecessary risk into the system. For example, when deploying an autopilot to an automotive or aviation platform, such I&V will verify the plane or car behaves as expected, either in a simulator or real-world test environment.
Conversely, AI models can be used to check the risk management models and assessments made by humans to ensure they are not missing zero-day risks. In this case, the computer is acting as its own IV&V for the human decision-making process. This is a mirror image of the previous case. In this case, we’re taking advantage of machine learning models to avoid human biases and find flaws in our existing risk models.
In conclusion, instead of seeing AI as an alternative to classic risk management methods, we should see the inherent harmony between computers and humans checking each other’s assumptions and recommendations. Using a combination of manual risk assessment techniques and AI tools we can reduce the overall risk in systems while at the same time allowing unprecedented innovation and breakthroughs. If you look at many historical safety accidents (Three-mile island, Chernobyl, etc.) it is the human factor that often presents the most risk after all.
Ask an Inflectra expert:
And if you have any questions, please email or call us at +1 (202) 558-6885
SpiraTest combines test management, requirements traceability & bug-tracking
SpiraTeam brings your teams together, managing the entire application lifecycle
SpiraPlan lets you manage your programs and portfolio of projects like never before
Orchestrates your automated regression testing, functional, load and performance
The ultimate test automation platform for web, mobile, and desktop applications
The help desk system, designed specifically for software support teams
Cloud hosted, secure source code management - Git and Subversion
Exploratory testing capture tool that automatically records your testing activity
Let us deal with the IT pain so you don't have to. Or use on-premise if you prefer.
Our customers work in every industry imaginable. From financial services to healthcare and biotech to government and defense and more, we work with our customers to address their specific needs.
Our products do not enforce a methodology on you, instead they let you work your way. Whether you work in agile development, Scrum, XP, Kanban and Lean, Waterfall, hybrid, or Scaled Agile Inflectra can help.
If you want to learn more about application delivery, testing, and more take a look at our whitepapers, videos, background papers, blog, and presentations.
Our suite of Accelerators speed up your deployment and adoption of our products, increasing your return on investment and reducing the cost of ownership.
We collaborate with a wide range of organizations to bring our customers a range of services (including load testing, training, and consulting), complimentary technologies, and specialized tools for specific industries.
Learn how different organizations have benefited from using Inflectra products to manage their software testing and application develooment.
Outstanding support is the foundation of our company. We make support a priority over all other work. Take a look at our support policy.
Discover great tips, discussions, and technical solutions from fellow customers and Inflectra's technical experts.
If you can't find the answer you're looking for, please get in touch with us: over email, phone, or online.
We are constantly creating new videos to help customers learn about our products, including through in depth webinars, all freely available along with a wide selection of presentations.
We provide a number of resources to help customers learn how to get the most out of our products, with free online resources, virtual classrooms, and face to face.
Read about Inflectra, our manifesto, and values. Meet our incredible customers who are building awesome things, and our leadership team that are committed to building a great company.
The Inflectra Blog contains articles on all aspects of the software lifecycle.
In addition we have whitepapers,
background articles, videos and
presentations to help get you started.
Events are a big part of our awesome customer service. They are a chance to learn more about us, our products, and how to level up your skills with our tools.
We partner with educational institutions and individuals all over the world. We are also a great place to work and encourage you to explore joining our team.
Please contact us with your questions, feedback, comments, or suggestions. We'll get back to you as soon as possible.
When you need additional assistance (be it training, consulting, or integration services) our global certified solution provider partner network is ready to help.
At Inflectra, we are fully committed to provide our customers with the very best products and customer service. Check out some of our recent awards.
We want to help developers extend and customize our tools to fit in with their needs. We provide robust APIs, sample code, and open source projects.