Defense Media Network

DARPA and the Exploration of Artificial Intelligence

Another approach to machine learning relies on cues from the environment to reinforce good behavior and suppress bad behavior. For example, the program AlphaGo Zero can teach itself to play the board game Go at championship levels without any human input other than the rules of the game. It starts by playing against itself, making random moves. It uses the rules of the game to score its results, and those scores reinforce winning tactics. This so-called reinforcement learning can be highly effective in situations where there are clear rewards for effective behavior. In many real-world situations, however, it can be difficult to determine which behavior actually produced the desired result.
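To make the idea concrete, here is a minimal sketch of self-play reinforcement learning, not AlphaGo Zero itself (which couples self-play with a deep neural network and Monte Carlo tree search), applied to the toy game of Nim: players alternately remove one or two sticks from a pile, and whoever takes the last stick wins. The game, the value table, and the training settings below are illustrative assumptions chosen to keep the example short.

```python
# Minimal self-play reinforcement learning on Nim (illustrative only).
# The program starts with no knowledge beyond the rules, plays against itself,
# and uses each game's outcome to reinforce moves that led to wins.
import random
from collections import defaultdict

random.seed(0)
ACTIONS = (1, 2)                      # a player may remove 1 or 2 sticks
value = defaultdict(float)            # value[(sticks_left, action)] from the mover's view
EPSILON, LEARNING_RATE = 0.2, 0.1     # exploration rate and update step size

def choose(sticks):
    """Pick a legal move: usually the best-valued one, occasionally a random one."""
    legal = [a for a in ACTIONS if a <= sticks]
    if random.random() < EPSILON:
        return random.choice(legal)
    return max(legal, key=lambda a: value[(sticks, a)])

def self_play_game(start=7):
    """Play one game against itself; return each player's moves and the winner."""
    sticks, player = start, 0
    history = {0: [], 1: []}
    while True:
        action = choose(sticks)
        history[player].append((sticks, action))
        sticks -= action
        if sticks == 0:
            return history, player     # the mover who took the last stick wins
        player = 1 - player

def train(games=20000):
    for _ in range(games):
        history, winner = self_play_game()
        for player, moves in history.items():
            outcome = 1.0 if player == winner else -1.0
            for state_action in moves:  # reinforce winning moves, suppress losing ones
                value[state_action] += LEARNING_RATE * (outcome - value[state_action])

train()
# From 7 sticks the known winning move is to take 1 (leaving a multiple of 3),
# so value[(7, 1)] should end up clearly higher than value[(7, 2)].
print(value[(7, 1)], value[(7, 2)])
```

The same feedback loop, scaled up enormously and paired with deep neural networks, is what lets AlphaGo Zero turn nothing but the rules of Go into championship-level play.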

As AI increasingly makes its way into industrial settings and consumer products, companies are discovering that its substantial benefits come with costs, in the form of engineering complexity and unique requirements for ongoing maintenance. The computational intensity of AI systems requires racks of servers and networking gear, which must be secured against cyber intrusions and continuously monitored for them. The insatiable appetite of these systems for data often makes them dependent on many different enterprise databases, which requires ever-increasing coordination of operations across the organization. And finally, machine-learning systems must be continually retrained to keep them in sync with the world as it changes and evolves. This retraining requires a team of skilled data scientists and AI engineers, who tend to be in short supply.

Ultimately, people are still far more effective learners than machines. We can learn from teachers, books, observation, and experience. We can quickly apply what we’ve learned to new situations, and we learn constantly in daily life. We can also explain our actions, which can be quite helpful during the learning process. In contrast, deep learning systems do all their learning in a training phase, which must be complete before they can reliably recognize things in the world. Trying to learn while doing can cause catastrophic forgetting, as backpropagation makes wholesale changes to the link weights between the nodes of the neural network. DARPA’s Lifelong Learning Machines program is exploring ways to enable machines to learn while doing without catastrophic forgetting. Such a capability would enable systems to improve on the fly, recover from surprises, and avoid drifting out of sync with the world.
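The mechanics behind catastrophic forgetting show up even in the smallest possible model. The sketch below is purely illustrative and is not drawn from the Lifelong Learning Machines program: a single logistic unit is trained by gradient descent on one task and then on a second, deliberately conflicting task, and the updates that teach task B overwrite the weights that encoded task A.

```python
# Catastrophic forgetting in miniature (illustrative only): training on task B
# alone erases what a single logistic unit learned about task A.
import numpy as np

rng = np.random.default_rng(0)

def make_task(flip):
    """2-D points labeled by the sign of the first coordinate (flipped for task B)."""
    x = rng.normal(size=(500, 2))
    y = (x[:, 0] > 0).astype(float)
    return x, (1.0 - y) if flip else y

def train(w, b, x, y, steps=2000, lr=0.1):
    """Plain gradient descent on the cross-entropy loss of a logistic unit."""
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(x @ w + b)))    # sigmoid prediction
        grad_w = x.T @ (p - y) / len(y)           # gradient w.r.t. weights
        grad_b = np.mean(p - y)                   # gradient w.r.t. bias
        w, b = w - lr * grad_w, b - lr * grad_b
    return w, b

def accuracy(w, b, x, y):
    return np.mean(((x @ w + b) > 0) == (y > 0.5))

xa, ya = make_task(flip=False)   # task A
xb, yb = make_task(flip=True)    # task B directly conflicts with task A

w, b = np.zeros(2), 0.0
w, b = train(w, b, xa, ya)
print("task A accuracy after learning A:", accuracy(w, b, xa, ya))   # close to 1.0

w, b = train(w, b, xb, yb)       # keep training, but only on task B
print("task A accuracy after learning B:", accuracy(w, b, xa, ya))   # near 0: forgotten
```

Real deep networks forget less abruptly than this single unit, but the underlying cause is the same: backpropagation adjusts shared weights to fit whatever data is in front of it, with no regard for what those weights already encode.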

The knowledge of a trained neural network is contained in the thousands of weights on its links. This encoding prevents neural networks from explaining their results in any meaningful way. DARPA is currently running a program called Explainable AI to develop new machine-learning architectures that can produce accurate explanations of their decisions in a form that makes sense to humans. As AI algorithms become more widely used, reasonable self-explanation will help users understand how these systems work, and how much to trust them in various situations.

The real breakthrough for artificial intelligence will come when researchers figure out a way to learn or otherwise acquire common sense. Without common sense, AI systems will be powerful but limited tools that require human inputs to function. With common sense, an AI could become a partner in problem-solving. Current AI systems seem superhuman because they can do complex reasoning quickly in narrow specialties. This creates an illusion that they are smarter and more capable than they really are. For example, you can use an internet search engine to find pictures of cats sitting on suitcases. However, no current AI can use such a picture to determine whether the cat will fit in the suitcase. To the AI, the thing we recognize as a furry animal that purrs, uses a litter box, and ruins the single most expensive piece of furniture in the house with its needle-sharp claws is just a fuzzy two-dimensional texture. The AI has no conception of a three-dimensional cat.

What would happen if somebody put a cat in a suitcase? The possibility of suffocation leaps to mind, because we learn while growing up to consider the likely consequences of our actions. We also know that a plush toy cat can go in an airtight suitcase with no ill effect, because we categorize things according to their properties, such as being alive, not just by their appearance (in this case, an inanimate toy that looks just like a real cat). No AI system today can do this type of reasoning, which draws on the immense amount of commonsense knowledge we accumulate over our lifetimes.

Commonsense knowledge is so pervasive in our lives that it can be hard to recognize. For example, most people could easily sort through pictures of furniture to find black couches with white legs. As an experiment, try to find such pictures on the internet. In 2018, at least, your search results will contain mostly pictures of black couches and white couches, with pictures of various other couches thrown in just in case. The color of the legs will be whatever is in fashion, because we don’t yet understand how to create AI systems that can figure out the parts of objects.

Once trained, current machine-learning systems no longer adapt to their environments. DARPA’s Lifelong Learning Machines program is researching ways to enable systems to learn from surprises and adapt to changes in their environments. The Assured Autonomy program is developing approaches to produce mathematical assurance that such systems will operate safely and predictably under a wide range of operating conditions. DARPA image

AI systems with common sense ultimately could become partners in problem-solving, rather than just tools. For example, in emergency situations, people tend to make snap decisions about the cause of the problem and ignore evidence that doesn’t support their point of view. The Three Mile Island accident began with a stuck-open relief valve that allowed cooling water to escape from the reactor’s primary cooling system. The heat of the reactor turned some of the remaining water to steam, and a rising water-level reading in the pressurizer convinced the operators that there was too much water in the system; they made the situation worse by overriding the automatic emergency cooling system. An AI that could understand control-room conversations and vet them against its own models of reactor operation might be able to suggest alternative possibilities before the human operators commit to a particular course of action. To act as a valued partner in such situations, the AI system will need sufficient common sense to know when to speak and what to say, which will require that it have a good idea of what each person in the control room knows. Interrupting to state the obvious would quickly result in its deactivation, particularly under stressful conditions.

DARPA is mounting a major initiative to create the next generation of AI technologies, building on its five decades of AI-technology creation to define and to shape what comes next. DARPA’s substantial AI R&D investments will increase to fund efforts in the following areas:

New Capabilities. DARPA programs routinely apply AI technologies to diverse problems, including real-time analysis of sophisticated cyber attacks, detection of fraudulent imagery, human language understanding, biomedical advances, and control of prosthetic limbs. DARPA will advance AI technologies to enable automation of complex business processes, such as the lengthy and painstaking accreditation of software systems required for aviation, critical infrastructure, and military systems. Automating this accreditation process with known AI and other technologies now appears possible, and would enable deployment of safer technologies in less time.

Robust AI. As noted above, the failure modes of AI technologies are poorly understood. The data used to train such systems can be corrupted. The software itself is vulnerable to cyber attacks. DARPA is working to address these shortfalls by developing new theoretical frameworks, backed by extensive experimental evidence, to fortify the AI tools we do develop.

High-Performance AI. In combination with large data sets and software libraries, improvements in computer performance over the last decade have enabled the success of machine learning. More performance at lower electrical power is essential to extend the use of AI to both data-center applications and tactical deployments. DARPA has demonstrated analog processing of AI algorithms that runs a thousand times faster, using a thousand times less power, than state-of-the-art digital processors. New research will investigate AI-specific hardware designs and address the inefficiency of machine learning by drastically reducing requirements for labeled training data.

Next-Generation AI. DARPA has taken the lead in pioneering research to develop the next generation of AI algorithms, which will transform computers from tools into problem-solving partners. New research will enable AI systems to acquire and reason with commonsense knowledge. DARPA R&D produced the first AI successes, such as expert systems and search utilities, and more recently has advanced machine-learning tools and hardware. DARPA is now creating the next wave of AI technologies that will enable the United States to maintain its technological edge in this critical area.


