The 20-Task bAbI Question-Answering Dataset v1.2

A dataset for question answering and text understanding in both Hindi and English

bAbI-QA is a dataset for question answering and text understanding. It consists of a set of contexts, each with multiple question-answer pairs based on that context. The dataset is available in both English and Hindi and is divided into 20 tasks:

Task 1: Single Supporting Fact

Task 2: Two Supporting Facts

Task 3: Three Supporting Facts

Task 4: Two Argument Relations

Task 5: Three Argument Relations

Task 6: Yes/No Questions

Task 7: Counting

Task 8: Lists/Sets

Task 9: Simple Negation

Task 10: Indefinite Knowledge

Task 11: Basic Coreference

Task 12: Conjunction

Task 13: Compound Coreference

Task 14: Time Reasoning

Task 15: Basic Deduction

Task 16: Basic Induction

Task 17: Positional Reasoning

Task 18: Size Reasoning

Task 19: Path Finding

Task 20: Agent’s Motivations

The "ContentElements" field offers three options: "TrainingData", "TestData" and "Dataset". The first two provide data formatted for common training tasks and are extracted from the English 10k version of the dataset.

The full "Dataset" element contains more information, including the Hindi version of the dataset.

(6 columns, 161516 rows)
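
The three content elements named above can be requested by passing the element name as the second argument to ResourceData. The following is a minimal sketch rather than the page's own code; the variable names (name, train, test, full) are illustrative, and the resource title is taken from the citation on this page.

```wl
(* Resource title as given in the bibliographic citation *)
name = "The 20-Task bAbI Question-Answering Dataset v1.2";

(* List the content elements exposed by the resource *)
ResourceObject[name]["ContentElements"]

(* Request each element by name; the trailing semicolons suppress the large outputs *)
train = ResourceData[name, "TrainingData"];
test = ResourceData[name, "TestData"];
full = ResourceData[name, "Dataset"];
```

Retrieving the data requires network access the first time; later calls typically use the locally cached copy.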

Examples

Basic Examples

Retrieve the ResourceObject:

In[1]:=

Out[1]=

View the data:

In[2]:=

Out[2]=
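
The two steps above can be sketched as follows, assuming the resource is addressed by the title given in the citation. The variable names ro and data are illustrative, and which content element ResourceData returns by default is not stated here.

```wl
(* Retrieve the resource object by its title *)
ro = ResourceObject["The 20-Task bAbI Question-Answering Dataset v1.2"]

(* Fetch the resource's default content element *)
data = ResourceData[ro]
```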

Analysis

Select the Hindi-language subset:

In[3]:=

Out[3]=
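
One possible way to make this selection is sketched below. It assumes the full "Dataset" element has a column holding the language of each row; the column name "Language" and the value "Hindi" are hypothetical placeholders, not confirmed by this page, so the column keys are inspected first.

```wl
(* Load the full dataset, which includes the Hindi version *)
full = ResourceData["The 20-Task bAbI Question-Answering Dataset v1.2", "Dataset"];

(* Inspect the column names of the first row before filtering *)
Keys[Normal[full[1]]]

(* ASSUMPTION: "Language" and "Hindi" are placeholders; substitute the real key and value *)
hindi = Select[full, #Language == "Hindi" &]
```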

Obtain the training data:

In[4]:=

Out[4]=

Select the fifth example from "Task1":

In[5]:=

Out[5]=

Bibliographic Citation

Wolfram Research, "The 20-Task bAbI Question-Answering Dataset v1.2" from the Wolfram Data Repository (2017) https://doi.org/10.24097/wolfram.19270.data

License Information

Creative Commons Attribution 3.0 Unported (CC BY 3.0)

Data Resource History

Date Created: 22 February 2017

Source Metadata

Title: Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks
Creator: Jason Weston et al.
Publisher: arXiv:1502.05698
Date: 2015
Language: English, Hindi
Source: https://research.fb.com/projects/babi

Publisher Information

Publisher of Record: Wolfram Research