AI/ML Challenge 2020

Registering for the Challenge

The first task is to register your team for the challenge. To do that, follow the instructions on the challenge website.

Example EULA Documents

EULA ("yoola") documents are typically received as Microsoft Word or PDF documents. Examples of these are available in the Reference section.

Training Data For The Challenge

We are providing the initial set of training data thas been extracted from actual EULA documents and had identifying information, such as company name, removed. We may provide updates to the training data during the challenge, and list them in the table below.

Data set	Description	Date published
Training Data Set 1 v1	Initial batch of data. Known issues: clause text contains control characters, such as embedded Line Feed (/n) characters.	7/6/2020

Format of training data:

Clause ID	Clause Text	Classification
Integer generated for tracking individual clauses.	Section or paragraph of a EULA document that has been reviewed for acceptability to GSA.	Indication if this clause is acceptable (0) or unacceptable (1) to GSA. This is the indicator we are asking teams to help us predict

Submitting Your Solution

Below are the instructions for your Github submission. Everything must be submitted prior to the challenge deadline of August 20th, 2020. For more information, please visit the challenge website.

General Instructions:

Fork this repository to your GitHub account.
Build your solution according to the Submission Requirements below and commit it to your fork.
When you are ready to submit your solution, create a pull request from your fork to this repository (GSA/ai-ml-challenge-2020; base: master).
Name your pull request “TeamName Submission” and feel free to write a short description of your submission. Make sure to uncheck “Allow edits by maintainers” before creating your pull request.

Submission Requirements:

Create a folder in the “submissions” folder of your forked repository that contains all components of your solution and name it “YourTeamName_Submission.”
Within your solution folder, upload all relevant files (Add file > Upload files) to your forked repository according to the Submission Details below.

Submission Details:

Validation Data File (CSV document)

Name of the file: “TeamName Validation Data File”
Includes classification of clauses contained in Validation File, along with confidence scores.
- Clause ID
- Prediction: 0 - acceptable, 1 - unacceptable
- Probability Acceptable (percent)

Description of Methods Document (PDF, MS Word, or Jupyter Notebook document)
IMPORTANT: DO NOT INCLUDE ANY SENSITIVE INFORMATION IN THIS FILE.

Name of the file: “TeamName Description of Methods”
Provides a comprehensive description of the data, methods and software used to complete the solution.
Provides a demonstration of the process used to complete the model used in the solution, including data inputs and visualizations.
Clearly explains the reasons for predictions made in the Validation Data File submission.
Contains self-reported metrics of the solution, by providing:
- Brier score
- F1 score (also known as F-Measure)

Folder containing Source Code, Input Data, and Compiled Models
Note: if any of these are unavailable, explain the reason in the Description of Methods Document.

Name of the folder: “TeamName Code and Data” with subfolders named “TeamName Source Code,” “TeamName Input Data,” and “TeamName Compiled Models”
“TeamName Source Code” contains all source code used in the creation of the solution.
“TeamName Input Data” contains all input data used in the creation of the solution.
“TeamName Compiled Models” contains all compiled versions of models used in the solution.

Additional Reference Information

Several additional references have beeen provided that teams may optionally use for the challenge. For more information, view the Reference section of this repository.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI/ML Challenge 2020

Registering for the Challenge

Example EULA Documents

Training Data For The Challenge

Submitting Your Solution

General Instructions:

Submission Requirements:

Submission Details:

Additional Reference Information

About

Releases

Packages

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
data		data
reference		reference
submissions		submissions
README.md		README.md

jbax3/ai-ml-challenge-2020

Folders and files

Latest commit

History

Repository files navigation

AI/ML Challenge 2020

Registering for the Challenge

Example EULA Documents

Training Data For The Challenge

Submitting Your Solution

General Instructions:

Submission Requirements:

Submission Details:

Additional Reference Information

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages