Skip to content

Latest commit

 

History

History
243 lines (180 loc) · 7.67 KB

README.md

File metadata and controls

243 lines (180 loc) · 7.67 KB

PyinderArtifact

DOI

This repository is for the implementation of Pyinder announced in the paper "Towards Effective Static Type-Error Detection for Python" in ASE 2024. Our tool, Pyinder, is a static type analysis tool for Python that is built on top of Pyre.

Towards Effective Static Type-Error Detection for Python: Accepted Version

Prerequisites

We recommend the following environment for running Pyinder:

  • OS: A Linux System with Docker support
  • Hardware: x86 CPU; 64GB RAM; 50GB Storage

(the minimum RAM requirement has been confirmed to be 32GB)

Before setting up the environment, please make sure that you have Docker installed.

docker --version
# Docker version 20.10.13, build a224086

Installation

Build Docker Image

Clone the repository:

git clone https://github.com/kupl/PyinderArtifact.git
cd PyinderArtifact

Then, build the Docker image:

docker build -t pyinder:1.0 .

In this step, Mypy, Pytype, and Pyright are installed, but Pyre is not installed due to the compatibility issue, such as linking, with Pyinder. You can install and run Pyre by following the instructions in the Install Pyre.

Run Docker Container

After building the Docker image, you can run the container:

# Run docker image
docker run --name pyinder-container --memory-reservation 32G -it pyinder:1.0

We recommend setting the memory reservation to 32GB for the container to fully run Pyinder on the large projects.

Build Pyinder

You can build Pyinder by following the instructions:

# Inside the dokcer container;
cd ~
cd Pyinder/source
make

Don't worry about the warning messages during the build process. However, if you see an error message dune: No such file or directory when running make, you need to type eval $(opam config env) before running make.

Kick the tires

Before running experiments on all projects, first run a simple test to check if the tools are working properly. If you want to run full experiments for reproducing the results in the paper, please refer to the EVALUATION.md file.

We provide an example instruction with the luigi project to run Pyinder and other tools. You can skip instructions for other tools if you only want to run Pyinder.

At first, clone the project and prepare the project for the analysis:

# Inside the dokcer container;
cd ~
cd configuration
python download_repo.py --project luigi
python setting_config.py --project luigi

Next, run the tools with the luigi project:

# Inside the dokcer container;

# It will take about 30 minutes 
# to run all tools with the luigi project.
cd ~
cd run
# Run tools with luigi projects
python pyinder_run.py -p luigi
python mypy_run.py -p luigi
python pytype_run.py -p luigi
python pyright_run.py -p luigi

Click to see the output

You can see the output of each tool in the console like this:

# luigi-1836 is analyzed... Finished process in 77.26377391815186 seconds.
# luigi-4 is analyzed... Finished process in 85.76695346832275 seconds.
# luigi-14 is analyzed... Finished process in 74.57044434547424 seconds.

The result of each tool is stored in the ~/result/<each-tool>/<luigi-proejct>/result.json directory (e.g., ~/result/pyinder/luigi-1836/result.json).


Next, post-process the results to collect the type errors and check the results:

# Inside the dokcer container;
cd ~
cd run

# Change the result log file to json file
python pyinder_change_json.py
python mypy_change_json.py
python pytype_change_json.py

# Filter out the type errors from the results
python filter_error.py

# Run cloc to check the per kloc results
cd ~
cd eval
python cloc.py

Click to see the output

When you run *_change_json.py, you can see the output that shows the success on luigi projects:

airflow-3831 is analyzed... Failed
...
luigi-1836 is analyzed... Done!
...
luigi-4 is analyzed... Done!
luigi-14 is analyzed... Done!
...
sympy-44 is analyzed... Failed

After running python filter_error.py, you can see the filtered results in the ~/result/<each-tool>/<luigi-project>/filter_error.json directory (e.g., ~/result/pyinder/luigi-1836/filter_error.json).

The command python cloc.py makes the ~/cloc directory that contains the results of cloc.


Finally, you can see the results by following these steps:

# Inside the dokcer container;
cd ~
cd eval
python check_alarm.py -p luigi # show the number of alarms by each tool
python check_correct.py -p luigi # show the number of detecting type errors by each tool
python check_time.py -p luigi # show the time taken by each tool

Click to see the output

Note: The results can be slightly different from the paper because the tools and typeshed can be updated.

The command python check_alarm.py -p luigi shows the number of alarms by each tool:

Project             Pyinder   Mypy      Pyre      Pytype    Pyright
luigi-1836          82        85        N/A       0         144
luigi-4             104       117       N/A       0         179
luigi-14            79        75        N/A       0         138
Total               265       277       0         0         461
Per 1k LOC          6.73      7.04      N/A       0.0       11.71

The command python check_correct.py -p luigi shows the number of detecting type errors by each tool:

Project             Pyinder   Mypy      Pyre      Pytype    Pyright
luigi-1836          O         O         E         X         O
luigi-4             X         X         E         X         X
luigi-14            O         X         E         X         O
Correct             2         1         0         0         2

The command python check_time.py -p luigi shows the time taken by each tool:

Project             Pyinder   Mypy      Pyre      Pytype    Pyright
luigi-1836          77.26     6.64      N/A       27.55     8.8
luigi-4             85.77     4.69      N/A       149.27    10.04
luigi-14            74.57     4.21      N/A       546.52    8.55
Total               237.6     15.54     0         723.34    27.39
Per 1k LOC          6.04      0.39      N/A       18.38     0.7

Then, you can see the results in the console or check the csv files in the ~/eval directory.

  • alarm_result.csv: the number of alarms by each tool
  • correct_result.csv: the number of detecting type errors by each tool
  • time_result.csv: the time taken by each tool

Reproducing the Results

If you want to reproduce the results in the paper, please refer to the EVALUATION.md file.

Code Structure

.
├── Dockerfile     # Dockerfile for building the environment
├── EVALUATION.md  # Full Evaluation instructions
├── LICENSE
├── Pyinder        # Pyinder source code
├── README.md      
├── configuration  # contains the configuration files and setting scripts
├── eval           # contains the evaluation scripts
└── run            # contains the scripts to run the tools

Contact

If you have any questions, please contact us at [email protected]