ArchCode

Official implementation of the paper "ArchCode: Incorporating Software Requirements in Code Generation with Large Language Models" (ACL 2024).

1. How to run it

1) prepare a Python virtual environment

conda create -n ArchCode python=3.10
conda activate ArchCode
pip install -r requirements.txt

2) run third-party packages

i. install `expand_langchain`

git submodule update --init --recursive
pip install -e third_party/expand_langchain

ii. run `CodeExecContainer`

In a new terminal, run the following command and keep it running.

source third_party/CodeExecContainer/run.sh

iii. run Ollama (for local LLM generation)

Refer to the Ollama documentation for more information.

docker run -d \
  -v ollama:/root/.ollama \
  -p 11434:11434 \
  --name ollama \
  ollama/ollama
docker exec -d ollama ollama pull ollama/llama3:8b-instruct-fp16

3) save API keys

In the root directory, copy api_keys_example.json to api_keys.json, then set your API keys in the new file.
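
The copy step can also be scripted. The following is a minimal sketch, assuming only that api_keys_example.json is a valid JSON file; the helper name `prepare_api_keys` is hypothetical and not part of this repository. It never overwrites an existing api_keys.json, so keys you have already filled in are preserved.

```python
import json
import shutil
from pathlib import Path


def prepare_api_keys(root="."):
    """Copy api_keys_example.json to api_keys.json if it is missing.

    Hypothetical helper, not part of the repository. Returns the parsed
    contents of api_keys.json so the caller can inspect them.
    """
    root = Path(root)
    example = root / "api_keys_example.json"
    target = root / "api_keys.json"

    # Only copy when the target is absent, so existing keys are kept.
    if not target.exists():
        shutil.copyfile(example, target)

    with target.open(encoding="utf-8") as f:
        return json.load(f)
```

After running it from the repository root, open api_keys.json and fill in your keys by hand.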

4) run the Python script

Refer to the Python Fire documentation for how to pass arguments.

i. generation

python run.py generator \
    --config_path=configs/llama3_8b-vanilla-fr.yaml \
    - run \
    - merge_json \
    - exit
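
The `- run - merge_json` form in the command above follows Python Fire's command chaining, where a lone `-` ends the arguments of one call and the next word is applied to the returned object. The sketch below illustrates the idea with a toy class. `ToyGenerator` and its method bodies are hypothetical stand-ins and say nothing about how run.py is actually implemented.

```python
class ToyGenerator:
    """Hypothetical stand-in used only to illustrate Fire chaining."""

    def __init__(self, config_path):
        self.config_path = config_path
        self.steps = []

    def run(self):
        self.steps.append("run")
        # Returning self lets Fire apply the next chained command to it.
        return self

    def merge_json(self):
        self.steps.append("merge_json")
        return self


def main():
    # Imported here so the class above can be used without Fire installed.
    import fire  # pip install fire

    # Expose the class under the sub-command name "generator".
    fire.Fire({"generator": ToyGenerator})
```

If a script calls `main()` as its entry point, then `python toy.py generator --config_path=some.yaml - run - merge_json` would construct the object, call `run()`, and then call `merge_json()` on the result.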

ii. evaluation

python run.py evaluator \
    --path=results/llama3_8b-archcode-fr/results_merged_1.json \
    --gt_key=passed \
    --filter_keys=[gen_tc_passed] \
    --filter_weights=[1] \
    - run \
    --k=[1,2,5,10] \
    --n=10
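
The `--k` and `--n` arguments resemble the usual pass@k setup, in which n candidates are sampled per problem and pass@k is reported for several values of k. This README does not state which estimator the evaluator uses. The sketch below assumes the standard unbiased pass@k estimator, and the function name `pass_at_k` is illustrative only.

```python
from math import comb


def pass_at_k(n, c, k):
    """Unbiased pass@k estimate for one problem.

    n: number of sampled candidates, c: number of correct candidates,
    k: number of candidates drawn. Assumed formula: 1 - C(n-c, k) / C(n, k).
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # Fewer than k incorrect candidates: any draw of k contains a correct one.
    if n - c < k:
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


# Example: 10 samples, 5 of them correct, for the k values used above.
scores = {k: pass_at_k(10, 5, k) for k in (1, 2, 5, 10)}
```

Per-problem values are typically averaged over the benchmark to obtain the reported score.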

iii. server

python run.py server \
    - run \
    --port=8080

The API server has the following endpoints:

/generate: POST request with a JSON body containing the following fields:
- nl_query: str
- llm_kwargs: dict
  - model_name: str = "gpt-4o-mini"
  - platform: str = "openai"
  - greedy_kwargs: dict
    - temperature: float = 0.0
    - top_p: float = 1.0
    - max_tokens: int = 2048
  - nucleus_kwargs: dict
    - temperature: float = 0.8
    - top_p: float = 1.0
    - max_tokens: int = 2048
- candidate_num: int = 10

To generate code for a natural language query, send a POST request to the /generate endpoint with a body such as:

{
  "nl_query": "Get the sum of two numbers",
  "llm_kwargs": {
    "model_name": "gpt-4o-mini",
    "platform": "openai",
    "greedy_kwargs": {
      "temperature": 0.0,
      "top_p": 1.0,
      "max_tokens": 2048
    },
    "nucleus_kwargs": {
      "temperature": 0.8,
      "top_p": 1.0,
      "max_tokens": 2048
    }
  },
  "candidate_num": 10
}
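
The same request can be sent from Python. This is a minimal sketch using only the standard library. It assumes the server was started on port 8080 on the local machine, as in the command above, and that the response body is JSON, which this README does not specify. The function names are hypothetical.

```python
import json
import urllib.request


def build_generate_payload(nl_query, candidate_num=10,
                           model_name="gpt-4o-mini", platform="openai"):
    """Build the /generate request body with the documented defaults."""
    return {
        "nl_query": nl_query,
        "llm_kwargs": {
            "model_name": model_name,
            "platform": platform,
            "greedy_kwargs": {"temperature": 0.0, "top_p": 1.0, "max_tokens": 2048},
            "nucleus_kwargs": {"temperature": 0.8, "top_p": 1.0, "max_tokens": 2048},
        },
        "candidate_num": candidate_num,
    }


def build_generate_request(payload, base_url="http://localhost:8080"):
    """Wrap the payload in a POST request (base_url is an assumption)."""
    return urllib.request.Request(
        url=f"{base_url}/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def post_generate(payload, base_url="http://localhost:8080", timeout=600):
    """Send the request; assumes the server replies with a JSON body."""
    request = build_generate_request(payload, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, `post_generate(build_generate_payload("Get the sum of two numbers"))` sends the request shown above.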

To generate code through a direct function call instead of an HTTP request, use the generate function:

python run.py server \
    - generate \
    --nl_query="Get the sum of two numbers" \
    --llm_kwargs='{"model_name": "gpt-4o-mini", "platform": "openai", "greedy_kwargs": {"temperature": 0.0, "top_p": 1.0, "max_tokens": 2048}, "nucleus_kwargs": {"temperature": 0.8, "top_p": 1.0, "max_tokens": 2048}}' \
    --candidate_num=10
