EvolveUniTest : Evolving Prompt with Genetic Algorithm Yields Powerful Instruction for Unit Test Generation

[Slides] [Paper]

Contributors: Pawit Wangsuekul, Sorn Chottananurak, Hai-Nam V. Cao, Thanh-Long V. Le

Abstract

Unit testing is a crucial aspect of software engineering, demanding considerable time and effort. To address this challenge, various automated test generation tools have been developed, such as ChatUniTest — a ChatGPT-based system developed under the Generation-Validation-Repair framework. Despite its utility, ChatUniTest's performance is hampered by the reliance on manually crafted system prompts to initiate the test generation process. Drawing inspiration from recent researches in prompt evolution, we introduce EvolveUniTest, an adaptation of ChatUniTest with EvolvePrompt, a framework employing genetic algorithms for prompt evolution. EvolvePrompt initiates from a population of system prompts, including those manually designed for ChatUniTest, iteratively generating new prompts using a large language model and enhancing the population based on a development set. Leveraging the most optimized prompt from this evolutionary process, EvolveUniTest surpasses ChatUniTest in the quality and performance of generated unit tests. It achieves higher correctness percentages, increased branch and line coverage, and improved focal method coverage.

Related Work

Usage

Step 1: Installation

First make sure you run this program in Mac or Linux system with mysql installed.

Follow the instructions below to install the project:

Clone the project: git clone https://github.com/s6007541/EvolvePrompt.git
Enter the project directory: cd EvolvePrompt
Install the requirements: pip install -r requirements.txt

Step 2: Download the LLM

We use CodeLlama-7b-Instruct for unit test generation in this project. Follow the instruction given in the offical CodeLlama repository to download it.

Step 3: Configuration

The configuration files are provided at .config/config_evoprompt.ini and .config/config_chatunitest.ini.

You need to alter few options:

project_dir: path to compiled Java project. (The path must be in English)
model_path, tokenizer_path
api_keys
host, port, database, user, password
GRAMMAR_FILE: tree-sitter java grammar file.

The options are explained as follows:

[DEFAULT]
test_number = 3 # The number of attempts to generate for each focal method.
process_number = 32 # The number of processes to use when generating tests.
dataset_dir = ../dataset/ # Dataset directory, no need to change.
result_dir = ../result/ # Result directory, no need to change.
project_dir = ../Chart/ # compiled Java project directory.
max_rounds = 2 # The maximum number of rounds to generate one test. One round for generation, 5 rounds for repairing the test.
TIMEOUT = 30 # The timeout for each test.
MAX_PROMPT_TOKENS = 3072 # The maximum number of tokens for each prompt.
MIN_ERROR_TOKENS = 500 # The minimum number of tokens for each error prompt.
PROMPT_TEMPLATE_NO_DEPS = d1_4.jinja2 # The prompt template for the method with no dependencies.
PROMPT_TEMPLATE_DEPS = d3_4.jinja2 # The prompt template for the method with dependencies.
PROMPT_TEMPLATE_ERROR = error_3.jinja2 # The prompt template for repairing the test.

LANGUAGE = "java"
GRAMMAR_FILE = ./dependencies/java-grammar.so
COBERTURA_DIR = ./dependencies/cobertura-2.1.1
JUNIT_JAR = ./dependencies/lib/junit-platform-console-standalone-1.9.2.jar
MOCKITO_JAR = ./dependencies/lib/mockito-core-3.12.4.jar:./dependencies/lib/mockito-inline-3.12.4.jar:./dependencies/lib/mockito-junit-jupiter-3.12.4.jar:./dependencies/lib/byte-buddy-1.14.4.jar:./dependencies/lib/byte-buddy-agent-1.14.4.jar:./dependencies/lib/objenesis-3.3.jar
LOG4J_JAR = ./dependencies/lib/slf4j-api-1.7.5.jar:./dependencies/lib/slf4j-log4j12-1.7.12.jar:./dependencies/lib/log4j-1.2.17.jar
JACOCO_AGENT = ./dependencies/jacoco/jacocoagent.jar
JACOCO_CLI = ./dependencies/jacoco/jacococli.jar
REPORT_FORMAT = xml # The coverage report format.

[llm]
model_path = path/to/model # The path to the LLM (CodeLlama-7b-Instruct) that you downloaded
tokenizer_path = path/to/tokenizer # The path to the tokenizer of the LLM
max_seq_len = 2048 # Parameters for the LLM. See https://github.com/facebookresearch/codellama for more information
max_batch_size = 4
temperature = 0.2
top_p = 0.95
frequency_penalty = 0
presence_penalty = 0

[openai]
api_keys = [sk-xxx] # The OpenAI api keys, you can get them from https://platform.openai.com/account/api-keys
model = gpt-3.5-turbo # gpt-3.5-turbo or gpt-4
temperature = 0.5 # See https://platform.openai.com/docs/api-reference/chat/create
top_p = 0.95
frequency_penalty = 0
presence_penalty = 0


[database]
host = 127.0.0.1
port = 3306
database = xxxx # Database name
user = xxxx # User
password = xxxx # Password

Here are the steps to generate a .so syntax file for Java language using tree-sitter on Mac and Linux systems:

Install tree-sitter. You can find the installation guide on the GitHub repository of tree-sitter (https://github.com/tree-sitter/tree-sitter).

npm install tree-sitter-cli

Get the tree-sitter-java project, which is the Java language plugin for tree-sitter. You can find the source code on the GitHub repository of tree-sitter-java (https://github.com/tree-sitter/tree-sitter-java).

git clone [email protected]:tree-sitter/tree-sitter-java.git

After getting the tree-sitter-java project, you can use the following command to generate a .so file:

cd tree-sitter-java
gcc -o java-grammar.so -shared src/parser.c -I./src

Specify the GRAMMAR_FILE option in config.ini.

GRAMMAR_FILE = path/to/java-grammar.so

Step 4: Run

EvolveUniTest (Our Project)

First, run the prompt evolution part with the following steps.

Rename .config/config_evoprompt.ini to .config/config.ini
Enter the source code directory: cd src
On one terminal, launch a flask server that hosts the LLM: torchrun server.py
On another terminal, run the Python script for prompt evolution: python evoprompt.py

Then, wait for the process to finish. The results, including the best prompt and all prompts in each generation of the prompt evolution process, are saved in prompt/evoprompt. Next, follow the steps below to run unit test generation.

Copy the prompt from prompt/evoprompt/generation_<NUM_GENERATIONS> to prompt/d1_4_system.jinja2 and prompt/d3_4_system.jinja2. Note that <NUM_GENERATIONS> is the number of generations in the prompt evolution task (default: 5). Backup the original prompt/d1_4_system.jinja2 and prompt/d3_4_system.jinja2 files.
Rename .config/config_chatunitest.ini to .config/config.ini
Enter the source code directory: cd src
On one terminal, launch a flask server that hosts the LLM: torchrun server.py (not necessary if it is already running).
On another terminal, run the Python script for unit test generation: python run.py

Wait until the process finishes. The result is saved in the result directory.

ChatUniTest Baseline

To run the ChatUniTest baseline with CodeLlama-7b-Instruct, follow the steps below.

If you run EvolveUniTest before, restore the original prompt/d1_4_system.jinja2 and prompt/d3_4_system.jinja2 files.
Rename .config/config_chatunitest.ini to .config/config.ini
Enter the source code directory: cd src
On one terminal, launch a flask server that hosts the LLM: torchrun server.py
On another terminal, run the Python script for unit test generation: python run.py

Structure

config

This directory stores the config files.

The config_evoprompt.ini and config_chatunitest.ini are for the prompt evolution task and the unit test generation task, respectively. Be sure to copy the corresponding file and rename it to config.ini when running each task.

dataset

This directory stores the dataset. Before generating unit tests for a new project, this dataset will be deleted and re-created automatically. So if you need the information inside the dataset directory, make sure to save a copy. The dataset directory includes direction_1, direction_3, and raw_data.

direction_1 contains the context without dependencies.
direction_3 contains the context with dependencies.
raw_data contains all the information about focal methods.

evolve_candidate

This directory stores the text files that contain the lists of candidate methods in each project that are selected as the development set for prompt evolution.

prompt

This directory stores the prompt templates. Prompts should be in the jinja2 template format. If you need to add a new prompt, follow these instructions:

Create a user prompt template: xxxx.jinja2.
If you need to create system prompt template, the format is xxxx_system.jinja2, the program will automatically find the system prompt template.
Ensure you've changed the template name in the configuration file.

This directory also contains the subdirectory evoprompt, which stores the result of prompt evolution.

result

The nested structure of the result directory is as follows:

scope_test + % + time + %
method_id + % + class_name + %d1
A number that denotes the different attempt, which contains all the files generated during the process, including:
1. steps_GPT_rounds.json: Raw response from the LLM.
2. steps_raw_rounds.json: The raw test extracted from the raw response, and the result of the validation process.
3. steps_imports_rounds.json: The test after import repairs, and the result of the validation process.
4. temp: Contains the latest error message or coverage result and a test java file.

src

This is the directory that stores the source code.

License

The project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
.config		.config
.vscode		.vscode
evolve_candidates		evolve_candidates
image		image
pdf_files		pdf_files
prompt		prompt
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
demo.gif		demo.gif
install_defects4j.sh		install_defects4j.sh
install_java.sh		install_java.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

EvolveUniTest : Evolving Prompt with Genetic Algorithm Yields Powerful Instruction for Unit Test Generation

Abstract

Related Work

Usage

Step 1: Installation

Step 2: Download the LLM

Step 3: Configuration

Step 4: Run

EvolveUniTest (Our Project)

ChatUniTest Baseline

Structure

config

dataset

evolve_candidate

prompt

result

src

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

License

s6007541/EvolveUniTest

Folders and files

Latest commit

History

Repository files navigation

EvolveUniTest : Evolving Prompt with Genetic Algorithm Yields Powerful Instruction for Unit Test Generation

Abstract

Related Work

Usage

Step 1: Installation

Step 2: Download the LLM

Step 3: Configuration

Step 4: Run

EvolveUniTest (Our Project)

ChatUniTest Baseline

Structure

config

dataset

evolve_candidate

prompt

result

src

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages