SWE-GPT(SWESynInfer): SoftWare Engineering Process Data Synthesis and Inference Workflow for SWE-GPT

Overview

SWE-GPT: is an open-source large language model specifically designed for software improvement. Built upon the foundation of the Qwen/llama series base models, SWE-GPT has undergone additional training using software engineering development process data to enhance its capabilities in solving complex software engineering tasks.

SWESynInfer: three-stage software engineering process data synthesis and inference workflow. This workflow extends the publicly available AutoCodeRover framework. AutoCodeRover provides baseline processes for context retrieval and patch generation stages, our work further introduces crucial enhancements to more accurately simulate the cognitive processes of expert developers.

Model Introduction

SWE-GPT is a specialized model that focuses on addressing the unique challenges faced in software engineering. By leveraging the robust capabilities of the Qwen base models and incorporating domain-specific knowledge, this model aims to provide intelligent assistance across various aspects of software development.

Model Performance

SWE-GPT has demonstrated impressive performance in software engineering tasks:

🌟 Achieved a 30.2% solution rate on the authoritative SWE-bench Verified leaderboard for software engineering intelligent agents.
🌟 Achieved a 51.16% fault location success rate on SWE-bench Verified.
👑 Outperforms other open-source models of similar scale in software engineering-specific tasks (a 22.76% increase compared to Llama 3.1 405B).

Quick Start

Setup

First, create a virtual environment and install the required dependencies.

cd SWESynInfer
conda env create -f environment.yml
conda activate swesyninfer

# Set repo_path in setup_map.json (SWESynInfer/SWE-bench/setup_result/setup_map.json) to the local path
python scripts/1_change_testbed_path.py YOUR_ABSOLUTE_PATH/SWESynInfer/SWE-bench/repos/testbed

Model download and deployment

export VLLM_USE_MODELSCOPE=True
export CUDA_VISIBLE_DEVICES=0,1,2,3

python -m vllm.entrypoints.openai.api_server \
    --gpu-memory-utilization 0.95 \
    --served-model-name SWE-GPT \
    --model Anonymous_Model\
    --tensor-parallel-size 4 \
    --max-model-len 131072 \
    --trust-remote-code \
    --rope-scaling '{"type": "yarn", "factor": 4.0, "original_max_position_embeddings": 32768}'

# test for deployment success
conda activate swesyninfer
python scripts/2_call_vllm.py

Please note that to avoid identity leakage, we replaced the actual model name with Anonymous_Model(not available yet), which we will replace with the actual model name after the review is finalized.

Now You can run SWE-GPT on SWE-bench

python scripts/run.py conf/vanilla-lite-swebench.conf -f

Evaluation on SWE-bench

We recommend using SWE-bench docker directly for evaluation. Refer to the SWE-bench repository for more details.

Note: we have built-in testbed examples. After the review is completed, we will upload the entire testbed (about 130G)

Acknowledgments

We would like to thank the SWE-bench, AutoCodeRover, and Agentless teams for their foundational work, which played an important role in the development of SWESynInfer.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
SWE-bench		SWE-bench
app		app
conf		conf
scripts		scripts
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SWE-GPT(SWESynInfer): SoftWare Engineering Process Data Synthesis and Inference Workflow for SWE-GPT

Overview

Model Introduction

Model Performance

Quick Start

Setup

Model download and deployment

Now You can run SWE-GPT on SWE-bench

Evaluation on SWE-bench

Note: we have built-in testbed examples. After the review is completed, we will upload the entire testbed (about 130G)

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SWE-GPT(SWESynInfer): SoftWare Engineering Process Data Synthesis and Inference Workflow for SWE-GPT

Overview

Model Introduction

Model Performance

Quick Start

Setup

Model download and deployment

Now You can run SWE-GPT on SWE-bench

Evaluation on SWE-bench

Note: we have built-in testbed examples. After the review is completed, we will upload the entire testbed (about 130G)

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages