GLIB

CNN-based visual understanding for detecting UI glitches in game Apps

GLIB: Towards Automated Test Oracle for Graphically-Rich Applications
Paper URL: https://arxiv.org/abs/2106.10507

Architecture

Code-based Generation

Requirements

On Ubuntu:

python(3.5.2)
pip(20.3.4)
pytorch(0.4.0)
cuda
docker
nvidia-docker

Sign up a docker account from dockerhub

Installation and Execution

Download dataset and model:

download the UI image dataset and unzip:

unzip data.zip

data/images:

data/images/Base : 132 screenshots of game1 & game2 with UI display issues from 466 test reports.
data/images/Code : 9,412 screenshots of game1 & game2 with UI display issues generated by our Code augmentation method.
data/images/Normal: 7,750 screenshots of game1 & game2 without UI display issues collected by randomly traversing the game scene.
data/images/Rule(F) : 7,750 screenshots of game1 & game2 with UI display issues generated by our Rule(F) augmentation method.
data/images/Rule(R) : 7,750 screenshots of game1 & game2 with UI display issues generated by our Rule(R) augmentation method.
data/images/testDataSet : 192 screenshots with UI display issues from 466 test reports(exclude game1 & game2).

data/data_csv:

data/data_csv/Base : dataset for baseline method.
data/data_csv/Code : dataset for our Code Augmentation method.
data/data_csv/Rule(F) : dataset for our Rule(F) Augmentation method.
data/data_csv/Rule(R) : dataset for our Rule(R) Augmentation method.
data/data_csv/Code_plus_Rule(F) : dataset for our Code&Rule(F) Augmentation method.
data/data_csv/Code_plus_Rule(R) : dataset for our Code&Rule(R) Augmentation method.
data/data_csv/testDataSet : test dataset(normal image and real glitch images from 466 test reports).

download the pre-trained model and unzip:

unzip model.zip

model/Base : pre-trained model for baseline method.
model/Code : pre-trained model for our Code Augmentation method.
model/Rule(F) : pre-trained model for our Rule(F) Augmentation method.
model/Rule(R) : pre-trained model for our Rule(R) Augmentation method.
model/Code_plus_Rule(F) : pre-trained model for our Code&Rule(F) Augmentation method.
model/Code_plus_Rule(R) : pre-trained model for our Code&Rule(R) Augmentation method.

Environment Setup

Method1: Very simple: pull a Docker image (Recommanded)

Step1: Login with your username and password

docker login

Step2: Pull source from our docker and start a container

docker pull qwertymj/glib:0.0.1
docker container run -it --gpus all qwertymj/glib:0.0.1 /bin/bash

Step3: Open another shell to check the running container ID

docker ps

Step4: Download dataset and model

Step5: Copy the container ID and push our dataset to the container

docker cp data [container ID]:/code/data
docker cp model [container ID]:/code/model
cd /code
pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple

Method2: A bit harder: build a Docker image

Step1: Clone repo

git clone --recursive https://github.com/GLIB-game/GLIB.git

Step2: Build docker image

cd GLIB
sudo docker image build -t qwertymj/glib:0.0.1 .

Step3: Start the container

docker container run -it --gpus all qwertymj/glib:0.0.1 /bin/bash

Step4: Download dataset and model

Step5: Copy the container ID and push our dataset to the container

docker cp data [container ID]:/code/data
docker cp model [container ID]:/code/model
cd /code
pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple

Method3: Hard: set up the environment manually (Not Recommanded)

First you should make sure your linux system has installed cuda(9.0.176) and cudnn(7.4.2)

Step1: Install dependencies

apt-get update && \
apt-get install -y wget \
gnupg \
apt-transport-https \
tzdata \
net-tools \
dnsutils \
iproute2 \
gcc \
tmux \
htop \
git \
vim \
sudo \
cmake \
libgl1-mesa-glx \
libglib2.0-0 \
openssh-server

Step2: Build python virtual environment

conda create -n python3.5 python=3.5.2
conda activate python3.5

Step3: Clone the GLIB repository

git clone --recursive https://github.com/GLIB-game/GLIB.git

Step4: Install python dependencies

cd GLIB
pip install --upgrade pip
pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple

Train the CNN model

Training from scratch:

python train.py --train_data train_file_path --eval_data eval_file_path --augType Type

Example:

python train.py --train_data data/data_csv/Code/Code_train.csv --eval_data data/data_csv/Code/Code_test.csv --augType Code

Training from the pre-trained model:

python train.py --train_data train_file_path --eval_data eval_file_path --augType Type --model_path model_path

Example:

python train.py --train_data data/data_csv/Code/Code_train.csv --eval_data data/data_csv/Code/Code_test.csv --augType Code --model_path model/Code/Code.pkl

Evaluate the model

python test.py --test_data test_data_path --model model_path

Example:

python test.py --test_data data/data_csv/testDataSet/testData_test.csv --model model/Code/Code.pkl

Generate saliency map

python saliencymap.py --test_data test_data_path --model model_path

Example:

python saliencymap.py --test_data data/data_csv/testDataSet/testData_test.csv --model model/Code/Code.pkl

Configuration

Changing hyper-parameters is possible by editing the file config.py

config.EPOCH:

The max number of epochs to train the model. Stopping earlier must be done manually (kill).

config.TRAIN_BATCH_SIZE:

Batch size in training.

config.SAVE_STEP:

After how many training steps a model should be saved.

config.EVAL_STEP:

After how many training steps the model test its performance on evaluation dataset.

config.LR:

The learning rate in training.

config.EVAL_BATCH_SIZE

Batch size in evaluation step.

config.TEST_BATCH_SIZE

Batch size in test step.

Supplementary explanation

The correlation between our self-defined code & rule approaches and corresponding UI glitches:

Reult for Practical Evaluation (RQ4)

	Code	Rule(R)	Rule(F)
PC	7	3	1
Android	35	22	15
iOS	11	6	5
total	53 (48 confirmed)	31 (28 confirmed)	21 (17 confirmed)

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
data		data
model		model
AutoTestFramework.png		AutoTestFramework.png
Dockerfile		Dockerfile
GLIB_architecture.png		GLIB_architecture.png
GameDataLoader.py		GameDataLoader.py
Method_2_UIglitch.png		Method_2_UIglitch.png
NNArch.py		NNArch.py
README.md		README.md
code_gen.png		code_gen.png
config.py		config.py
requirements.txt		requirements.txt
rule_gen.png		rule_gen.png
saliencymap.py		saliencymap.py
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GLIB

Architecture

Code-based Generation

Requirements

Installation and Execution

Download dataset and model:

Environment Setup

Method1: Very simple: pull a Docker image (Recommanded)

Method2: A bit harder: build a Docker image

Method3: Hard: set up the environment manually (Not Recommanded)

Train the CNN model

Evaluate the model

Generate saliency map

Configuration

config.EPOCH:

config.TRAIN_BATCH_SIZE:

config.SAVE_STEP:

config.EVAL_STEP:

config.LR:

config.EVAL_BATCH_SIZE

config.TEST_BATCH_SIZE

Supplementary explanation

The correlation between our self-defined code & rule approaches and corresponding UI glitches:

Reult for Practical Evaluation (RQ4)

AutoTest FrameWork

About

Uh oh!

Releases 4

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GLIB

Architecture

Code-based Generation

Requirements

Installation and Execution

Download dataset and model:

Environment Setup

Method1: Very simple: pull a Docker image (Recommanded)

Method2: A bit harder: build a Docker image

Method3: Hard: set up the environment manually (Not Recommanded)

Train the CNN model

Evaluate the model

Generate saliency map

Configuration

config.EPOCH:

config.TRAIN_BATCH_SIZE:

config.SAVE_STEP:

config.EVAL_STEP:

config.LR:

config.EVAL_BATCH_SIZE

config.TEST_BATCH_SIZE

Supplementary explanation

The correlation between our self-defined code & rule approaches and corresponding UI glitches:

Reult for Practical Evaluation (RQ4)

AutoTest FrameWork

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages