hindbr/BugReportCrawler

BugReportCrawler

Bug URL Setting

List the bug report XML URLs to crawl in bug_item_file.txt, one per line:

https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=1
https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=2
https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=3
https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=4
https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=5
https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=6
https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=7
https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=8
https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=9
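
For a long range of bug IDs, the file can be generated rather than typed by hand. The following is a minimal sketch, assuming the file only needs one URL per line as in the example above; the helper names `build_bug_urls` and `write_bug_item_file` are hypothetical and not part of this repository.

```python
from pathlib import Path

# Prefix copied from the example URLs above (Linux kernel Bugzilla).
KERNEL_PREFIX = "https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id="


def build_bug_urls(prefix, first_id, last_id):
    """Return one XML URL per bug ID in the inclusive range."""
    return [f"{prefix}{bug_id}" for bug_id in range(first_id, last_id + 1)]


def write_bug_item_file(urls, path="bug_item_file.txt"):
    """Write the URLs to the given file, one per line (hypothetical helper)."""
    Path(path).write_text("\n".join(urls) + "\n", encoding="utf-8")
```

Calling `write_bug_item_file(build_bug_urls(KERNEL_PREFIX, 1, 9))` would reproduce the nine-line example file shown above.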

Usage

Run the following command to download the bug data, where linux is the project name:

python3 main.py linux
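
The internals of main.py are not described here. As a rough illustration of what handling one URL from bug_item_file.txt might involve, the sketch below uses only the Python standard library. It is not the repository's implementation: the function names are hypothetical, and the element names `bug`, `bug_id`, and `short_desc` are assumptions about the layout of the Bugzilla XML export.

```python
import urllib.request
import xml.etree.ElementTree as ET


def read_bug_urls(path="bug_item_file.txt"):
    """Return the non-empty lines of the URL file."""
    with open(path, encoding="utf-8") as handle:
        return [line.strip() for line in handle if line.strip()]


def fetch_bug_xml(url, timeout=30):
    """Download the raw XML bytes for one bug report URL."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def summarize_bug(xml_bytes):
    """Extract the ID and summary from one XML document.

    Assumes a <bugzilla> root containing a <bug> element with
    <bug_id> and <short_desc> children; returns None if no <bug> is found.
    """
    root = ET.fromstring(xml_bytes)
    bug = root.find("bug")
    if bug is None:
        return None
    return {
        "bug_id": bug.findtext("bug_id"),
        "short_desc": bug.findtext("short_desc"),
    }
```

A crawl loop under these assumptions would call `summarize_bug(fetch_bug_xml(url))` for each URL returned by `read_bug_urls()`.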

Bug XML URL Prefix Reference

Eclipse: https://bugs.eclipse.org/bugs/show_bug.cgi?ctype=xml&id=

Freedesktop: https://bugs.freedesktop.org/show_bug.cgi?ctype=xml&id=

GCC: https://gcc.gnu.org/bugzilla/show_bug.cgi?ctype=xml&id=

GNOME: https://bugzilla.gnome.org/show_bug.cgi?ctype=xml&id=

KDE: https://bugs.kde.org/show_bug.cgi?ctype=xml&id=

LibreOffice: https://bugs.documentfoundation.org/show_bug.cgi?ctype=xml&id=

Linux kernel: https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=

LLVM: https://bugs.llvm.org/show_bug.cgi?ctype=xml&id=

OpenOffice: https://bz.apache.org/ooo/show_bug.cgi?ctype=xml&id=
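
A bug's XML URL is the project's prefix followed by the bug ID. The lookup below is a small sketch of that rule using the prefixes listed above; the dictionary keys and the function name `bug_xml_url` are hypothetical choices for illustration, not names used by this repository.

```python
# Prefixes copied from the reference list above; the lowercase keys are
# illustrative labels chosen for this sketch.
BUG_XML_PREFIXES = {
    "eclipse": "https://bugs.eclipse.org/bugs/show_bug.cgi?ctype=xml&id=",
    "freedesktop": "https://bugs.freedesktop.org/show_bug.cgi?ctype=xml&id=",
    "gcc": "https://gcc.gnu.org/bugzilla/show_bug.cgi?ctype=xml&id=",
    "gnome": "https://bugzilla.gnome.org/show_bug.cgi?ctype=xml&id=",
    "kde": "https://bugs.kde.org/show_bug.cgi?ctype=xml&id=",
    "libreoffice": "https://bugs.documentfoundation.org/show_bug.cgi?ctype=xml&id=",
    "linux": "https://bugzilla.kernel.org/show_bug.cgi?ctype=xml&id=",
    "llvm": "https://bugs.llvm.org/show_bug.cgi?ctype=xml&id=",
    "openoffice": "https://bz.apache.org/ooo/show_bug.cgi?ctype=xml&id=",
}


def bug_xml_url(project, bug_id):
    """Return the XML URL for a bug: the project's prefix plus the bug ID."""
    key = project.strip().lower()
    if key not in BUG_XML_PREFIXES:
        raise KeyError(f"unknown project: {project!r}")
    return f"{BUG_XML_PREFIXES[key]}{int(bug_id)}"
```

For example, `bug_xml_url("GCC", 42)` yields the GCC prefix with `42` appended.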

Citation

If you use this dataset, please acknowledge it by citing the following publication:

@inproceedings{xiao2020hindbr,
  title={HINDBR: Heterogeneous Information Network Based Duplicate Bug Report Prediction},
  author={Xiao, Guanping and Du, Xiaoting and Sui, Yulei and Yue, Tao},
  booktitle={2020 IEEE 31st International Symposium on Software Reliability Engineering (ISSRE)},
  pages={195--206},
  year={2020},
  organization={IEEE}
}

About

Bug Report Web Crawler
