Skip to content

OBI-Future/OBI-Survey

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 

Repository files navigation

Oracle Bone Inscriptions Information Processing: A Comprehensive Survey

This repository accompanies the survey Oracle Bone Inscriptions Information Processing: A Comprehensive Survey and systematically organizes benchmarks and resources across oracle bone character recognition, fragment rejoining, classification and retrieval, decipherment, and related multimodal tasks. It aims to serve as a collection for reporting progress in the field of Oracle Bone Inscriptions Processing.

We will continuously maintain and update this repository to ensure long-term value for the community.

Overview

Paper: Techrxiv
Project Page: This repository


Contributions

📧 We warmly welcome pull requests (PRs)!

Please contact the first author of this paper for queries.

If you find this repository useful, please consider giving us a ⭐. Thank you for your support!


Citation

If our work is helpful in your research, please cite our survey as:

@article{Chen_2025,
title={Oracle Bone Inscriptions Information Processing: A Comprehensive Survey},
url={http://dx.doi.org/10.22541/au.176616165.50988592/v1},
DOI={10.22541/au.176616165.50988592/v1},
publisher={Wiley},
author={Chen, Zijian and Hua, Wenjie and Li, Jinhao and Zhu, Yucheng and Zhi, Xiaona and Liu, Zhiji and Chen, Tingzhu and Zhang, Wenjun and Zhai, Guangtao},
year={2025}

Table of Contents


Online Resource

OBI Website Project Page
Yin Qi Wen Yuan https://jgw.aynu.edu.cn/home/index.html
Xiao Xue Tang https://xiaoxue.iis.sinica.edu.tw/
Guo Xue Da Shi https://www.guoxuedashi.com/
Zhui Yu Lian Zhu https://www.fdgwz.org.cn/ZhuiHeLab/Home
Omniglot https://www.omniglot.com/chinese/jiaguwen.htm
Multi-function Chinese Character Database https://humanum.arts.cuhk.edu.hk/Lexis/lexi-mf/
Yin Xu OBI Database https://obid.ancientbooks.cn/
Chinese Etymology https://hanziyuan.net
OBI AI Collaborative Platform https://www.jgwlbq.org.cn/home
------------ --------------------------------------------------------------
Museum Collection
故宫博物院 https://digicol.dpm.org.cn/list?category=18&dynasty=1265
河南博物院 https://www.chnmus.net/ch/collection/boutique/index.html?pageIndex=1&dictionaryValues=type_%E7%94%B2%E9%AA%A8%E7%AE%80%E7%89%8D&dictionaryValues=years_商
辽宁省博物馆 https://www.lnmuseum.com.cn/#/collect/digital-culture
山东博物馆 https://www.sdmuseum.com/col/col353161/index.html?uid=750411&pageNum=1
陕西历史博物馆 https://www.sxhm.com/collection.html
上海博物馆 https://www.shanghaimuseum.net/mu/frontend/pg/lib1/antique?libTypes=LIB_TYPE_0005
殷墟博物馆 https://www.ayyx.com/yxgw/collection
浙江省博物馆 https://www.zhejiangmuseum.com/cn/#/Collection/ExcellentCollection
中国国家博物馆 https://www.chnmuseum.cn/zp/zpml/201812/t20181218_26025.shtml
重庆中国三峡博物馆 https://www.3gmuseum.cn/#/collectorsEdition/disclosure?recNo=4028808a5e3b12de015e3b2c79340003&acTiveNo=4028808a5e3b12de015e3b2c79340003

OBI-Dataset

Recognition

Dataset Paper Project Page
YinQiWenYuan_detection YinQiWenYuan: Oracle Bone Character Detection Dataset Website
OracleBone-8000 AI-Powered Oracle Bone Inscriptions Recognition and Fragments Rejoining N/A
ACCID Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations N/A
O2BR OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones? Github

Rejoining

Dataset Paper Project Page
OB-Rejoin Data-Driven Oracle Bone Rejoining: A Dataset and Practical Self-Supervised Learning Scheme N/A
COBD SFF-Siam: A New Oracle Bone Rejoining Method Based on Siamese Network N/A
OBI-rejoin OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones? Github
OBFI Deep Rejoining Model and Dataset of Oracle Bone Fragment Images N/A
OBID-ACR (2026) A multi-modal dataset and method for bone-level association prediction in oracle bone inscriptions Github

Classification and Retrieval

Dataset Paper Project Page
Oracle-20k Building Hierarchical Representations for Oracle Character and Sketch Recognition N/A
OBC306 OBC306: A Large-Scale Oracle Bone Character Recognition Dataset Website
Oracle-AYNU Oracle Character Recognition by Nearest Neighbor Classification with Deep Metric Learning N/A
HWOBC HWOBC: A Handwriting Oracle Bone Character Recognition Database Website
Oracle-50k Self-supervised Learning of Orc-Bert Augmentor for Recognizing Few-Shot Oracle Characters Github
OBI-IJDH OBI Dataset for IJDH and OBI Recognition Application Website
Oracle-250 Recognition of Oracle Radical based on the Capsule network N/A
Radical-148 Recognition of Oracle Radical based on the Capsule network N/A
OBI125 Dynamic Dataset Augmentation for Deep Learning-based Oracle Bone Inscriptions Recognition Website
OBI-100 Improvement of Oracle Bone Inscription Recognition Accuracy: A Deep Learning Perspective N/A
Oracle-241 Unsupervised Structure-Texture Separation Network for Oracle Character Recognition Github
ORCD Radical-based Extract and Recognition Networks for Oracle Character Recognition N/A
OCCD Radical-based Extract and Recognition Networks for Oracle Character Recognition N/A
OracleRC RZCR: Zero-shot Character Recognition via Radical-based Reasoning N/A
Oracle-MNIST Oracle-MNIST: a Dataset of Oracle Characters for Benchmarking Machine Learning Algorithms Github
OBI component 20 Component-Level Oracle Bone Inscription Retrieval Github

Decipherment

Dataset Paper Project Page
OBI-ECC Study on the Evolution of Chinese Characters Based on Few-shot Learning: From Oracle Bone Inscriptions to Regular Script N/A
EVOBC An Open Dataset for the Evolution of Oracle Bone Characters: EVOBC Github
HUST-OBC An Open Dataset for Oracle Bone Character Recognition and Decipherment Github
ACCP Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction Github
OracleSem OracleSage: Towards Unified Visual-Linguistic Understanding of Oracle Bone Scripts through Cross-Modal Knowledge Fusion N/A
GEVOBC A Graph-based Evolutionary Dataset for Oracle Bone Characters from Inscriptions to Modern Chinese Scripts Github
PD-OBS Interpretable Oracle Bone Script Decipherment through Radical and Pictographic Analysis with LVLMs Github
PictOBI-20k PictOBI-20k: Unveiling Large Multimodal Models in Visual Decipherment for Pictographic Oracle Bone Characters Github

Others

Dataset Paper Project Page
RCRN RCRN: Real-world Character Image Restoration Network via Skeleton Extraction Github
OBIMD Oracle Bone Inscriptions Multi-modal Dataset Hugging Face
RMOBS OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography N/A
Oracle-P15k Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark Github

Paper Index (Task-oriented)

This section provides a task-oriented index of OBI Processing Tasks and Approaches papers, aligned with the task taxonomy used in this survey (Section 4 of our paper).

OBI Preprocessing: Data Augmentation & Restoration

Paper Venue & Year Focus
Dynamic Dataset Augmentation for Deep Learning-based Oracle Bone Inscriptions Recognition ACM JOCCH 2022 GAN-based dynamic augmentation
Oracle Bone Heritage Data Augmentation Based on Two-stage Decomposition GANs npj Heritage Science 2025 Two-stage decomposition GAN
Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark ACM MM 2025 Diffusion-based synthesis, long-tail
Large Kernel Convolutional Attention Based U-Net Network for Inpainting Oracle Bone Inscription PRCV 2023 U-Net based inpainting
Coarse-to-Fine Generative Model for Oracle Bone Inscriptions Inpainting ML4AL @ ACL 2024 GAN-based coarse-to-fine inpainting
Oracle Bone Inscription Image Restoration via Glyph Extraction npj Heritage Science 2025 Glyph-driven restoration
OBIFormer: A Fast Attentive Denoising Framework for Oracle Bone Inscriptions Displays 2025 Attention-based denoising
Orpaint: A Zero-shot Inpainting Model for Oracle Bone Inscription Rubbings with Visual Mamba Block SCIS 2025 Diffusion-based inpainting
Multi-modal Ancient Scripts Recognition via Deep Learning with Data Homogenization and Augmentation npj Heritage Science 2025 Cross-modal data homogenization
Generating Oracle Bone Inscriptions Based on the Structure-aware Diffusion Model npj Heritage Science 2025 Structure-aware diffusion

OBI Recognition

Traditional Pattern Recognition

Paper Venue & Year Focus
A Method of Jia Gu Wen Recognition Based on a Two-level Classification ICDAR 1995 Two-level classification, topological structure
Recognition of Oracular Bone Inscriptions Using Template Matching IJCTE 2016 Four-directional scanning, template matching
Oracle-Bone Inscriptions Recognition Based on Topological Features CSA 2019 Topological feature points, connected domains

Deep Representation Learning-Based Recognition

Paper Venue & Year Focus
Oracle Character Detection Based on Improved Faster R-CNN IEEE ICITBS 2021 Two-stage detector with feature fusion
Oracle Bone Inscription Detector Based on SSD ICIAP 2019 SSD-based small character detection
Recognition of Oracle Bone Inscriptions by Using Two Deep Learning Models IJDH 2023 YOLO + MobileNet pipeline
FDW-YOLO: An Improved YOLOv12 for Oracle Bone Inscriptions Detection ICONIP 2025 Feature diffusion pyramid, mixed convolution
Oracle Character Prototype-Guided Cyclic Disentanglement for Oracle Bone Inscriptions Detection ICPRAI 2024 Prototype guidance, contrastive disentanglement
Detecting Oracle Bone Inscriptions via Pseudo-category Labels Heritage Science 2024 Pseudo-label supervision, structural prior
Clustering-based Feature Representation Learning for Oracle Bone Inscriptions Detection npj Heritage Science 2025 Clustering-based representation learning
Radical-based Extract and Recognition Networks for Oracle Character Recognition IJDAR 2022 Radical-aware feature extraction
Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations ACM MM 2023 Radical-level supervision, zero-shot setting

OBI Rejoining

Contour Matching-Based Methods

Paper Venue & Year Focus
The Research on Rejoining of the Oracle Bone Rubbings Based on Curve Matching TALLIP 2021 Partial-to-global curve matching
Research on Key Technologies of the Computer Aided Rejoining of Oracle Bone Inscriptions ICIFE 2010 Freeman chain code, contour matching
System Design for Computer Aided Rejoining of Bones/Tortoise Shells with Inscriptions Based on Contour Matching ICCCT 2010 Shape function–based contour matching
AI-powered Oracle Bone Inscriptions Recognition and Fragments Rejoining IJCAI 2020 Time-series modeling of contour curves

Deep Learning–Assisted Methods

Paper Venue & Year Focus
Internal Similarity Network for Rejoining Oracle Bone Fragment Images Symmetry 2022 Internal similarity pooling network
SFF-Siam: A New Oracle Bone Rejoining Method Based on Siamese Network IEEE CG&A 2023 Siamese network with similarity feature fusion
Data-driven Oracle Bone Rejoining: A Dataset and Practical Self-supervised Learning Scheme KDD 2022 Self-supervised learning, dataset-driven
OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery arXiv 2025 Duplicate discovery, coarse-to-fine matching
A multi-modal dataset and method for bone-level association prediction in oracle bone inscriptions npj heritage science 2026 multi-modal deep learning method

OBI Classification and Retrieval

Supervised Deep Learning

Paper Venue & Year Focus
Building Hierarchical Representations for Oracle Character and Sketch Recognition IEEE TIP 2016 Hierarchical representation, early classification
OBC306: A Large-scale Oracle Bone Character Recognition Dataset ICDAR 2019 Large-scale dataset, CNN benchmarks
Oracle Bone Inscriptions Recognition Based on Deep Convolutional Neural Network JOIG 2020 CNN-based classification
Improvement of Oracle Bone Inscription Recognition Accuracy: A Deep Learning Perspective ISPRS IJGI 2022 Deep learning baselines
A Classification Method of Oracle Materials Based on Local Convolutional Neural Network Framework IEEE CG&A 2020 Two-stage material classification
Distinguishing Oracle Variants Based on Isomorphism and Symmetry Invariances of Oracle-bone Inscriptions IEEE Access 2020 Symmetry and invariance modeling
OraclePoints: A Hybrid Neural Representation for Oracle Character ACM MM 2023 Image–point hybrid representation
Oracle Character Image Retrieval by Combining Deep Neural Networks and Clustering Technology IAENG IJCS 2020 DNN + clustering retrieval
Oracle Bone Inscription Image Retrieval Based on Improved ResNet Network ICPR 2024 Siamese-style metric learning
A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions ICIC 2025 Cross-font retrieval

Zero-Shot and Few-Shot Learning

Paper Venue & Year Focus
OracleGCD: Generalized Category Discovery for Oracle Bone Scripts ICDAR 2025 Generalized category discovery
Ora-NSC: A Novel Semi-supervised Approach for Oracle Bone Fragment Classification with Imbalanced Classes ACM MM Asia 2025 Semi-supervised learning
OBI-CMF: Self-supervised Learning with Contrastive Masked Frequency Modeling for Oracle Bone Inscription Recognition npj Heritage Science 2025 Self-supervised contrastive learning
Linking Unknown Characters via Oracle Bone Inscriptions Retrieval Multimedia Systems 2024 Unknown character retrieval
RZCR: Zero-shot Character Recognition via Radical-based Reasoning IJCAI 2023 Radical-based zero-shot reasoning
Component-level Oracle Bone Inscription Retrieval ICMR 2024 Component-level retrieval

Cross-Modal Learning

Paper Venue & Year Focus
Unsupervised Structure-Texture Separation Network for Oracle Character Recognition IEEE TIP 2022 Structure–texture disentanglement
OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research arXiv 2025 Vision–text retrieval, agentic system

OBI Deciphering

Modern Chinese Alignment-Based Deciphering

Paper Venue & Year Focus
Sundial-GAN: A Cascade GAN Framework for Deciphering Oracle Bone Inscriptions ACM MM 2022 GAN-based simulation of oracle-to-modern character evolution
Deciphering Oracle Bone Language with Diffusion Models ACL 2024 Conditional diffusion for oracle–modern Chinese alignment
A Text–Image Dual Conditional Stable Diffusion Model for Oracle Bone Inscription Decipherment npj Heritage Science 2025 Dual-condition diffusion with visual–semantic alignment
Deciphering Ancient Chinese Oracle Bone Inscriptions Using Case-Based Reasoning ICCBR 2021 Auto-encoder–based multi-font feature retrieval
Study on the Evolution of Chinese Characters Based on Few-Shot Learning PLOS ONE 2022 Few-shot Siamese learning for character evolution
Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction ICDAR 2024 Radical/stroke reconstruction via Transformer
Component-Level Segmentation for Oracle Bone Inscription Decipherment AAAI 2025 Component-aware segmentation for decipherment
A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions ICIC 2025 Historical font intermediaries for alignment
A Graph-Based Evolutionary Dataset for Oracle Bone Characters npj Heritage Science 2025 Graph representations for oracle–modern character evolution

Visual Content Alignment-Based Deciphering

Paper Venue & Year Focus
Making Visual Sense of Oracle Bones for You and Me CVPR 2024 Human study on visual grounding of oracle glyphs
V-Oracle: Making Progressive Reasoning in Deciphering Oracle Bones ACL 2025 VQA-based progressive visual reasoning
OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography ICCV 2025 LMM-based visual alignment with structure constraints
PictOBI-20k: Unveiling Large Multimodal Models in Visual Decipherment arXiv 2025 Visual perception benchmark for pictographic OBI

Text Interpretation-Based Deciphering

Paper Venue & Year Focus
OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones? ICLR 2025 Systematic evaluation of LMM-based oracle interpretation
OracleSage: Towards Unified Visual-Linguistic Understanding of Oracle Bone Scripts arXiv 2024 Cross-modal reasoning with knowledge fusion
Interpretable Oracle Bone Script Decipherment through Radical and Pictographic Analysis with LVLMs arXiv 2025 LVLM-based interpretable decipherment
OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research arXiv 2025 Agentic system for structured oracle interpretation

About

[npj Heritage Science'26] The official GitHub page for the survey paper "Oracle Bone Inscriptions Processing: A Comprehensive Survey".

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors