IBUT: Iterative Bilingual Understanding Translation

Project Introduction

IBUT (Iterative Bilingual Understanding Translation) is a machine translation approach that aims to improve translation quality by having a large language model (LLM) generate contextual understanding of the input in both the source and target languages. It exploits the dual-learning nature of translation to produce linguistic feedback between the two understandings, then refines them iteratively before translating.

Methodology

The IBUT approach consists of four main components:

Understanding Generation:
- Uses an LLM to generate contextual understanding in both the source and target languages from the input sentence
- Contextual understanding includes key concepts, terminology, explanations, and related examples
Alignment Judgment:
- Employs the LLM as a Judgment Agent (JA) to evaluate the consistency of bilingual contextual understanding
- If inconsistencies are found, generates explicit linguistic feedback highlighting the differences and offering suggestions for improvement
Iterative Refinement:
- Refines the previously generated bilingual contextual understanding based on the feedback signals
- Repeats the alignment judgment and refinement steps, up to a predefined maximum number of iterations
Understanding-Based Translation:
- Inputs the optimized bilingual contextual understanding along with the sentence to be translated
- Performs translation directly via the LLM
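
The four components above can be sketched as a single loop. This is a minimal illustration, not the code in ibut.py: the function names, the prompt wording, the "CONSISTENT" verdict convention, and the default language pair are all assumptions made for the example, and the LLM is represented by any callable that maps a prompt string to a reply string.

```python
from typing import Callable, Optional

# Assumption: an "LLM" here is any function from prompt text to reply text.
LLM = Callable[[str], str]


def generate_understanding(llm: LLM, sentence: str, language: str) -> str:
    """Component 1: contextual understanding of the sentence in one language."""
    return llm(
        f"In {language}, list the key concepts, terminology, explanations, "
        f"and related examples for this sentence:\n{sentence}"
    )


def judge_alignment(llm: LLM, src_u: str, tgt_u: str) -> Optional[str]:
    """Component 2: return None if consistent, otherwise the feedback text.

    The 'CONSISTENT' keyword is a convention invented for this sketch.
    """
    verdict = llm(
        "Compare the two understandings below. Reply with exactly CONSISTENT "
        "if they agree; otherwise describe the differences and suggest "
        f"improvements.\nSource-language understanding:\n{src_u}\n"
        f"Target-language understanding:\n{tgt_u}"
    )
    return None if verdict.strip().upper().startswith("CONSISTENT") else verdict


def refine(llm: LLM, understanding: str, feedback: str) -> str:
    """Component 3: revise one understanding using the judge's feedback."""
    return llm(
        "Revise this understanding using the feedback.\n"
        f"Understanding:\n{understanding}\nFeedback:\n{feedback}"
    )


def ibut_translate(
    llm: LLM,
    sentence: str,
    src_lang: str = "Chinese",
    tgt_lang: str = "English",
    max_iterations: int = 3,
) -> str:
    src_u = generate_understanding(llm, sentence, src_lang)
    tgt_u = generate_understanding(llm, sentence, tgt_lang)
    for _ in range(max_iterations):
        feedback = judge_alignment(llm, src_u, tgt_u)
        if feedback is None:  # understandings agree, stop early
            break
        src_u = refine(llm, src_u, feedback)
        tgt_u = refine(llm, tgt_u, feedback)
    # Component 4: translate with both understandings as context.
    return llm(
        f"Translate from {src_lang} to {tgt_lang} using the understanding "
        f"below.\n{src_lang} understanding:\n{src_u}\n"
        f"{tgt_lang} understanding:\n{tgt_u}\nSentence:\n{sentence}"
    )
```

In this sketch the loop stops early when the judge reports agreement. When the iteration limit is reached without agreement, it translates with the most recently refined understandings.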

Code Structure

model.py: LLM interface class that provides interaction with the large language model
ibut.py: IBUT implementation class that contains the full translation process
main.py: Main script demonstrating the IBUT workflow
test_ibut.py: Test script with additional test cases and evaluation methods

Usage

Basic Usage

# Import necessary classes
from model import LLMModel
from ibut import IBUT

# Initialize model and IBUT
model = LLMModel(model_name="gpt-3.5-turbo")
ibut_translator = IBUT(model, max_iterations=3)

# Perform translation
source_sentence = "气候变化是当今人类面临的最严峻挑战之一。"
translation = ibut_translator.translate(source_sentence)

print(f"Source sentence: {source_sentence}")
print(f"Translation result: {translation}")

Run Demo Script

python main.py

Run Test Script

python test_ibut.py

Notes:

- Configure access to a real LLM API before using IBUT in practice.
- In model.py, adapt the generate method to the model provider you use (e.g., OpenAI, DeepSeek).
- The max_iterations parameter sets the maximum number of refinement iterations.
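
One way to keep the generate method easy to adapt is to route it through a replaceable backend function. The class below is a hypothetical stand-in written for illustration; the actual LLMModel in model.py may be structured differently. The class name PluggableLLMModel, the backend parameter, and the offline echo placeholder are all assumptions.

```python
from typing import Callable, Optional

# Assumption: a backend takes (model_name, prompt) and returns the reply text.
Backend = Callable[[str, str], str]


class PluggableLLMModel:
    """Hypothetical wrapper; not the LLMModel class shipped in model.py."""

    def __init__(self, model_name: str, backend: Optional[Backend] = None):
        self.model_name = model_name
        # Replace the default with a function that calls your provider's SDK
        # (OpenAI, DeepSeek, ...) and returns the completion as a string.
        self.backend = backend or self._echo_backend

    @staticmethod
    def _echo_backend(model_name: str, prompt: str) -> str:
        # Offline placeholder: performs no inference, only echoes the prompt.
        return f"[{model_name}] {prompt}"

    def generate(self, prompt: str) -> str:
        """Send the prompt to the configured backend and return its reply."""
        return self.backend(self.model_name, prompt)
```

With this layout, switching providers means passing a different backend function, and the rest of the translation code only ever calls generate.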
