Datasets:

Modalities:
Tabular
Text
Formats:
json
Size:
< 1K
ArXiv:
Libraries:
Datasets
pandas
License:
Dataset Viewer
Auto-converted to Parquet Duplicate
task_id
int64
1
50
task_type
stringclasses
5 values
task_zh
stringlengths
8
58
task_en
stringlengths
23
201
applications
sequencelengths
1
2
preparation_zh
stringlengths
0
27
preparation_en
stringlengths
0
109
total_milestone_scores
int64
1
6
scoring_milestones_zh
sequencelengths
1
6
scoring_milestones_en
sequencelengths
1
6
human_expert_steps
sequencelengths
3
3
1
easy
ๅœจ12306ไธญ่ฟ›ๅ…ฅโ€œๆˆ‘็š„โ€้กต้ข๏ผŒๅœจๅธธ็”จๅŠŸ่ƒฝไธญๅฐ†appไปŽๆ ‡ๅ‡†ๆจกๅผๅˆ‡ๆขๅˆฐๆ•ฌ่€็‰ˆใ€‚
Go to the 'My' page in the 12306 app, and switch the app from standard mode to senior mode under common functions.
[ "12306 (China Railway)" ]
ๅˆ‡ๆขๅˆฐๆ ‡ๅ‡†ๆจกๅผ
switch to standard mode
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 5, 5, 8 ]
2
easy
ๅœจB็ซ™ๆœ็ดขโ€œ้ซ˜็ญ‰ๆ•ฐๅญฆโ€ๅนถๆ’ญๆ”พ็ฌฌไธ€ไธช่ง†้ข‘ใ€‚
Search 'Advanced Mathematics' in Bilibili and play the first video.
[ "BiliBili" ]
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 5, 6, 5 ]
3
easy
ไฝฟ็”จ็›ธๆœบ็š„ๅคœๆ™ฏๆจกๅผๆ‹ไธ€ๅผ ็…ง็‰‡ใ€‚
Take a photo using the camera's night mode.
[ "Camera" ]
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 3, 4, 3 ]
4
easy
ๅ…ณ้—ญๆœ€ๆ—ฉ็š„ไธ€ไธชๆ‰“ๅผ€็š„้—น้’Ÿใ€‚
Turn off the earliest enabled alarm.
[ "Clock" ]
่ฎพ็ฝฎๅนถๆ‰“ๅผ€ๆ•ฐไธช้—น้’Ÿ
Set and turn on several alarms.
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 2, 2, 2 ]
5
easy
ๆŸฅ่ฏขใ€Šไบบๅทฅๆ™บ่ƒฝใ€‹่ฟ™้ƒจ็”ตๅฝฑ็š„่ฑ†็“ฃ่ฏ„ๅˆ†ใ€‚
Check the Douban rating of the movie 'Artificial Intelligence'.
[ "Douban" ]
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 5, 4, 5 ]
6
easy
ๅœจ่ฐทๆญŒๆ—ฅๅކไธŠๅˆ›ๅปบไธ€ไธชๆ–ฐไปปๅŠก๏ผŒๆ ‡้ข˜ไธบโ€œ่ฎบๆ–‡ๅ†™ไฝœโ€ใ€‚
Create a new task on Google Calendar titled 'Writing Paper'.
[ "Google Calendar" ]
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 5, 5, 5 ]
7
easy
ๅธฎๆˆ‘ๆŸฅ่ฏขๆˆ‘็š„ๅŽไฝไผšไผšๅ‘˜ๆƒ…ๅ†ต๏ผŒๅนถๅ‘Š่ฏ‰ๆˆ‘ๆœ€ๆ™šๅฏไปฅๅ‡ ็‚น้€€ๆˆฟใ€‚
Help me check my H World membership and tell me the latest checkout time.
[ "H World" ]
็™ปๅฝ•่ดฆๅท
logging in an account
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 3, 2, 3 ]
8
easy
ๅœจไบฌไธœๆœ็ดข้บฆๅ…‹้ฃŽ๏ผŒๅนถๅฐ†็ฌฌไธ€ไธชๅ•†ๅ“ๆทปๅŠ ๅˆฐ่ดญ็‰ฉ่ฝฆใ€‚
Search for microphones on JD and add the first product to the cart.
[ "JD" ]
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 6, 5, 6 ]
9
easy
ๅฐ†้บฆๅฝ“ๅŠณapp็š„่ฏญ่จ€ๅˆ‡ๆขไธบ่‹ฑๆ–‡ใ€‚
Switch the language of the McDonald's app to English.
[ "McDonald's" ]
ๅˆ‡ๆข่ฏญ่จ€ไธบไธญๆ–‡
switch the language to Chinese
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 7, 6, 7 ]
10
easy
ๆŸฅ็œ‹ๆ˜Žๅคฉ็š„ๅคฉๆฐ”ใ€‚
Check tomorrow's weather.
[ "Weather" ]
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 2, 1, 1 ]
11
medium
ๅœจๆ—ถๅˆป่กจไธญๆŸฅ่ฏขๆ˜ŽๅคฉไปŽไธŠๆตทๅˆฐๅŒ—ไบฌ็š„็ผ–ๅทไธบG104็š„ๅˆ—่ฝฆ้ข„่ฎกๅ‡ ็‚น้’Ÿๅˆฐๅ—ไบฌใ€‚
Check the timetable for train G104 from Shanghai to Beijing tomorrow, and find out what time it is expected to arrive in Nanjing.
[ "12306 (China Railway)" ]
3
[ "่ฟ›ๅ…ฅๆ—ถๅˆป่กจๆŸฅ่ฏข็•Œ้ข", "่ฝฆๆฌกๆญฃ็กฎ", "ๅฎŒๆˆไปปๅŠก" ]
[ "enter the timetable page", "enter train number correctly", "task completion" ]
[ 13, 13, 9 ]
12
medium
ไปŽB็ซ™โ€œ็ƒญ้—จโ€้กต้ข่ฟ›ๅ…ฅๆŽ’่กŒๆฆœ๏ผŒๆ’ญๆ”พๆŽ’ๅ็ฌฌไธ€็š„่ง†้ข‘๏ผŒๅ…ณๆณจ่ฟ™ไธช่ดฆๅทๅนถๆทปๅŠ ่ฟ™ไธช่ง†้ข‘ๅˆฐๆ”ถ่—ๅคนไธญใ€‚
Go to the leaderboard from the 'Hot' page of BiliBili, play the top-ranked video, follow the account, and add the video to the favorites.
[ "BiliBili" ]
ๅ–ๆถˆๅ…ณๆณจ่ฟ™ไธช็›ฎๆ ‡่ดฆๅท๏ผŒๅ…ณ้—ญ่‡ชๅŠจ่ฟžๆ’ญๅŠŸ่ƒฝ
unfollow this target account, turn off the auto-play function
4
[ "่ฟ›ๅ…ฅๆŽ’่กŒๆฆœ", "ๆ’ญๆ”พ่ง†้ข‘", "ๅ…ณๆณจ่ดฆๅท", "ๆทปๅŠ ๆ”ถ่—" ]
[ "enter the leaderboard page", "play video correctly", "follow the account", "add to favorites" ]
[ 6, 6, 6 ]
13
medium
่ฎพ็ฝฎๆœ€้•ฟๅปถ่ฟŸ็š„ๅฎšๆ—ถ่‡ชๆ‹ใ€‚
Set the timer selfie with the longest delay.
[ "Camera" ]
ๅˆ‡ๆขๅˆฐๅŽ็ฝฎๆ‘„ๅƒๅคด
switch to the back camera
2
[ "่ฎพ็ฝฎๅปถ่ฟŸ", "ๅˆ‡ๆขๅ‰็ฝฎ" ]
[ "set delay", "switch to the front camera" ]
[ 4, 4, 6 ]
14
medium
ๅœจไธ–็•Œๆ—ถ้—ดๅˆ—่กจไธญๆทปๅŠ ้ฆ™ๆธฏใ€‚
Add Hong Kong in list of world clock.
[ "Clock" ]
ไปŽไธ–็•Œๆ—ถ้—ดๅˆ—่กจไธญ็งป้™ค้ฆ™ๆธฏ
remove Hong Kong from the list of world clock
2
[ "่ฟ›ๅ…ฅๆทปๅŠ ็•Œ้ข", "ๅฎŒๆˆไปปๅŠก" ]
[ "enter the page to add a city", "task completion" ]
[ 6, 6, 6 ]
15
medium
ๆœ็ดขๅนถ่ฟ›ๅ…ฅโ€œ่ฑ†็“ฃ็”ตๅฝฑTop250โ€ๆฆœๅ•๏ผŒ็ป™ๆŽ’ๅ็ฌฌไบŒ็š„็”ตๅฝฑๆœ€้กถ้ƒจ็š„็Ÿญ่ฏ„็‚น่ตžใ€‚
Search 'Douban Movie Top 250' leaderboard and like the top comment of the second-ranked movie.
[ "Douban" ]
่ฟ™ๆก็Ÿญ่ฏ„ๆฒกๆœ‰่ขซ็‚น่ตž
this comment has not been liked
2
[ "่ฟ›ๅ…ฅๆฆœๅ•", "ๅฎŒๆˆไปปๅŠก" ]
[ "enter the leaderboard page", "task completion" ]
[ 11, 5, 8 ]
16
medium
ๅœจ่ฐทๆญŒๆ—ฅๅކไธŠๅˆ›ๅปบไธ€ไธชๆ–ฐๆ—ฅ็จ‹๏ผŒๆ—ฅๆœŸไธบๆ˜Žๅคฉ๏ผŒๆ—ถ้—ดไธบๅ…จๅคฉ๏ผŒๆ ‡้ข˜ไธบโ€œๅ…ฌๅธๅนดไผšโ€ใ€‚
Create a new all-day event on Google Calendar for tomorrow titled 'Company Annual Meeting'.
[ "Google Calendar" ]
2
[ "ๆ—ฅๆœŸๆ—ถ้—ดๆญฃ็กฎ", "ๅฎŒๆˆไปปๅŠก" ]
[ "correct date and time", "task completion" ]
[ 8, 6, 6 ]
17
medium
ๅœจๅŽไฝไผšappไธŠๆŸฅ่ฏข่ฏ„ๅˆ†ๆœ€้ซ˜็š„ๆฑ‰ๅบญ้…’ๅบ—็š„่ตทไปทใ€‚
Use the H World app, search for the starting price of the nearest Hanting Hotel to me.
[ "H World" ]
2
[ "ไฝฟ็”จ่ฏ„ๅˆ†ๆœ€ๅฅฝๆŽ’ๅบ", "ๅฎŒๆˆไปปๅŠก" ]
[ "sort by highest rating", "task completion" ]
[ 7, 5, 8 ]
18
medium
ๅฏปๆ‰พไบฌไธœๅ•†ๅŸŽไธญๆœ€่ฟ‘ๅพ…่ฏ„ไปท็š„ไธ€ไธชๅ•†ๅ“๏ผŒ็ป™ๅ‡บๆปกๅˆ†ๅฅฝ่ฏ„๏ผŒๅนถๅ†™ไธ€ๆฎต่ฏ„่ฏญ๏ผŒๆทปๅŠ ไธคๅผ ็…ง็‰‡ใ€‚
Find the most recent product awaiting comment on JD, give it a full five-star rating, write a comment, and add two photos.
[ "JD" ]
ๆœ‰ๆœช่ฏ„ไปท็š„่ฎขๅ•
there are orders that have not been commented on
2
[ "่ฟ›ๅ…ฅ่ฏ„ไปท็•Œ้ข", "ๅฎŒๆˆไปปๅŠก" ]
[ "enter comment page", "task completion" ]
[ 14, 8, 12 ]
19
medium
็ป™ๆœ€่ฟ‘็š„ไธ€็ฌ”้บฆๅฝ“ๅŠณ่ฎขๅ•ๅผ€ๅ‘็ฅจ๏ผŒๅนถ้€‰ๆ‹ฉโ€œไธŠๆตทไบค้€šๅคงๅญฆโ€ๆŠฌๅคดใ€‚
Invoice the most recent McDonald's order, and select 'SJTU' for invoice title.
[ "McDonald's" ]
ๆœ‰ๆœชๅผ€ๅ‘็ฅจ่ฎขๅ•
there is an order and no invoice has been issued
3
[ "่ฟ›ๅ…ฅ่ฎขๅ•็•Œ้ข", "้€‰ๅ–ๆŠฌๅคดๆญฃ็กฎ", "ๅฎŒๆˆไปปๅŠก" ]
[ "enter order list page", "select invoice title correctly", "task completion" ]
[ 9, 9, 12 ]
20
medium
ๆŸฅ็œ‹ไปŠๆ™šๆ—ฅ่ฝ็š„ๆ—ถ้—ดใ€‚
Check the sunset time of tonight.
[ "Weather" ]
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 4, 2, 3 ]
21
hard
่ดญไนฐไธ€ๅผ ๆ˜ŽๅคฉไปŽไธŠๆตทๅˆฐๅŒ—ไบฌ๏ผŒๅ‡บๅ‘ๆ—ถ้—ดๅœจไน็‚นๅˆฐๅไธ€็‚นไน‹้—ด็š„้ซ˜้“็ฅจใ€‚
Buy a high-speed train ticket for tomorrow from Shanghai to Beijing, with departure time between 9 AM and 11 AM.
[ "12306 (China Railway)" ]
ๅ‡บๅ‘็ซ™ๅ’Œๅˆฐ่พพ็ซ™ไธŽ้ฆ–้กตๅฑ•็คบ็š„ไธๅŒ๏ผŒไธ”ๆœ‰็›ด่พพไฝ™็ฅจๅณๅฏ
departure and arrival stations differ from those on the homepage, and direct tickets must be available.
5
[ "ๆ—ฅๆœŸๆญฃ็กฎ", "่ฝฆ็ซ™ๆญฃ็กฎ", "่ฝฆๆฌก้€‰ๆ‹ฉๅˆ้€‚", "ๆทปๅŠ ไน˜่ฝฆไบบ", "ๅฎŒๆˆไปปๅŠก" ]
[ "correct date", "correct station", "suitable train selection", "add passenger", "complete task" ]
[ 20, 20, 18 ]
22
hard
ๅœจB็ซ™ๆœ็ดขๅนถๅ…ณๆณจโ€œX-LANCEๅฎž้ชŒๅฎคโ€่ดฆๅท๏ผŒๆ’ญๆ”พๆœ€ๆ–ฐ็š„่ง†้ข‘๏ผŒๅนถๆ’ฐๅ†™ไธ€ๆฎตๅ‹ๅ–„็š„่ฏ„่ฎบใ€‚
Search in BiliBili and follow the 'X-LANCE' Lab account, play the newest video and write a friendly comment.
[ "BiliBili" ]
ๅ–ๆถˆๅ…ณๆณจ่ฟ™ไธช่ดฆๅท๏ผŒๅ…ณ้—ญ่‡ชๅŠจ่ฟžๆ’ญๅŠŸ่ƒฝ
unfollow this account, turn off the auto-play function
4
[ "่ฟ›ๅ…ฅ่ดฆๅทไธป้กต", "ๅ…ณๆณจ่ดฆๅท", "ๆ’ญๆ”พ่ง†้ข‘", "ๆ’ฐๅ†™่ฏ„่ฎบ" ]
[ "enter homepage of this account", "follow this account", "play video correctly", "write comment" ]
[ 12, 10, 10 ]
23
hard
ๅฐ†็›ธๆœบๅˆ‡ๆขๅˆฐ่ง†้ข‘ๆจกๅผ๏ผŒๆ‰“ๅผ€่กฅๅ…‰็ฏ๏ผŒ็”ป่ดจ่ฎพ็ฝฎๆˆโ€œFHD 30FPSโ€๏ผŒๅฝ•ๅˆถๆ—ถ้—ด่ถ…่ฟ‡ไบŒๅ็ง’ไปฅๅŽๅœๆญขๅฝ•ๅˆถใ€‚
Switch the camera to video mode, turn on the fill light, set the video quality to 'FHD 30FPS', and stop recording after more than twenty seconds.
[ "Camera" ]
ๅˆ‡ๆขๅˆฐ้ž่ง†้ข‘ๆจกๅผ๏ผŒ้‡็ฝฎๆ‰€ๆœ‰่ฎพ็ฝฎ
switch to non-video mode, reset all settings
3
[ "ๆ‰“ๅผ€่กฅๅ…‰็ฏ", "ๅˆ‡ๆข็”ป่ดจ", "ๅฎŒๆˆ20sๅฝ•ๅˆถ" ]
[ "turn on the fill light", "switch video quality", "complete 20s recording" ]
[ 9, 9, 8 ]
24
hard
ๅˆ›ๅปบไธ€ไธชๅ็‚น็š„้—น้’Ÿ๏ผŒๆ ‡้ข˜ไธบโ€œไธŠ็ญโ€๏ผŒๅœจๆฏๅ‘จไธ€่‡ณๅ››ไฝฟ็”จๆŒฏๅŠจๆ้†’ใ€‚
Create an alarm at 10 o'clock titled 'Work' with vibration reminders on every Monday to Thursday.
[ "Clock" ]
5
[ "ๆ—ถ้—ดๆญฃ็กฎ", "ๆ ‡้ข˜ๆญฃ็กฎ", "ๅ‘จๆœŸ้‡ๅคๆญฃ็กฎ", "ๅ…ณ้—ญๅ“้“ƒๆ‰“ๅผ€ๆŒฏๅŠจ", "ๅฎŒๆˆไปปๅŠก" ]
[ "correct time", "correct title", "correct period repetition", "turn off the ringtone and turn on the vibration", "task completion" ]
[ 14, 45, 15 ]
25
hard
ๅœจ่ฑ†็“ฃไธญๆœ็ดข็”ตๅฝฑใ€Š่‚–็”ณๅ…‹็š„ๆ•‘่ตŽใ€‹๏ผŒๆ ‡่ฎฐไธบโ€œ็œ‹่ฟ‡โ€๏ผŒ็ป™ไบ”้ข—ๆ˜Ÿ็š„่ฏ„ๅˆ†๏ผŒๅนถ็•™ไธ‹ไธ€ๆก็งฐ่ตž็š„ๅฝฑ่ฏ„ใ€‚
Search for the movie 'The Shawshank Redemption' on Douban, mark it as 'watched', rate it with five stars, and leave a positive review.
[ "Douban" ]
ๅˆ ้™ค่ฟ™้ƒจ็”ตๅฝฑ็š„ไน‹ๅ‰็š„ๆ ‡่ฎฐใ€่ฏ„ๅˆ†ๅ’Œๅฝฑ่ฏ„
Remove the previous mark, rating, and review of this movie.
4
[ "่ฟ›ๅ…ฅ็”ตๅฝฑ่ฏฆๆƒ…็•Œ้ข", "ๆˆๅŠŸๆ ‡่ฎฐ", "ๆˆๅŠŸๆ‰“ๅˆ†", "็•™ไธ‹ๅฝฑ่ฏ„" ]
[ "enter the movie details page", "successfully marked", "successfully rated", "leave a review" ]
[ 10, 10, 9 ]
26
hard
ๅœจ่ฐทๆญŒๆ—ฅๅކไธŠๅˆ›ๅปบไธ€ไธชๅ‘จๆœŸๆ—ฅ็จ‹๏ผŒๆ ‡้ข˜ไธบโ€œ่ฎก็ฎ—ๆœบ่ง†่ง‰่ฏพ็จ‹โ€๏ผŒๆ—ถ้—ดไธบๆฏๅ‘จไธ‰ๆ™šไธŠๅ…ญ็‚นๅˆฐๅ…ซ็‚น๏ผŒไปŽๆœฌๅ‘จๅผ€ๅง‹๏ผŒ้‡ๅคๅ…ซๆฌกใ€‚
Create a recurring event on Google Calendar titled 'Computer Vision Course' scheduled for every Wednesday from 6 PM to 8 PM, starting this week and repeating eight times.
[ "Google Calendar" ]
3
[ "ๆ—ถ้—ดๆ ‡้ข˜ๆญฃ็กฎ", "ๅ‘จๆœŸ้‡ๅคๆญฃ็กฎ", "ๅฎŒๆˆไปปๅŠก" ]
[ "correct time and title", "correct period repetition", "task completion" ]
[ 20, 52, 21 ]
27
hard
ๅœจๅŽไฝไผšappไธŠๅธฎๆˆ‘่ฎขไธ€ไธชๆˆ้ƒฝ็š„้…’ๅบ—๏ผŒๅœจๅคฉๅบœๅนฟๅœบ้™„่ฟ‘๏ผŒไปŠๅคฉๅ…ฅไฝๅŽๅคฉ็ฆปๅบ—๏ผŒ้€‰ๆ‹ฉไปทๆ ผๆœ€ไฝŽๆŽ’ๅบ็š„็ฌฌไบŒไธช้…’ๅบ—๏ผŒไฝฟ็”จๅพฎไฟกๆ”ฏไป˜ใ€‚
Book a hotel in Chengdu on the H World app, near Tianfu Square, check-in today and check-out the day after tomorrow. Choose the second cheapest hotel listed when sorted by price, and use WeChat to pay.
[ "H World" ]
6
[ "ๆ—ฅๆœŸๆญฃ็กฎ", "ๅŸŽๅธ‚ๅœฐ็‚นๆญฃ็กฎ", "ไปทๆ ผๆญฃ็กฎ", "่ฟ›ๅ…ฅ้ข„่ฎข็•Œ้ข", "ๆไบค่ฎขๅ•", "ไป˜ๆฌพๆ–นๅผๆญฃ็กฎ" ]
[ "correct dates", "correct city and hotel location", "correct price", "enter booking page", "submit order", "correct payment method" ]
[ 17, 16, 15 ]
28
hard
ๅœจไบฌไธœๆœ็ดขโ€œๆ€ๅฟ…้ฉฐๅŠžๅ…ฌ่ฎพๅค‡ๆ——่ˆฐๅบ—โ€๏ผŒๆŸฅ่ฏขๅนถ่ดญไนฐๅ…ถๅบ—้“บไธญๆœ€ไพฟๅฎœ็š„้บฆๅ…‹้ฃŽ๏ผŒ้€‰ๆ‹ฉๆต…่‰ฒๆฌพๅผๅนถ่ดญไนฐไธคไปฝใ€‚
Search for AISpeech store on JD, find and buy the cheapest microphone in the store, choose the light-colored style, and buy two.
[ "JD" ]
3
[ "่ฟ›ๅ…ฅๅบ—้“บไธป้กต", "ๅ•†ๅ“้€‰ๆ‹ฉๆญฃ็กฎ", "ๅฎŒๆˆไปปๅŠก" ]
[ "enter the store homepage", "correct product selection", "task completion" ]
[ 10, 12, 12 ]
29
hard
็‚นไธ€ไปฝ็ซ‹ๅณๅˆฐๅบ—ๅ–้ค็š„โ€œ้šๅฟƒ้…1+1โ€่ถ…ๅ€ผๅฅ—้ค๏ผŒ้€‰ๆ‹ฉ้บฆ้ฆ™้ธกๅ’Œ่ฟทไฝ ๆœฑๅคๅŠ›ๆ–ฐๅœฐ๏ผŒๅŠ ๅ…ฅ่ดญ็‰ฉ่ฝฆ๏ผŒไธ‹ๅ•ๅนถ้€‰ๆ‹ฉๆ”ฏไป˜ๅฎๆ”ฏไป˜ใ€‚
Order a '1+1 Mix & Match' for immediate in-store pickup, choose McChicken and Mini Chocolate Sundae, add to cart, place the order, and choose Alipay to pay.
[ "McDonald's" ]
5
[ "ๆ‰พๅˆฐๅฅ—้ค", "้€‰ๆ‹ฉ้คๅ“", "็ป“็ฎ—่ฎขๅ•", "ไป˜ๆฌพๆ–นๅผๆญฃ็กฎ", "ๅฎŒๆˆไปปๅŠก" ]
[ "find the '1+1' meal", "select the meal", "place the order", "correct payment method", "task completion" ]
[ 15, 13, 15 ]
30
hard
ๅœจๅคฉๆฐ”ๅบ”็”จไธญๆทปๅŠ โ€œๆˆ้ƒฝโ€่ฟ™ไธชๅŸŽๅธ‚๏ผŒๅนถๅ‘Š่ฏ‰ๆˆ‘ๆ˜Žๅคฉ็š„ๆธฉๅบฆๅŒบ้—ดใ€‚
Add 'Chengdu' in the weather app and tell me the temperature range for tomorrow.
[ "Weather" ]
ๅŸŽๅธ‚ๅˆ—่กจไธญๆฒกๆœ‰ๆˆ้ƒฝ
Ensure that Chengdu is not listed in the city list.
3
[ "่ฟ›ๅ…ฅๅŸŽๅธ‚ๅˆ—่กจ", "ๆˆๅŠŸๆทปๅŠ ", "ไฟกๆฏๆญฃ็กฎ" ]
[ "enter the city list", "successfully added", "correct information" ]
[ 9, 4, 5 ]
31
indirect
ๅธฎๆˆ‘่ฎขไธ€ๅผ ๅŽๅคฉ็š„้ซ˜้“็ฅจ๏ผŒ18:00ไน‹ๅ‰ๅ›žๅŒ—ไบฌใ€‚
Book a high-speed train ticket for the day after tomorrow, returning to Beijing before 6:00 PM.
[ "12306 (China Railway)" ]
้œ€่ฆไฝฟ็”จๅฝ“ๅ‰ๅฎšไฝไฝœไธบๅ‡บๅ‘็ซ™
should use the current location as the departure station
5
[ "ๆ—ฅๆœŸๆญฃ็กฎ", "่ฝฆ็ซ™ๆญฃ็กฎ", "่ฝฆๆฌก้€‰ๆ‹ฉๅˆ้€‚", "ๆทปๅŠ ไน˜่ฝฆไบบ", "ๅฎŒๆˆไปปๅŠก" ]
[ "correct date", "correct stations", "suitable train selection", "add passenger", "task completion" ]
[ 20, 20, 18 ]
32
indirect
ๆฒกๆต้‡ไบ†๏ผŒ็Žฐๅœจๆ‰‹ๆœบไธŠๆœ‰ไป€ไนˆ่ƒฝ็œ‹็š„่ง†้ข‘๏ผŸ
I'm out of mobile data, what videos can I still watch on the phone?
[ "BiliBili" ]
ๆๅ‰็ผ“ๅญ˜ๅคšไธช่ง†้ข‘๏ผŒๅ…ณ้—ญ็ฝ‘็ปœ่ฟžๆŽฅ๏ผŒๅฏไปฅไฝฟ็”จๅ…ถไป–่ง†้ข‘ๅบ”็”จ
download multiple videos in advance, turn off the network connection, it's ok to use other video applications
2
[ "ๆ‰“ๅผ€่ง†้ข‘ๅบ”็”จ", "ๆๅ–็ผ“ๅญ˜่ง†้ข‘ไฟกๆฏ" ]
[ "open video application", "extract downloaded video information" ]
[ 4, 3, 3 ]
33
indirect
ๆŠŠๅˆšๅˆšๆ‹็š„ๆœ€ๅŽไธ€ๅผ ็…ง็‰‡ๅˆ ๆމใ€‚
Delete the last photo taken recently.
[ "Camera" ]
ๆ‹ๅ‡ ๅผ ไธๅŒ็š„็…ง็‰‡๏ผŒๅนถๅœจๅ…ถๅˆ ้™คๅคšๅผ ๆƒ…ๅ†ตๅ‘็”Ÿๆ—ถๆ‰‹ๅŠจๅœๆญขไปปๅŠก
take several different photos, and stop the task manually when multiple deletions occur
2
[ "ๅˆ ้™ค็…ง็‰‡", "ๅˆ ้™คไธ€ๅผ ็…ง็‰‡ไน‹ๅŽ็ป“ๆŸไปปๅŠก" ]
[ "delete photo", "end task after deleting one photo" ]
[ 4, 5, 4 ]
34
indirect
ไธคไธชๅฐๆ—ถๅŽๆ้†’ๆˆ‘ใ€‚
Remind me in two hours.
[ "Clock" ]
2
[ "่ฟ›ๅ…ฅ่ฎกๆ—ถๅ™จ็•Œ้ข", "ๅฎŒๆˆไปปๅŠก" ]
[ "enter the timer page", "task completion" ]
[ 7, 5, 6 ]
35
indirect
ๅธฎๆˆ‘็œ‹็œ‹็”ตๅฝฑ้™ขๅณๅฐ†ไธŠๆ˜ ็š„็”ตๅฝฑไธญๆœ€็ƒญ้—จ็š„ๆ˜ฏๅ“ชไธช๏ผŸ
Help me check which is the most popular movie among the upcoming releases in theaters.
[ "Douban" ]
2
[ "ๆ‰“ๅผ€่ฑ†็“ฃ", "ๅฎŒๆˆไปปๅŠก" ]
[ "open Douban", "task completion" ]
[ 6, 3, 4 ]
36
indirect
่ฎฐๅฝ•ๆˆ‘ๆ˜ŽๅคฉไธŠๅˆๅ็‚นๅˆฐๅไบŒ็‚นๅœจไธŠๆตทไบค้€šๅคงๅญฆ้—ต่กŒๆ กๅŒบไธŠ็š„โ€œ่‡ช็„ถ่ฏญ่จ€ๅค„็†โ€่ฏพ็จ‹ใ€‚
Record that I have a 'Natural Language Processing' class from 10 AM to 12 PM tomorrow at Shanghai Jiao Tong University, Minhang Campus.
[ "Google Calendar" ]
4
[ "่ฟ›ๅ…ฅๅˆ›ๅปบๆ—ฅ็จ‹็•Œ้ข", "ๆ—ถ้—ดๆญฃ็กฎ", "ๅœฐ็‚นๆญฃ็กฎ", "ๅฎŒๆˆไปปๅŠก" ]
[ "enter the calendar creation page", "correct time", "correct location", "task completion" ]
[ 16, 40, 17 ]
37
indirect
ๅ‘Š่ฏ‰ๆˆ‘้™„่ฟ‘ๅฏไปฅๅ…ฅไฝ็š„ๆœ€่ฟ‘้…’ๅบ—๏ผŒๅนถๅธฎๆˆ‘่ฎขไธ€ไธชใ€‚
Tell me about the nearest available hotel nearby and book it for me.
[ "H World" ]
4
[ "้€‰ๆ‹ฉๅฝ“ๅ‰ๅฎšไฝ", "้…’ๅบ—้€‰ๆ‹ฉๆญฃ็กฎ", "ๆไบค่ฎขๅ•", "ๅฎŒๆˆไปปๅŠก" ]
[ "select current location", "correct hotel selection", "submit order", "task completion" ]
[ 10, 10, 10 ]
38
indirect
ๅธฎๆˆ‘้—ฎไธ€ไธ‹ๅฎขๆœ๏ผŒ่ดญ็‰ฉ่ฝฆๆœ€ไธŠ้ข็š„้‚ฃไธชๅ•†ๅ“ๆœ‰ๆฒกๆœ‰ไผ˜ๆƒ ใ€‚
Help me ask customer service if there is any discount for the top item in the shopping cart.
[ "JD" ]
ๆๅ‰ๅฐ†ๅ•†ๅ“ๅŠ ๅ…ฅ่ดญ็‰ฉ่ฝฆ
add the item to the shopping cart in advance
3
[ "้€‰ๆ‹ฉๆญฃ็กฎๅ•†ๅ“", "่ฟ›ๅ…ฅๅฎขๆœ้กต้ข", "ๅฎŒๆˆไปปๅŠก" ]
[ "select the correct item", "enter customer service page", "task completion" ]
[ 7, 8, 8 ]
39
indirect
็‚นไธ€ไปฝโ€œ้บฆ่พฃ้ธก่…ฟๅ กๅฅ—้คโ€ๅ ‚้ฃŸ
Order a McSpicy Chicken Filet Burger Combo for dine-in.
[ "McDonald's" ]
3
[ "ๆ‰พๅˆฐๅฅ—้ค", "็ป“็ฎ—่ฎขๅ•", "ๅฎŒๆˆไปปๅŠก" ]
[ "find the combo", "place the order", "task completion" ]
[ 13, 13, 13 ]
40
indirect
ๆ˜Žๅคฉๆˆ‘้œ€่ฆๅธฆ้›จไผžๅ—๏ผŸ
Do I need to bring an umbrella tomorrow?
[ "Weather" ]
1
[ "ๅฎŒๆˆไปปๅŠก" ]
[ "task completion" ]
[ 3, 1, 2 ]
41
cross_app
ๅธฎๆˆ‘ๆŠŠๆœ€่ฟ‘็š„ไธ€ไธช็ซ่ฝฆ่กŒ็จ‹ๆทปๅŠ ๅˆฐๆ—ฅ็จ‹๏ผŒๆ ‡้ข˜ๆ ผๅผไธบโ€œ{ๅ‡บๅ‘็ซ™}โ€”โ€”{่ฝฆๆฌก}โ€”โ€”{ๅˆฐ่พพ็ซ™}โ€๏ผŒๅนถ่ฎพ็ฝฎๅ…ทไฝ“ๆ—ถ้—ด่Œƒๅ›ดใ€‚
Help me add my latest train journey to the schedule, titled in format of '{Departure Station}โ€”โ€”{Train Number}โ€”โ€”{Arrival Station}', and set exact time range.
[ "12306 (China Railway)", "Google Calendar" ]
3
[ "่ฟ›ๅ…ฅ่ฝฆ็ฅจๅˆ—่กจ", "ๆ ‡้ข˜ๆญฃ็กฎ", "ๆ—ถ้—ดๆญฃ็กฎ" ]
[ "enter ticket list page", "correct title", "correct time" ]
[ 23, 10, 12 ]
42
cross_app
ๅœจB็ซ™ๆ’ญๆ”พโ€œ่ฑ†็“ฃ็”ตๅฝฑTop250โ€ๆŽ’ๅ็ฌฌไธ‰็š„้‚ฃ้ƒจ็”ตๅฝฑใ€‚
Play the movie ranked third in the 'Douban Movie Top 250' on BiliBili.
[ "BiliBili", "Douban" ]
2
[ "็”ตๅฝฑๆญฃ็กฎ", "ๆˆๅŠŸๆ’ญๆ”พ" ]
[ "correct movie", "play successfully" ]
[ 12, 13, 12 ]
43
cross_app
ๆ‰“ๅผ€็›ธๆœบๅนถๅ‘Š่ฏ‰ๆˆ‘้•œๅคดๅ‰็š„็‰ฉๅ“ๆ˜ฏไป€ไนˆ๏ผŒๅœจไบฌไธœๆœ็ดข่ฟ™็ฑปๅ•†ๅ“๏ผŒๅนถ้€‰ๆ‹ฉ่พƒไธบ็›ธไผผ็š„ไธ€้กนๅŠ ๅ…ฅ่ดญ็‰ฉ่ฝฆใ€‚
Open the camera and tell me what item is in front of the lens, search for this type of product on JD, and select a similar item to add to the shopping cart.
[ "Camera", "JD" ]
ๅŽ็ฝฎๆ‘„ๅƒๅคดๅฏน็€ไธ€ไธชๅธธ่ง็‰ฉๅ“
point the back camera at a common object
3
[ "ๅ›ž็ญ”ๆญฃ็กฎ", "ๆˆๅŠŸๆœ็ดข", "ๅŠ ๅ…ฅ่ดญ็‰ฉ่ฝฆ" ]
[ "correct answer", "search successfully", "add to shopping cart" ]
[ 10, 8, 8 ]
44
cross_app
ไฝฟ็”จ้•ฟ็„ฆ้•œๅคดๆ‹ไธ€ๅผ ็…ง็‰‡๏ผŒๅนถๅˆ†ไบซๅˆฐๅพฎไฟกๆœ‹ๅ‹ๅœˆ๏ผŒๅ†™ไธ€ๆฎต่ฏ๏ผŒๅนถๅœจๆœ€ๅŽๅŠ ไธŠๆณจ้‡Šโ€œ่ฟ™ๆ˜ฏ็”ฑMobAๆ‰‹ๆœบๅŠฉๆ‰‹่‡ชๅŠจๅ‘ๅธƒ็š„ๆœ‹ๅ‹ๅœˆโ€ใ€‚
Take a photo with a telephoto lens and share it on WeChat Moments, write a few words with the note 'This is an automatically posted Moments by MobA Mobile phone Assistant' in the end.
[ "Camera", "WeChat" ]
ๆๅ‰็™ปๅฝ•ๅพฎไฟก๏ผŒ้•œๅคดๅฏน็€่ฟœๅค„็š„็‰ฉไฝ“
log in to WeChat in advance, point the camera at a distant object
4
[ "ๆ‹ๆ‘„็…ง็‰‡", "ไปŽๅ›พๅบ“่ฟ›ๅ…ฅๅˆ†ไบซ่œๅ•", "ๆ–‡ๆกˆๆญฃ็กฎ", "ๆˆๅŠŸๅ‘้€ๆœ‹ๅ‹ๅœˆ" ]
[ "take a photo", "enter share menu from gallery", "correct text content", "successfully send moments" ]
[ 10, 10, 10 ]
45
cross_app
ๆˆ‘ๆ˜Žๅคฉๅ‡ ็‚นๅผ€ไผš๏ผŸๅฎšไธ€ไธชๆๅ‰ไธคไธชๅฐๆ—ถ็š„้—น้’Ÿๆ้†’ๆˆ‘ใ€‚
What time is my meeting tomorrow? Set an alarm for two hours before the meeting to remind me.
[ "Google Calendar", "Clock" ]
่ฎพ็ฝฎไธ€ไธชๆ˜Žๅคฉๆ—ฉไธŠๅ็‚นๅผ€ๅง‹็š„ไผš่ฎฎๆ—ฅ็จ‹
set a meeting schedule that starts at 10 AM tomorrow morning
2
[ "ๅ›ž็ญ”ๆญฃ็กฎ", "ๆˆๅŠŸๆทปๅŠ ้—น้’Ÿ" ]
[ "correct answer", "add alarm successfully" ]
[ 10, 11, 12 ]
46
cross_app
ๆˆ‘ๅœจๆ—ฅๅކไธญ่ฎฐๅฝ•ไบ†ๆˆ‘็š„ๆ—ฅ็จ‹ๅฎ‰ๆŽ’๏ผŒ็Žฐๅœจ่ฏทไฝ ๅ‘Š่ฏ‰ๆˆ‘ๅŽปๆทฑๅœณๆ—…ๆธธ้‚ฃๅคฉ็š„ๅคฉๆฐ”ๆƒ…ๅ†ตใ€‚
Check my calendar for the schedule of my trip to Shenzhen and tell me the weather forecast for that day.
[ "Google Calendar", "Weather" ]
ๅœจๆ—ฅๅކไธญๆ–ฐๅปบไธ€ไธชๅŽๅคฉ็š„ๅ…จๅคฉๆ—ฅ็จ‹๏ผŒๅ‘ฝๅไธบโ€œๆทฑๅœณไน‹ๆ—…โ€
create a new all-day schedule for the day after tomorrow in the calendar, titled 'Shenzhen Tour'.
4
[ "่ฟ›ๅ…ฅ่ฐทๆญŒๆ—ฅๅކ", "ๆˆๅŠŸ่Žทๅ–ๆ—ฅๆœŸ", "่ฟ›ๅ…ฅๅคฉๆฐ”", "ๆˆๅŠŸๆŸฅ่ฏขๅคฉๆฐ”" ]
[ "enter google calendar", "correct date retrieval", "enter weather", "successful weather query" ]
[ 9, 6, 6 ]
47
cross_app
ๆŸฅ่ฏขๆˆ‘ๆœ€่ฟ‘็š„ไธ€็ฌ”ๅŽไฝไผš่ฎขๅ•ๅœฐๅ€๏ผŒๅนถๅœจB็ซ™ๆœ็ดขไธ€ไธ‹้‚ฃไธชๅŸŽๅธ‚็š„ๆ—…ๆธธๆ”ป็•ฅใ€‚
Check the address of my most recent order in H World and search for travel guides for that city on BiliBili.
[ "H World", "BiliBili" ]
ๆœ‰ไธ€ไธช่ฎขๅ•๏ผŒๅฏไปฅๆ˜ฏๆœชไป˜ๆฌพๆˆ–ๅ–ๆถˆไป˜ๆฌพ็Šถๆ€
there is an order, which can be in unpaid or canceled payment status
3
[ "่Žทๅ–่ฎขๅ•", "่ฟ›ๅ…ฅBiliBili", "ๆˆๅŠŸๆ’ญๆ”พ่ง†้ข‘" ]
[ "retrieve order", "enter BiliBili", "play video successfully" ]
[ 10, 10, 10 ]
48
cross_app
ๆŠŠๆœ€่ฟ‘็š„ไธ€็ฌ”ไบฌไธœ่ฎขๅ•็š„ๅ•†ๅ“้“พๆŽฅๅˆ†ไบซ็ป™ๅพฎไฟกๅฅฝๅ‹๏ผŒๅนถๅ†™ไธ€ๆฎตๆŽจ่่ฏญใ€‚
Share the product link of the most recent order of JD with my WeChat friends, and write a recommendation message.
[ "JD", "WeChat" ]
ๆœ‰ไธ€ไธช่ฎขๅ•๏ผŒๅฏไปฅๆ˜ฏๆœชไป˜ๆฌพๆˆ–ๅ–ๆถˆไป˜ๆฌพ็Šถๆ€
there is an order, which can be in unpaid or canceled payment status
4
[ "่ฟ›ๅ…ฅ่ฎขๅ•็•Œ้ข", "่ฎขๅ•ๆญฃ็กฎ", "ๆŽจ่่ฏญๅˆ้€‚", "ๆˆๅŠŸๅ‘้€" ]
[ "enter order list page", "correct order", "suitable recommendation message", "share successfully" ]
[ 12, 9, 10 ]
49
cross_app
ๅธฎๆˆ‘ๅœจ้บฆๅฝ“ๅŠณ็‚นไธ€ไธช้บฆ่พฃ้ธก็ฟ…๏ผŒๅฆ‚ๆžœ็Žฐๅœจๅคฉๆฐ”ไธ้€‚ๅˆๅ‡บ้—จๅฐฑ็‚นๅค–ๅ–๏ผŒไธ็„ถๅฐฑๅ ‚้ฃŸ
Order a dine-in Spicy McWings at McDonald's. If the current weather is not suitable for going out, order delivery instead.
[ "McDonald's", "Weather" ]
4
[ "ๆŸฅ่ฏขๅคฉๆฐ”", "้€‰ๆ‹ฉ้คๅ“", "็ป“็ฎ—่ฎขๅ•", "ๅ–้คๆ–นๅผๆญฃ็กฎ" ]
[ "check weather", "select meal", "place the order", "correct dining option" ]
[ 15, 15, 14 ]
50
cross_app
ๅฆ‚ๆžœๆ˜Žๅคฉไธ‹้›จ๏ผŒๅฐฑๅฎšไธชๆ—ฉไธŠๅ็‚น็š„้—น้’Ÿๅซๆˆ‘่ตทๅบŠ๏ผŒไธ็„ถๅฐฑๅฎšๅ…ซ็‚น็š„้—น้’Ÿใ€‚
If it rains tomorrow, set an alarm for 10 AM to wake me up; otherwise, set the alarm for 8 AM.
[ "Weather", "Clock" ]
2
[ "ๆˆๅŠŸๅˆคๆ–ญๆ˜Žๅคฉๅคฉๆฐ”", "ๆˆๅŠŸ่ฎพ็ฝฎ้—น้’Ÿ" ]
[ "correct weather prediction for tomorrow", "set alarm successfully" ]
[ 9, 8, 9 ]

๐ŸŽฎ MobA manipulates mobile phones just like how you would.

๐ŸŒ Website | ๐Ÿ“ƒ Paper | ๐Ÿค— MobBench | ๐Ÿ—ƒ๏ธ Code

็ฎ€ไฝ“ไธญๆ–‡ | English

๐Ÿ”ฅ News

  • [2024.10.18] We open-source MobA on GitHub, and our paper is available on arXiv.

๐Ÿ“– Introduction

Current mobile assistants are limited by dependence on system APIs or struggle with complex user instructions and diverse interfaces due to restricted comprehension and decision-making abilities. To address these challenges, we propose MobA, a novel Mobile phone Agent powered by multimodal large language models that enhances comprehension and planning capabilities through a sophisticated two-level agent architecture. The high-level Global Agent (GA) is responsible for understanding user commands, tracking history memories, and planning tasks. The low-level Local Agent (LA) predicts detailed actions in the form of function calls, guided by sub-tasks and memory from the GA. Integrating a Reflection Module allows for efficient task completion and enables the system to handle previously unseen complex tasks. MobA demonstrates significant improvements in task execution efficiency and completion rate in real-life evaluations, underscoring the potential of MLLM-empowered mobile assistants.

๐Ÿ”ง Deployment

MobA is still under development, and we are keeping updating the code. Please stay tuned!

System Requirements

Make sure you have installed Android Debug Bridge (ADB), and you have connected your Android device to your computer. You should be able to see your devides with command adb devices.

Environment Setup

conda create -n moba python=3.12
conda activate moba
pip install numpy opencv-python openai generativeai pillow colorama

You may also use requirements.txt to install the required packages (However it is not recommended since there are many unused packages).

Run MobA

You need to specify the configuration file in config.yaml before running MobA. You can find the configuration file in the moba folder.

vim ./moba/config.yaml
cd ./moba/agent
python executor.py

You should be able to run MobA smoothly on Windows now. You can find MobBench, the fifty tasks we tested in the paper, on huggingface.

๐Ÿ“‘ Citation

If you find our work useful, please cite us!

@misc{zhu2024moba,
      title={MobA: A Two-Level Agent System for Efficient Mobile Task Automation}, 
      author={Zichen Zhu and Hao Tang and Yansi Li and Kunyao Lan and Yixuan Jiang and Hao Zhou and Yixiao Wang and Situo Zhang and Liangtai Sun and Lu Chen and Kai Yu},
      year={2024},
      eprint={2410.13757},
      archivePrefix={arXiv},
      primaryClass={cs.MA},
      url={https://arxiv.org/abs/2410.13757}, 
}

๐Ÿ“ง Contact Us

If you have any questions, please feel free to contact me via email [email protected].

Downloads last month
76

Paper for OpenDFM/MobA-MobBench