task_id
int64 1
50
| task_type
stringclasses 5
values | task_zh
stringlengths 8
58
| task_en
stringlengths 23
201
| applications
sequencelengths 1
2
| preparation_zh
stringlengths 0
27
| preparation_en
stringlengths 0
109
| total_milestone_scores
int64 1
6
| scoring_milestones_zh
sequencelengths 1
6
| scoring_milestones_en
sequencelengths 1
6
| human_expert_steps
sequencelengths 3
3
|
|---|---|---|---|---|---|---|---|---|---|---|
1
|
easy
|
ๅจ12306ไธญ่ฟๅ
ฅโๆ็โ้กต้ข๏ผๅจๅธธ็จๅ่ฝไธญๅฐappไปๆ ๅๆจกๅผๅๆขๅฐๆฌ่็ใ
|
Go to the 'My' page in the 12306 app, and switch the app from standard mode to senior mode under common functions.
|
[
"12306 (China Railway)"
] |
ๅๆขๅฐๆ ๅๆจกๅผ
|
switch to standard mode
| 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
5,
5,
8
] |
2
|
easy
|
ๅจB็ซๆ็ดขโ้ซ็ญๆฐๅญฆโๅนถๆญๆพ็ฌฌไธไธช่ง้ขใ
|
Search 'Advanced Mathematics' in Bilibili and play the first video.
|
[
"BiliBili"
] | 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
5,
6,
5
] |
||
3
|
easy
|
ไฝฟ็จ็ธๆบ็ๅคๆฏๆจกๅผๆไธๅผ ็
ง็ใ
|
Take a photo using the camera's night mode.
|
[
"Camera"
] | 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
3,
4,
3
] |
||
4
|
easy
|
ๅ
ณ้ญๆๆฉ็ไธไธชๆๅผ็้น้ใ
|
Turn off the earliest enabled alarm.
|
[
"Clock"
] |
่ฎพ็ฝฎๅนถๆๅผๆฐไธช้น้
|
Set and turn on several alarms.
| 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
2,
2,
2
] |
5
|
easy
|
ๆฅ่ฏขใไบบๅทฅๆบ่ฝใ่ฟ้จ็ตๅฝฑ็่ฑ็ฃ่ฏๅใ
|
Check the Douban rating of the movie 'Artificial Intelligence'.
|
[
"Douban"
] | 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
5,
4,
5
] |
||
6
|
easy
|
ๅจ่ฐทๆญๆฅๅไธๅๅปบไธไธชๆฐไปปๅก๏ผๆ ้ขไธบโ่ฎบๆๅไฝโใ
|
Create a new task on Google Calendar titled 'Writing Paper'.
|
[
"Google Calendar"
] | 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
5,
5,
5
] |
||
7
|
easy
|
ๅธฎๆๆฅ่ฏขๆ็ๅไฝไผไผๅๆ
ๅต๏ผๅนถๅ่ฏๆๆๆๅฏไปฅๅ ็น้ๆฟใ
|
Help me check my H World membership and tell me the latest checkout time.
|
[
"H World"
] |
็ปๅฝ่ดฆๅท
|
logging in an account
| 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
3,
2,
3
] |
8
|
easy
|
ๅจไบฌไธๆ็ดข้บฆๅ
้ฃ๏ผๅนถๅฐ็ฌฌไธไธชๅๅๆทปๅ ๅฐ่ดญ็ฉ่ฝฆใ
|
Search for microphones on JD and add the first product to the cart.
|
[
"JD"
] | 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
6,
5,
6
] |
||
9
|
easy
|
ๅฐ้บฆๅฝๅณapp็่ฏญ่จๅๆขไธบ่ฑๆใ
|
Switch the language of the McDonald's app to English.
|
[
"McDonald's"
] |
ๅๆข่ฏญ่จไธบไธญๆ
|
switch the language to Chinese
| 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
7,
6,
7
] |
10
|
easy
|
ๆฅ็ๆๅคฉ็ๅคฉๆฐใ
|
Check tomorrow's weather.
|
[
"Weather"
] | 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
2,
1,
1
] |
||
11
|
medium
|
ๅจๆถๅป่กจไธญๆฅ่ฏขๆๅคฉไปไธๆตทๅฐๅไบฌ็็ผๅทไธบG104็ๅ่ฝฆ้ข่ฎกๅ ็น้ๅฐๅไบฌใ
|
Check the timetable for train G104 from Shanghai to Beijing tomorrow, and find out what time it is expected to arrive in Nanjing.
|
[
"12306 (China Railway)"
] | 3
|
[
"่ฟๅ
ฅๆถๅป่กจๆฅ่ฏข็้ข",
"่ฝฆๆฌกๆญฃ็กฎ",
"ๅฎๆไปปๅก"
] |
[
"enter the timetable page",
"enter train number correctly",
"task completion"
] |
[
13,
13,
9
] |
||
12
|
medium
|
ไปB็ซโ็ญ้จโ้กต้ข่ฟๅ
ฅๆ่กๆฆ๏ผๆญๆพๆๅ็ฌฌไธ็่ง้ข๏ผๅ
ณๆณจ่ฟไธช่ดฆๅทๅนถๆทปๅ ่ฟไธช่ง้ขๅฐๆถ่ๅคนไธญใ
|
Go to the leaderboard from the 'Hot' page of BiliBili, play the top-ranked video, follow the account, and add the video to the favorites.
|
[
"BiliBili"
] |
ๅๆถๅ
ณๆณจ่ฟไธช็ฎๆ ่ดฆๅท๏ผๅ
ณ้ญ่ชๅจ่ฟๆญๅ่ฝ
|
unfollow this target account, turn off the auto-play function
| 4
|
[
"่ฟๅ
ฅๆ่กๆฆ",
"ๆญๆพ่ง้ข",
"ๅ
ณๆณจ่ดฆๅท",
"ๆทปๅ ๆถ่"
] |
[
"enter the leaderboard page",
"play video correctly",
"follow the account",
"add to favorites"
] |
[
6,
6,
6
] |
13
|
medium
|
่ฎพ็ฝฎๆ้ฟๅปถ่ฟ็ๅฎๆถ่ชๆใ
|
Set the timer selfie with the longest delay.
|
[
"Camera"
] |
ๅๆขๅฐๅ็ฝฎๆๅๅคด
|
switch to the back camera
| 2
|
[
"่ฎพ็ฝฎๅปถ่ฟ",
"ๅๆขๅ็ฝฎ"
] |
[
"set delay",
"switch to the front camera"
] |
[
4,
4,
6
] |
14
|
medium
|
ๅจไธ็ๆถ้ดๅ่กจไธญๆทปๅ ้ฆๆธฏใ
|
Add Hong Kong in list of world clock.
|
[
"Clock"
] |
ไปไธ็ๆถ้ดๅ่กจไธญ็งป้ค้ฆๆธฏ
|
remove Hong Kong from the list of world clock
| 2
|
[
"่ฟๅ
ฅๆทปๅ ็้ข",
"ๅฎๆไปปๅก"
] |
[
"enter the page to add a city",
"task completion"
] |
[
6,
6,
6
] |
15
|
medium
|
ๆ็ดขๅนถ่ฟๅ
ฅโ่ฑ็ฃ็ตๅฝฑTop250โๆฆๅ๏ผ็ปๆๅ็ฌฌไบ็็ตๅฝฑๆ้กถ้จ็็ญ่ฏ็น่ตใ
|
Search 'Douban Movie Top 250' leaderboard and like the top comment of the second-ranked movie.
|
[
"Douban"
] |
่ฟๆก็ญ่ฏๆฒกๆ่ขซ็น่ต
|
this comment has not been liked
| 2
|
[
"่ฟๅ
ฅๆฆๅ",
"ๅฎๆไปปๅก"
] |
[
"enter the leaderboard page",
"task completion"
] |
[
11,
5,
8
] |
16
|
medium
|
ๅจ่ฐทๆญๆฅๅไธๅๅปบไธไธชๆฐๆฅ็จ๏ผๆฅๆไธบๆๅคฉ๏ผๆถ้ดไธบๅ
จๅคฉ๏ผๆ ้ขไธบโๅ
ฌๅธๅนดไผโใ
|
Create a new all-day event on Google Calendar for tomorrow titled 'Company Annual Meeting'.
|
[
"Google Calendar"
] | 2
|
[
"ๆฅๆๆถ้ดๆญฃ็กฎ",
"ๅฎๆไปปๅก"
] |
[
"correct date and time",
"task completion"
] |
[
8,
6,
6
] |
||
17
|
medium
|
ๅจๅไฝไผappไธๆฅ่ฏข่ฏๅๆ้ซ็ๆฑๅบญ้
ๅบ็่ตทไปทใ
|
Use the H World app, search for the starting price of the nearest Hanting Hotel to me.
|
[
"H World"
] | 2
|
[
"ไฝฟ็จ่ฏๅๆๅฅฝๆๅบ",
"ๅฎๆไปปๅก"
] |
[
"sort by highest rating",
"task completion"
] |
[
7,
5,
8
] |
||
18
|
medium
|
ๅฏปๆพไบฌไธๅๅไธญๆ่ฟๅพ
่ฏไปท็ไธไธชๅๅ๏ผ็ปๅบๆปกๅๅฅฝ่ฏ๏ผๅนถๅไธๆฎต่ฏ่ฏญ๏ผๆทปๅ ไธคๅผ ็
ง็ใ
|
Find the most recent product awaiting comment on JD, give it a full five-star rating, write a comment, and add two photos.
|
[
"JD"
] |
ๆๆช่ฏไปท็่ฎขๅ
|
there are orders that have not been commented on
| 2
|
[
"่ฟๅ
ฅ่ฏไปท็้ข",
"ๅฎๆไปปๅก"
] |
[
"enter comment page",
"task completion"
] |
[
14,
8,
12
] |
19
|
medium
|
็ปๆ่ฟ็ไธ็ฌ้บฆๅฝๅณ่ฎขๅๅผๅ็ฅจ๏ผๅนถ้ๆฉโไธๆตทไบค้ๅคงๅญฆโๆฌๅคดใ
|
Invoice the most recent McDonald's order, and select 'SJTU' for invoice title.
|
[
"McDonald's"
] |
ๆๆชๅผๅ็ฅจ่ฎขๅ
|
there is an order and no invoice has been issued
| 3
|
[
"่ฟๅ
ฅ่ฎขๅ็้ข",
"้ๅๆฌๅคดๆญฃ็กฎ",
"ๅฎๆไปปๅก"
] |
[
"enter order list page",
"select invoice title correctly",
"task completion"
] |
[
9,
9,
12
] |
20
|
medium
|
ๆฅ็ไปๆๆฅ่ฝ็ๆถ้ดใ
|
Check the sunset time of tonight.
|
[
"Weather"
] | 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
4,
2,
3
] |
||
21
|
hard
|
่ดญไนฐไธๅผ ๆๅคฉไปไธๆตทๅฐๅไบฌ๏ผๅบๅๆถ้ดๅจไน็นๅฐๅไธ็นไน้ด็้ซ้็ฅจใ
|
Buy a high-speed train ticket for tomorrow from Shanghai to Beijing, with departure time between 9 AM and 11 AM.
|
[
"12306 (China Railway)"
] |
ๅบๅ็ซๅๅฐ่พพ็ซไธ้ฆ้กตๅฑ็คบ็ไธๅ๏ผไธๆ็ด่พพไฝ็ฅจๅณๅฏ
|
departure and arrival stations differ from those on the homepage, and direct tickets must be available.
| 5
|
[
"ๆฅๆๆญฃ็กฎ",
"่ฝฆ็ซๆญฃ็กฎ",
"่ฝฆๆฌก้ๆฉๅ้",
"ๆทปๅ ไน่ฝฆไบบ",
"ๅฎๆไปปๅก"
] |
[
"correct date",
"correct station",
"suitable train selection",
"add passenger",
"complete task"
] |
[
20,
20,
18
] |
22
|
hard
|
ๅจB็ซๆ็ดขๅนถๅ
ณๆณจโX-LANCEๅฎ้ชๅฎคโ่ดฆๅท๏ผๆญๆพๆๆฐ็่ง้ข๏ผๅนถๆฐๅไธๆฎตๅๅ็่ฏ่ฎบใ
|
Search in BiliBili and follow the 'X-LANCE' Lab account, play the newest video and write a friendly comment.
|
[
"BiliBili"
] |
ๅๆถๅ
ณๆณจ่ฟไธช่ดฆๅท๏ผๅ
ณ้ญ่ชๅจ่ฟๆญๅ่ฝ
|
unfollow this account, turn off the auto-play function
| 4
|
[
"่ฟๅ
ฅ่ดฆๅทไธป้กต",
"ๅ
ณๆณจ่ดฆๅท",
"ๆญๆพ่ง้ข",
"ๆฐๅ่ฏ่ฎบ"
] |
[
"enter homepage of this account",
"follow this account",
"play video correctly",
"write comment"
] |
[
12,
10,
10
] |
23
|
hard
|
ๅฐ็ธๆบๅๆขๅฐ่ง้ขๆจกๅผ๏ผๆๅผ่กฅๅ
็ฏ๏ผ็ป่ดจ่ฎพ็ฝฎๆโFHD 30FPSโ๏ผๅฝๅถๆถ้ด่ถ
่ฟไบๅ็งไปฅๅๅๆญขๅฝๅถใ
|
Switch the camera to video mode, turn on the fill light, set the video quality to 'FHD 30FPS', and stop recording after more than twenty seconds.
|
[
"Camera"
] |
ๅๆขๅฐ้่ง้ขๆจกๅผ๏ผ้็ฝฎๆๆ่ฎพ็ฝฎ
|
switch to non-video mode, reset all settings
| 3
|
[
"ๆๅผ่กฅๅ
็ฏ",
"ๅๆข็ป่ดจ",
"ๅฎๆ20sๅฝๅถ"
] |
[
"turn on the fill light",
"switch video quality",
"complete 20s recording"
] |
[
9,
9,
8
] |
24
|
hard
|
ๅๅปบไธไธชๅ็น็้น้๏ผๆ ้ขไธบโไธ็ญโ๏ผๅจๆฏๅจไธ่ณๅไฝฟ็จๆฏๅจๆ้ใ
|
Create an alarm at 10 o'clock titled 'Work' with vibration reminders on every Monday to Thursday.
|
[
"Clock"
] | 5
|
[
"ๆถ้ดๆญฃ็กฎ",
"ๆ ้ขๆญฃ็กฎ",
"ๅจๆ้ๅคๆญฃ็กฎ",
"ๅ
ณ้ญๅ้ๆๅผๆฏๅจ",
"ๅฎๆไปปๅก"
] |
[
"correct time",
"correct title",
"correct period repetition",
"turn off the ringtone and turn on the vibration",
"task completion"
] |
[
14,
45,
15
] |
||
25
|
hard
|
ๅจ่ฑ็ฃไธญๆ็ดข็ตๅฝฑใ่็ณๅ
็ๆ่ตใ๏ผๆ ่ฎฐไธบโ็่ฟโ๏ผ็ปไบ้ขๆ็่ฏๅ๏ผๅนถ็ไธไธๆก็งฐ่ต็ๅฝฑ่ฏใ
|
Search for the movie 'The Shawshank Redemption' on Douban, mark it as 'watched', rate it with five stars, and leave a positive review.
|
[
"Douban"
] |
ๅ ้ค่ฟ้จ็ตๅฝฑ็ไนๅ็ๆ ่ฎฐใ่ฏๅๅๅฝฑ่ฏ
|
Remove the previous mark, rating, and review of this movie.
| 4
|
[
"่ฟๅ
ฅ็ตๅฝฑ่ฏฆๆ
็้ข",
"ๆๅๆ ่ฎฐ",
"ๆๅๆๅ",
"็ไธๅฝฑ่ฏ"
] |
[
"enter the movie details page",
"successfully marked",
"successfully rated",
"leave a review"
] |
[
10,
10,
9
] |
26
|
hard
|
ๅจ่ฐทๆญๆฅๅไธๅๅปบไธไธชๅจๆๆฅ็จ๏ผๆ ้ขไธบโ่ฎก็ฎๆบ่ง่ง่ฏพ็จโ๏ผๆถ้ดไธบๆฏๅจไธๆไธๅ
ญ็นๅฐๅ
ซ็น๏ผไปๆฌๅจๅผๅง๏ผ้ๅคๅ
ซๆฌกใ
|
Create a recurring event on Google Calendar titled 'Computer Vision Course' scheduled for every Wednesday from 6 PM to 8 PM, starting this week and repeating eight times.
|
[
"Google Calendar"
] | 3
|
[
"ๆถ้ดๆ ้ขๆญฃ็กฎ",
"ๅจๆ้ๅคๆญฃ็กฎ",
"ๅฎๆไปปๅก"
] |
[
"correct time and title",
"correct period repetition",
"task completion"
] |
[
20,
52,
21
] |
||
27
|
hard
|
ๅจๅไฝไผappไธๅธฎๆ่ฎขไธไธชๆ้ฝ็้
ๅบ๏ผๅจๅคฉๅบๅนฟๅบ้่ฟ๏ผไปๅคฉๅ
ฅไฝๅๅคฉ็ฆปๅบ๏ผ้ๆฉไปทๆ ผๆไฝๆๅบ็็ฌฌไบไธช้
ๅบ๏ผไฝฟ็จๅพฎไฟกๆฏไปใ
|
Book a hotel in Chengdu on the H World app, near Tianfu Square, check-in today and check-out the day after tomorrow. Choose the second cheapest hotel listed when sorted by price, and use WeChat to pay.
|
[
"H World"
] | 6
|
[
"ๆฅๆๆญฃ็กฎ",
"ๅๅธๅฐ็นๆญฃ็กฎ",
"ไปทๆ ผๆญฃ็กฎ",
"่ฟๅ
ฅ้ข่ฎข็้ข",
"ๆไบค่ฎขๅ",
"ไปๆฌพๆนๅผๆญฃ็กฎ"
] |
[
"correct dates",
"correct city and hotel location",
"correct price",
"enter booking page",
"submit order",
"correct payment method"
] |
[
17,
16,
15
] |
||
28
|
hard
|
ๅจไบฌไธๆ็ดขโๆๅฟ
้ฉฐๅๅ
ฌ่ฎพๅคๆ่ฐๅบโ๏ผๆฅ่ฏขๅนถ่ดญไนฐๅ
ถๅบ้บไธญๆไพฟๅฎ็้บฆๅ
้ฃ๏ผ้ๆฉๆต
่ฒๆฌพๅผๅนถ่ดญไนฐไธคไปฝใ
|
Search for AISpeech store on JD, find and buy the cheapest microphone in the store, choose the light-colored style, and buy two.
|
[
"JD"
] | 3
|
[
"่ฟๅ
ฅๅบ้บไธป้กต",
"ๅๅ้ๆฉๆญฃ็กฎ",
"ๅฎๆไปปๅก"
] |
[
"enter the store homepage",
"correct product selection",
"task completion"
] |
[
10,
12,
12
] |
||
29
|
hard
|
็นไธไปฝ็ซๅณๅฐๅบๅ้ค็โ้ๅฟ้
1+1โ่ถ
ๅผๅฅ้ค๏ผ้ๆฉ้บฆ้ฆ้ธกๅ่ฟทไฝ ๆฑๅคๅๆฐๅฐ๏ผๅ ๅ
ฅ่ดญ็ฉ่ฝฆ๏ผไธๅๅนถ้ๆฉๆฏไปๅฎๆฏไปใ
|
Order a '1+1 Mix & Match' for immediate in-store pickup, choose McChicken and Mini Chocolate Sundae, add to cart, place the order, and choose Alipay to pay.
|
[
"McDonald's"
] | 5
|
[
"ๆพๅฐๅฅ้ค",
"้ๆฉ้คๅ",
"็ป็ฎ่ฎขๅ",
"ไปๆฌพๆนๅผๆญฃ็กฎ",
"ๅฎๆไปปๅก"
] |
[
"find the '1+1' meal",
"select the meal",
"place the order",
"correct payment method",
"task completion"
] |
[
15,
13,
15
] |
||
30
|
hard
|
ๅจๅคฉๆฐๅบ็จไธญๆทปๅ โๆ้ฝโ่ฟไธชๅๅธ๏ผๅนถๅ่ฏๆๆๅคฉ็ๆธฉๅบฆๅบ้ดใ
|
Add 'Chengdu' in the weather app and tell me the temperature range for tomorrow.
|
[
"Weather"
] |
ๅๅธๅ่กจไธญๆฒกๆๆ้ฝ
|
Ensure that Chengdu is not listed in the city list.
| 3
|
[
"่ฟๅ
ฅๅๅธๅ่กจ",
"ๆๅๆทปๅ ",
"ไฟกๆฏๆญฃ็กฎ"
] |
[
"enter the city list",
"successfully added",
"correct information"
] |
[
9,
4,
5
] |
31
|
indirect
|
ๅธฎๆ่ฎขไธๅผ ๅๅคฉ็้ซ้็ฅจ๏ผ18:00ไนๅๅๅไบฌใ
|
Book a high-speed train ticket for the day after tomorrow, returning to Beijing before 6:00 PM.
|
[
"12306 (China Railway)"
] |
้่ฆไฝฟ็จๅฝๅๅฎไฝไฝไธบๅบๅ็ซ
|
should use the current location as the departure station
| 5
|
[
"ๆฅๆๆญฃ็กฎ",
"่ฝฆ็ซๆญฃ็กฎ",
"่ฝฆๆฌก้ๆฉๅ้",
"ๆทปๅ ไน่ฝฆไบบ",
"ๅฎๆไปปๅก"
] |
[
"correct date",
"correct stations",
"suitable train selection",
"add passenger",
"task completion"
] |
[
20,
20,
18
] |
32
|
indirect
|
ๆฒกๆต้ไบ๏ผ็ฐๅจๆๆบไธๆไปไน่ฝ็็่ง้ข๏ผ
|
I'm out of mobile data, what videos can I still watch on the phone?
|
[
"BiliBili"
] |
ๆๅ็ผๅญๅคไธช่ง้ข๏ผๅ
ณ้ญ็ฝ็ป่ฟๆฅ๏ผๅฏไปฅไฝฟ็จๅ
ถไป่ง้ขๅบ็จ
|
download multiple videos in advance, turn off the network connection, it's ok to use other video applications
| 2
|
[
"ๆๅผ่ง้ขๅบ็จ",
"ๆๅ็ผๅญ่ง้ขไฟกๆฏ"
] |
[
"open video application",
"extract downloaded video information"
] |
[
4,
3,
3
] |
33
|
indirect
|
ๆๅๅๆ็ๆๅไธๅผ ็
ง็ๅ ๆใ
|
Delete the last photo taken recently.
|
[
"Camera"
] |
ๆๅ ๅผ ไธๅ็็
ง็๏ผๅนถๅจๅ
ถๅ ้คๅคๅผ ๆ
ๅตๅ็ๆถๆๅจๅๆญขไปปๅก
|
take several different photos, and stop the task manually when multiple deletions occur
| 2
|
[
"ๅ ้ค็
ง็",
"ๅ ้คไธๅผ ็
ง็ไนๅ็ปๆไปปๅก"
] |
[
"delete photo",
"end task after deleting one photo"
] |
[
4,
5,
4
] |
34
|
indirect
|
ไธคไธชๅฐๆถๅๆ้ๆใ
|
Remind me in two hours.
|
[
"Clock"
] | 2
|
[
"่ฟๅ
ฅ่ฎกๆถๅจ็้ข",
"ๅฎๆไปปๅก"
] |
[
"enter the timer page",
"task completion"
] |
[
7,
5,
6
] |
||
35
|
indirect
|
ๅธฎๆ็็็ตๅฝฑ้ขๅณๅฐไธๆ ็็ตๅฝฑไธญๆ็ญ้จ็ๆฏๅชไธช๏ผ
|
Help me check which is the most popular movie among the upcoming releases in theaters.
|
[
"Douban"
] | 2
|
[
"ๆๅผ่ฑ็ฃ",
"ๅฎๆไปปๅก"
] |
[
"open Douban",
"task completion"
] |
[
6,
3,
4
] |
||
36
|
indirect
|
่ฎฐๅฝๆๆๅคฉไธๅๅ็นๅฐๅไบ็นๅจไธๆตทไบค้ๅคงๅญฆ้ต่กๆ กๅบไธ็โ่ช็ถ่ฏญ่จๅค็โ่ฏพ็จใ
|
Record that I have a 'Natural Language Processing' class from 10 AM to 12 PM tomorrow at Shanghai Jiao Tong University, Minhang Campus.
|
[
"Google Calendar"
] | 4
|
[
"่ฟๅ
ฅๅๅปบๆฅ็จ็้ข",
"ๆถ้ดๆญฃ็กฎ",
"ๅฐ็นๆญฃ็กฎ",
"ๅฎๆไปปๅก"
] |
[
"enter the calendar creation page",
"correct time",
"correct location",
"task completion"
] |
[
16,
40,
17
] |
||
37
|
indirect
|
ๅ่ฏๆ้่ฟๅฏไปฅๅ
ฅไฝ็ๆ่ฟ้
ๅบ๏ผๅนถๅธฎๆ่ฎขไธไธชใ
|
Tell me about the nearest available hotel nearby and book it for me.
|
[
"H World"
] | 4
|
[
"้ๆฉๅฝๅๅฎไฝ",
"้
ๅบ้ๆฉๆญฃ็กฎ",
"ๆไบค่ฎขๅ",
"ๅฎๆไปปๅก"
] |
[
"select current location",
"correct hotel selection",
"submit order",
"task completion"
] |
[
10,
10,
10
] |
||
38
|
indirect
|
ๅธฎๆ้ฎไธไธๅฎขๆ๏ผ่ดญ็ฉ่ฝฆๆไธ้ข็้ฃไธชๅๅๆๆฒกๆไผๆ ใ
|
Help me ask customer service if there is any discount for the top item in the shopping cart.
|
[
"JD"
] |
ๆๅๅฐๅๅๅ ๅ
ฅ่ดญ็ฉ่ฝฆ
|
add the item to the shopping cart in advance
| 3
|
[
"้ๆฉๆญฃ็กฎๅๅ",
"่ฟๅ
ฅๅฎขๆ้กต้ข",
"ๅฎๆไปปๅก"
] |
[
"select the correct item",
"enter customer service page",
"task completion"
] |
[
7,
8,
8
] |
39
|
indirect
|
็นไธไปฝโ้บฆ่พฃ้ธก่
ฟๅ กๅฅ้คโๅ ้ฃ
|
Order a McSpicy Chicken Filet Burger Combo for dine-in.
|
[
"McDonald's"
] | 3
|
[
"ๆพๅฐๅฅ้ค",
"็ป็ฎ่ฎขๅ",
"ๅฎๆไปปๅก"
] |
[
"find the combo",
"place the order",
"task completion"
] |
[
13,
13,
13
] |
||
40
|
indirect
|
ๆๅคฉๆ้่ฆๅธฆ้จไผๅ๏ผ
|
Do I need to bring an umbrella tomorrow?
|
[
"Weather"
] | 1
|
[
"ๅฎๆไปปๅก"
] |
[
"task completion"
] |
[
3,
1,
2
] |
||
41
|
cross_app
|
ๅธฎๆๆๆ่ฟ็ไธไธช็ซ่ฝฆ่ก็จๆทปๅ ๅฐๆฅ็จ๏ผๆ ้ขๆ ผๅผไธบโ{ๅบๅ็ซ}โโ{่ฝฆๆฌก}โโ{ๅฐ่พพ็ซ}โ๏ผๅนถ่ฎพ็ฝฎๅ
ทไฝๆถ้ด่ๅดใ
|
Help me add my latest train journey to the schedule, titled in format of '{Departure Station}โโ{Train Number}โโ{Arrival Station}', and set exact time range.
|
[
"12306 (China Railway)",
"Google Calendar"
] | 3
|
[
"่ฟๅ
ฅ่ฝฆ็ฅจๅ่กจ",
"ๆ ้ขๆญฃ็กฎ",
"ๆถ้ดๆญฃ็กฎ"
] |
[
"enter ticket list page",
"correct title",
"correct time"
] |
[
23,
10,
12
] |
||
42
|
cross_app
|
ๅจB็ซๆญๆพโ่ฑ็ฃ็ตๅฝฑTop250โๆๅ็ฌฌไธ็้ฃ้จ็ตๅฝฑใ
|
Play the movie ranked third in the 'Douban Movie Top 250' on BiliBili.
|
[
"BiliBili",
"Douban"
] | 2
|
[
"็ตๅฝฑๆญฃ็กฎ",
"ๆๅๆญๆพ"
] |
[
"correct movie",
"play successfully"
] |
[
12,
13,
12
] |
||
43
|
cross_app
|
ๆๅผ็ธๆบๅนถๅ่ฏๆ้ๅคดๅ็็ฉๅๆฏไปไน๏ผๅจไบฌไธๆ็ดข่ฟ็ฑปๅๅ๏ผๅนถ้ๆฉ่พไธบ็ธไผผ็ไธ้กนๅ ๅ
ฅ่ดญ็ฉ่ฝฆใ
|
Open the camera and tell me what item is in front of the lens, search for this type of product on JD, and select a similar item to add to the shopping cart.
|
[
"Camera",
"JD"
] |
ๅ็ฝฎๆๅๅคดๅฏน็ไธไธชๅธธ่ง็ฉๅ
|
point the back camera at a common object
| 3
|
[
"ๅ็ญๆญฃ็กฎ",
"ๆๅๆ็ดข",
"ๅ ๅ
ฅ่ดญ็ฉ่ฝฆ"
] |
[
"correct answer",
"search successfully",
"add to shopping cart"
] |
[
10,
8,
8
] |
44
|
cross_app
|
ไฝฟ็จ้ฟ็ฆ้ๅคดๆไธๅผ ็
ง็๏ผๅนถๅไบซๅฐๅพฎไฟกๆๅๅ๏ผๅไธๆฎต่ฏ๏ผๅนถๅจๆๅๅ ไธๆณจ้โ่ฟๆฏ็ฑMobAๆๆบๅฉๆ่ชๅจๅๅธ็ๆๅๅโใ
|
Take a photo with a telephoto lens and share it on WeChat Moments, write a few words with the note 'This is an automatically posted Moments by MobA Mobile phone Assistant' in the end.
|
[
"Camera",
"WeChat"
] |
ๆๅ็ปๅฝๅพฎไฟก๏ผ้ๅคดๅฏน็่ฟๅค็็ฉไฝ
|
log in to WeChat in advance, point the camera at a distant object
| 4
|
[
"ๆๆ็
ง็",
"ไปๅพๅบ่ฟๅ
ฅๅไบซ่ๅ",
"ๆๆกๆญฃ็กฎ",
"ๆๅๅ้ๆๅๅ"
] |
[
"take a photo",
"enter share menu from gallery",
"correct text content",
"successfully send moments"
] |
[
10,
10,
10
] |
45
|
cross_app
|
ๆๆๅคฉๅ ็นๅผไผ๏ผๅฎไธไธชๆๅไธคไธชๅฐๆถ็้น้ๆ้ๆใ
|
What time is my meeting tomorrow? Set an alarm for two hours before the meeting to remind me.
|
[
"Google Calendar",
"Clock"
] |
่ฎพ็ฝฎไธไธชๆๅคฉๆฉไธๅ็นๅผๅง็ไผ่ฎฎๆฅ็จ
|
set a meeting schedule that starts at 10 AM tomorrow morning
| 2
|
[
"ๅ็ญๆญฃ็กฎ",
"ๆๅๆทปๅ ้น้"
] |
[
"correct answer",
"add alarm successfully"
] |
[
10,
11,
12
] |
46
|
cross_app
|
ๆๅจๆฅๅไธญ่ฎฐๅฝไบๆ็ๆฅ็จๅฎๆ๏ผ็ฐๅจ่ฏทไฝ ๅ่ฏๆๅปๆทฑๅณๆ
ๆธธ้ฃๅคฉ็ๅคฉๆฐๆ
ๅตใ
|
Check my calendar for the schedule of my trip to Shenzhen and tell me the weather forecast for that day.
|
[
"Google Calendar",
"Weather"
] |
ๅจๆฅๅไธญๆฐๅปบไธไธชๅๅคฉ็ๅ
จๅคฉๆฅ็จ๏ผๅฝๅไธบโๆทฑๅณไนๆ
โ
|
create a new all-day schedule for the day after tomorrow in the calendar, titled 'Shenzhen Tour'.
| 4
|
[
"่ฟๅ
ฅ่ฐทๆญๆฅๅ",
"ๆๅ่ทๅๆฅๆ",
"่ฟๅ
ฅๅคฉๆฐ",
"ๆๅๆฅ่ฏขๅคฉๆฐ"
] |
[
"enter google calendar",
"correct date retrieval",
"enter weather",
"successful weather query"
] |
[
9,
6,
6
] |
47
|
cross_app
|
ๆฅ่ฏขๆๆ่ฟ็ไธ็ฌๅไฝไผ่ฎขๅๅฐๅ๏ผๅนถๅจB็ซๆ็ดขไธไธ้ฃไธชๅๅธ็ๆ
ๆธธๆป็ฅใ
|
Check the address of my most recent order in H World and search for travel guides for that city on BiliBili.
|
[
"H World",
"BiliBili"
] |
ๆไธไธช่ฎขๅ๏ผๅฏไปฅๆฏๆชไปๆฌพๆๅๆถไปๆฌพ็ถๆ
|
there is an order, which can be in unpaid or canceled payment status
| 3
|
[
"่ทๅ่ฎขๅ",
"่ฟๅ
ฅBiliBili",
"ๆๅๆญๆพ่ง้ข"
] |
[
"retrieve order",
"enter BiliBili",
"play video successfully"
] |
[
10,
10,
10
] |
48
|
cross_app
|
ๆๆ่ฟ็ไธ็ฌไบฌไธ่ฎขๅ็ๅๅ้พๆฅๅไบซ็ปๅพฎไฟกๅฅฝๅ๏ผๅนถๅไธๆฎตๆจ่่ฏญใ
|
Share the product link of the most recent order of JD with my WeChat friends, and write a recommendation message.
|
[
"JD",
"WeChat"
] |
ๆไธไธช่ฎขๅ๏ผๅฏไปฅๆฏๆชไปๆฌพๆๅๆถไปๆฌพ็ถๆ
|
there is an order, which can be in unpaid or canceled payment status
| 4
|
[
"่ฟๅ
ฅ่ฎขๅ็้ข",
"่ฎขๅๆญฃ็กฎ",
"ๆจ่่ฏญๅ้",
"ๆๅๅ้"
] |
[
"enter order list page",
"correct order",
"suitable recommendation message",
"share successfully"
] |
[
12,
9,
10
] |
49
|
cross_app
|
ๅธฎๆๅจ้บฆๅฝๅณ็นไธไธช้บฆ่พฃ้ธก็ฟ
๏ผๅฆๆ็ฐๅจๅคฉๆฐไธ้ๅๅบ้จๅฐฑ็นๅคๅ๏ผไธ็ถๅฐฑๅ ้ฃ
|
Order a dine-in Spicy McWings at McDonald's. If the current weather is not suitable for going out, order delivery instead.
|
[
"McDonald's",
"Weather"
] | 4
|
[
"ๆฅ่ฏขๅคฉๆฐ",
"้ๆฉ้คๅ",
"็ป็ฎ่ฎขๅ",
"ๅ้คๆนๅผๆญฃ็กฎ"
] |
[
"check weather",
"select meal",
"place the order",
"correct dining option"
] |
[
15,
15,
14
] |
||
50
|
cross_app
|
ๅฆๆๆๅคฉไธ้จ๏ผๅฐฑๅฎไธชๆฉไธๅ็น็้น้ๅซๆ่ตทๅบ๏ผไธ็ถๅฐฑๅฎๅ
ซ็น็้น้ใ
|
If it rains tomorrow, set an alarm for 10 AM to wake me up; otherwise, set the alarm for 8 AM.
|
[
"Weather",
"Clock"
] | 2
|
[
"ๆๅๅคๆญๆๅคฉๅคฉๆฐ",
"ๆๅ่ฎพ็ฝฎ้น้"
] |
[
"correct weather prediction for tomorrow",
"set alarm successfully"
] |
[
9,
8,
9
] |

๐ฎ MobA manipulates mobile phones just like how you would.
๐ Website | ๐ Paper | ๐ค MobBench | ๐๏ธ Code
็ฎไฝไธญๆ | English
๐ฅ News
๐ Introduction
Current mobile assistants are limited by dependence on system APIs or struggle with complex user instructions and diverse interfaces due to restricted comprehension and decision-making abilities. To address these challenges, we propose MobA, a novel Mobile phone Agent powered by multimodal large language models that enhances comprehension and planning capabilities through a sophisticated two-level agent architecture. The high-level Global Agent (GA) is responsible for understanding user commands, tracking history memories, and planning tasks. The low-level Local Agent (LA) predicts detailed actions in the form of function calls, guided by sub-tasks and memory from the GA. Integrating a Reflection Module allows for efficient task completion and enables the system to handle previously unseen complex tasks. MobA demonstrates significant improvements in task execution efficiency and completion rate in real-life evaluations, underscoring the potential of MLLM-empowered mobile assistants.
๐ง Deployment
MobA is still under development, and we are keeping updating the code. Please stay tuned!
System Requirements
Make sure you have installed Android Debug Bridge (ADB), and you have connected your Android device to your computer. You should be able to see your devides with command adb devices.
Environment Setup
conda create -n moba python=3.12
conda activate moba
pip install numpy opencv-python openai generativeai pillow colorama
You may also use requirements.txt to install the required packages (However it is not recommended since there are many unused packages).
Run MobA
You need to specify the configuration file in config.yaml before running MobA. You can find the configuration file in the moba folder.
vim ./moba/config.yaml
cd ./moba/agent
python executor.py
You should be able to run MobA smoothly on Windows now. You can find MobBench, the fifty tasks we tested in the paper, on huggingface.
๐ Citation
If you find our work useful, please cite us!
@misc{zhu2024moba,
title={MobA: A Two-Level Agent System for Efficient Mobile Task Automation},
author={Zichen Zhu and Hao Tang and Yansi Li and Kunyao Lan and Yixuan Jiang and Hao Zhou and Yixiao Wang and Situo Zhang and Liangtai Sun and Lu Chen and Kai Yu},
year={2024},
eprint={2410.13757},
archivePrefix={arXiv},
primaryClass={cs.MA},
url={https://arxiv.org/abs/2410.13757},
}
๐ง Contact Us
If you have any questions, please feel free to contact me via email [email protected].
- Downloads last month
- 76