From 0b6a4ad10ab2f6e93c7721a44e94240d4b8618d3 Mon Sep 17 00:00:00 2001
From: artist_wu <81091914+frank6200db@users.noreply.github.com>
Date: Wed, 19 Jun 2024 22:51:17 +0800
Subject: [PATCH] Update index.html
---
index.html | 16 ++++++++--------
1 file changed, 8 insertions(+), 8 deletions(-)
diff --git a/index.html b/index.html
index f16e4b7..87cfc99 100644
--- a/index.html
+++ b/index.html
@@ -175,23 +175,23 @@
- Qinchen Wu,1
+ Qinchen Wu
1,
- Difei Gao,1
+ Difei Gao1,
- Kevin Qinghong Lin,1
+ Kevin Qinghong Lin1,
Zhuoyu Wu2,
- Xiangwu Guo,1
+ Xiangwu Guo1,
- Peiran Li,1
+ Peiran Li1,
- Weichen Zhang,1
+ Weichen Zhang1,
- Hengxu Wang,1
+ Hengxu Wang1,
Mike Zheng Shou1
@@ -334,7 +334,7 @@ Main contributions
Our work places emphasis on the following three aspects
- - Dataset: Act2Cap contains 4K+ GUI video (Action frames), caption pairs collected from automatic pipeline and human demonstration.
+ - Dataset: Act2Cap contains 4K+ GUI video (Action frames), caption pairs collected from GUI layouts including WORD, EXCEL, PPT, AE, PR, WEB through automatic pipeline and human demonstration.
- Benchmark: Metric for evaluating the quality of narration generated from LLMs.
- Model baseline: Two stage model effectively designed for narrating actions in GUI.