-
Notifications
You must be signed in to change notification settings - Fork 15
/
index.html
200 lines (190 loc) · 13.8 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
<!DOCTYPE html>
<html>
<head>
<link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.7/css/bootstrap.min.css"
integrity="sha384-BVYiiSIFeK1dGmJRAkycuHAHRg32OmUcww7on3RYdg4Va+PmSTsz/K68vbdEjh4u" crossorigin="anonymous">
<link href='http://fonts.googleapis.com/css?family=Lato:300,400,900' rel='stylesheet' type='text/css'>
<link href="style.css" rel="stylesheet">
<meta charset="utf-8">
<title>AudioGPT</title>
<!-- <link href="css/bootstrap.min.css" rel="stylesheet"> -->
</head>
<body data-new-gr-c-s-check-loaded="14.1091.0" data-gr-ext-installed="">
<div class="container" >
<header role="banner">
</header>
<main role="main">
<article itemscope itemtype="https://schema.org/BlogPosting">
<div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
<h2 style="text-align: center;">AudioGPT</h2>
</div>
<div class="container">
<div class="text-center">
<video width="80%" controls="">
<source src="demo/AudioGPT_3.mp4" type="video/mp4">
Your browser does not support the video tag.
</video>
</div>
</div>
<div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
<h3>Speech</h3>
<div class="table-responsive pt-3">
<table class="table table-hover pt-2">
<thead>
<tr>
<th style="text-align: center">Task Name</th>
<th style="text-align: center">Prompt</th>
<th style="text-align: center">Inputs</th>
<th style="text-align: center">Outputs</th>
</tr>
</thead>
<tbody>
<tr><td style="text-align: center;vertical-align:middle;width: 800px">Text-To-Speech</td>
<td style="text-align: center;vertical-align:middle;width: 800px">Generate a speech with text "here we go".</td>
<td style="text-align: center;vertical-align:middle;width: 800px">/</td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/fd5cf55e.wav" autoplay/>Your browser does not support the audio element.</audio></td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 800px">Style Transfer</td>
<td style="text-align: center;vertical-align:middle;width: 800px">Speak using the voice of this audio. The text is "Here we go".</td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/0011_001570.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/9a40cc94.wav" autoplay/>Your browser does not support the audio element.</audio></td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 800px">Speech Recognition</td>
<td style="text-align: center;vertical-align:middle;width: 800px">Transcribe this speech.</td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/fd5cf55e.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 800px">Here we go.</td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 800px">Speech Enhancement</td>
<td style="text-align: center;vertical-align:middle;width: 800px">Enhance the quality of the speech signal.</td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/M05_440C0213_PED_REAL.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/b03f020c.wav" autoplay/>Your browser does not support the audio element.</audio></td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 800px">Speech Separation</td>
<td style="text-align: center;vertical-align:middle;width: 800px">Separate each speech from the speech mixture.</td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/447c020t_1.2106_422a0112_-1.2106.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/192da007.wav" autoplay/>Your browser does not support the audio element.</audio></td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 800px">Mono-to-Binaural</td>
<td style="text-align: center;vertical-align:middle;width: 800px">Transfer this mono audio into binaural audio.</td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/fd5cf55e.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/d2449644.wav" autoplay/>Your browser does not support the audio element.</audio></td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
<h3>Sing</h3>
<div class="table-responsive pt-3">
<table class="table table-hover pt-2">
<thead>
<tr>
<th style="text-align: center">Task Name</th>
<th style="text-align: center">Prompt</th>
<th style="text-align: center">Inputs</th>
<th style="text-align: center">Outputs</th>
</tr>
</thead>
<tbody>
<tr>
<td style="text-align: center;vertical-align:middle;width: 800px">Text-To-Sing</td>
<td style="text-align: center;vertical-align:middle;width: 800px">Please generate a piece of singing voice. Text sequence is 小酒窝长睫毛AP是你最美的记号. Note sequence is C#4/Db4 | F#4/Gb4 | G#4/Ab4 | A#4/Bb4 F#4/Gb4 | F#4/Gb4 C#4/Db4 | C#4/Db4 | rest | C#4/Db4 | A#4/Bb4 | G#4/Ab4 | A#4/Bb4 | G#4/Ab4 | F4 | C#4/Db4. Note duration sequence is 0.407140 | 0.376190 | 0.242180 | 0.509550 0.183420 | 0.315400 0.235020 | 0.361660 | 0.223070 | 0.377270 | 0.340550 | 0.299620 | 0.344510 | 0.283770 | 0.323390 | 0.360340.</td>
<td style="text-align: center;vertical-align:middle;width: 800px">/</td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/2bf90e35.wav" autoplay/>Your browser does not support the audio element.</audio></td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
<h3>Audio</h3>
<div class="table-responsive pt-3">
<table class="table table-hover pt-2">
<thead>
<tr>
<th style="text-align: center">Task Name</th>
<th style="text-align: center">Prompt</th>
<th style="text-align: center">Inputs</th>
<th style="text-align: center">Outputs</th>
</tr>
</thead>
<tbody>
<tr>
<td style="text-align: center;vertical-align:middle;width: 500px">Text-To-Audio</td>
<td style="text-align: center;vertical-align:middle;width: 500px">Generate an audio of a piano playing.</td>
<td style="text-align: center;vertical-align:middle;width: 500px">/</td>
<td style="text-align: center;vertical-align:middle;width: 500px"><audio controls="controls" style="width: 140px;"><source src="demo/b973e878.wav" autoplay/>Your browser does not support the audio element.</audio></td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 500px">Audio Inpainting</td>
<td style="text-align: center;vertical-align:middle;width: 500px">I want to inpaint this audio.</td>
<td style="text-align: center;vertical-align:middle;width: 500px"><audio controls="controls" style="width: 140px;"><source src="demo/drums-and-music-playing-with-a-man-speaking.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 500px"><audio controls="controls" style="width: 140px;"><source src="demo/7cb0d24f.wav" autoplay/>Your browser does not support the audio element.</audio></td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 500px">Image-To-Audio</td>
<td style="text-align: center;vertical-align:middle;width: 500px">Generate an audio of this image.</td>
<td style="text-align: center;vertical-align:middle;width: 500px"><img src="demo/violin.png" width="260" height="162"></td>
<td style="text-align: center;vertical-align:middle;width: 500px"><audio controls="controls" style="width: 140px;"><source src="demo/5d67d1b9.wav" autoplay/>Your browser does not support the audio element.</audio></td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 500px">Sound Detection</td>
<td style="text-align: center;vertical-align:middle;width: 500px">What events does this audio include?</td>
<td style="text-align: center;vertical-align:middle;width: 500px"><audio controls="controls" style="width: 140px;"><source src="demo/drums-and-music-playing-with-a-man-speaking.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 500px"><img src="demo/915f9a1d.png" width="260" height="162"></td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 500px">Target Sound Detection</td>
<td style="text-align: center;vertical-align:middle;width: 500px">Please help me detect the target sound in the audio based on desription: "I want to detect thunder event".</td>
<td style="text-align: center;vertical-align:middle;width: 500px"><audio controls="controls" style="width: 140px;"><source src="demo/thunder-as-rain-falling-down.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 500px">The thunder happened in this audio from 0.0 to 9.984 seconds.</td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 500px">Sound Extraction</td>
<td style="text-align: center;vertical-align:middle;width: 500px">Extract the thunder event from this audio.</td>
<td style="text-align: center;vertical-align:middle;width: 500px"><audio controls="controls" style="width: 140px;"><source src="demo/thunder-as-rain-falling-down.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 500px"><audio controls="controls" style="width: 140px;"><source src="demo/fba1621e.wav" autoplay/>Your browser does not support the audio element.</audio></td>
</tr>
<tr>
<td style="text-align: center;vertical-align:middle;width: 500px">Audio-To-Text</td>
<td style="text-align: center;vertical-align:middle;width: 500px">Give me the description of this audio.</td>
<td style="text-align: center;vertical-align:middle;width: 500px"><audio controls="controls" style="width: 140px;"><source src="demo/a-group-of-sheep-are-baaing.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 500px">The audio is recording of a goat bleating nearby several times.</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="container pt-5 mt-5 shadow p-5 mb-5 bg-white rounded">
<h3>Face</h3>
<div class="table-responsive pt-3">
<table class="table table-hover pt-2">
<thead>
<tr>
<th style="text-align: center">Task Name</th>
<th style="text-align: center">Prompt</th>
<th style="text-align: center">Inputs</th>
<th style="text-align: center">Outputs</th>
</tr>
</thead>
<tbody>
<tr>
<td style="text-align: center;vertical-align:middle;width: 800px">Talking Head Synthesis</td>
<td style="text-align: center;vertical-align:middle;width: 800px">Generate a talking human portrait video.</td>
<td style="text-align: center;vertical-align:middle;width: 800px"><audio controls="controls" style="width: 140px;"><source src="demo/fd5cf55e.wav" autoplay/>Your browser does not support the audio element.</audio></td>
<td style="text-align: center;vertical-align:middle;width: 800px"><video width="320" height="240" controls><source src="demo/174e17d2.mp4" type="video/mp4"></video></td>
</tr>
</tbody>
</table>
</div>
</div>
<body>
<td style="text-align: center;vertical-align:middle;width: 800px"><img src="demo/19451824932.png" width="372" height="526"></td>
<td style="text-align: center;vertical-align:middle;width: 800px"><img src="demo/19452260913.png" width="372" height="526"></td>
<td style="text-align: center;vertical-align:middle;width: 800px"><img src="demo/19452340431.png" width="372" height="526"></td>
</body>