Spaces: Running on Zero
Commit 533f466 · Parent: 10b6682 · update

app.py CHANGED
@@ -68,20 +68,35 @@ WIDTH = 384

 MARKDOWN = \
 """
-
+<div align='center'>
+<h1> This&That: Language-Gesture Controlled Video Generation for Robot Planning </h1> \
+<h2 style='font-weight: 450; font-size: 1rem; margin: 0rem'>\
+<a href='https://kiteretsu77.github.io/boyang.github.io/'>Boyang Wang</a>, \
+<a href='https://www.linkedin.com/in/niksridhar/'>Nikhil Sridhar</a>, \
+<a href='https://cfeng16.github.io/'>Chao Feng</a>, \
+<a href='https://mvandermerwe.github.io/'>Mark Van der Merwe</a>, \
+<a href='https://fishbotics.com/'>Adam Fishman</a>, \
+<a href='https://www.mmintlab.com/people/nima-fazeli/'>Nima Fazeli</a>, \
+<a href='https://jjparkcv.github.io/'>Jeong Joon Park</a> \
+</h2> \

-
-
+<a style='font-size:18px;color: #000000' href='https://github.com/Kiteretsu77/This_and_That_VDM'> [Github] </a> \
+<a style='font-size:18px;color: #000000' href='http://arxiv.org/abs/2407.05530'> [ArXiv] </a> \
+<a style='font-size:18px;color: #000000' href='https://cfeng16.github.io/this-and-that/'> [Project Page] </a> </div> \
+</div>
+
+This&That is a language-gesture-image-conditioned video generation model for robot planning in robotics scenarios (based on the Bridge dataset for this demo).

 This demo covers the Video Diffusion Model part.
-Only GestureNet is provided in this Gradio Demo, you can check the full test code for all pretrained weight available.
+Only GestureNet is provided in this Gradio demo, but you can check the full test code for all available pretrained weights.

-### Note: The index we put the gesture point by default here is [4, 10] for two gesture points or [4] for one gesture point.
-### Note: The
+### Note: By default, gesture points are placed at frame indices [4, 10] (the 5th and 11th frames) for two gesture points, or [4] (the 5th frame) for one.
+### Note: Only 256x384 resolution is currently supported.
 ### Note: Click "Clear All" to restart everything; click "Undo Point" to cancel the last point you placed.
 ### Note: The first run may take a while; clicking "Clear All" before each run is the safest choice.

 If **This&That** is helpful, please help star the [GitHub Repo](https://github.com/Kiteretsu77/This_and_That_VDM). Thanks!
+
 """

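For context, here is a minimal sketch of how a `MARKDOWN` string like this is typically wired into a Gradio Blocks app, with the 256x384 input constraint from the notes enforced during preprocessing. Only `WIDTH = 384` (from the hunk header) and the 256x384 note come from the diff itself; `HEIGHT`, `preprocess`, and the component layout are assumptions for illustration, not the demo's actual code.

```python
import gradio as gr
from PIL import Image

HEIGHT = 256  # assumed from the "256x384" note in the commit
WIDTH = 384   # matches the WIDTH = 384 context in the hunk header

MARKDOWN = "..."  # the header string committed above

def preprocess(image: Image.Image) -> Image.Image:
    # Hypothetical helper: PIL's resize takes (width, height), so force
    # inputs to the one supported resolution before they reach the model.
    return image.resize((WIDTH, HEIGHT))

with gr.Blocks() as demo:
    gr.Markdown(MARKDOWN)  # renders the HTML/markdown header at the top
    image_in = gr.Image(type="pil", label="Condition image")
    video_out = gr.Video(label="Generated video")
    # ... gesture-point selection and the GestureNet inference call
    # would be wired in here in the real app.py.

if __name__ == "__main__":
    demo.launch()
```

Since `gr.Markdown` renders raw HTML, the `<div align='center'>` header with inline-styled `<a>` links added in this commit displays as a centered title block above the demo controls.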