Samples from running multimodal Efficient-Large-Model/VILA1.5-3b on video sequences using Jetson AGX Orin, captured at the live rate.

Published: 2024/5/15