Collaborative generation of unique audiovisual experiences using NFC identity cards
Updated Jan 20, 2021 - TypeScript
All content produced for the PF (Projeto FEUP) course unit of the Informatics and Computing Engineering degree at FEUP
Multitasking multimodal AI material that focuses on human interaction and assistance
Utilizing a multimodal architecture to predict the appropriate speaker turn in a dialogue.
This repo collects Multi-modal Machine Learning papers.
AMR extension for the spatial domain, with grounded frame of reference tracking
Multi-angle Lip Multimodal Video Data
Accepted at The Web Conference 2024.
🤖 A framework for building AI Agents with LLMs, integrating multimodal generative AI technologies including voice, images, videos, and digital humans 🌈💎✨
A notebook to learn about ML for astronomy through BTSbot.
Visuo-haptic integration during texture exploration
In this course, you’ll select open source models from Hugging Face Hub to perform NLP, audio, image and multimodal tasks using the Hugging Face transformers library.
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
Dataset from the paper "The Semantic Typology of Visually Grounded Paraphrases"
Application template for choosing a hotel and a tour when planning travel
TerraWatch is a proof of concept system developed during the TUM AI Hackathon 2024 to detect deforestation from satellite images and reason out the causes and potential environmental effects using computer vision models and multimodal large language models.
IDEFICS (Image-aware Decoder Enhanced à la Flamingo with Interleaved Cross-attentionS) is an open-access reproduction of Flamingo, a closed-source visual language model developed by DeepMind. Like GPT-4, the multimodal model accepts arbitrary sequences of image and text inputs and produces text outputs.