PyTorch implementation of 'Practical Sampling-based Bayesian Inference for multimodal distribution'
Collaborative generation of unique audiovisual experiences using NFC identity cards
All content produced for the PF (Projeto FEUP) course unit of the Informatics and Computing Engineering degree at FEUP
Multitasking multimodal AI material that focuses on human interaction and assistance
Public repo for the paper: "COSMic: A Coherence-Aware Generation Metric for Image Descriptions" by Mert İnan, Piyush Sharma, Baber Khalid, Radu Soricut, Matthew Stone, Malihe Alikhani
Code for IEEE MultiMedia Paper "Modeling Incongruity between Modalities for Multimodal Sarcasm Detection."
Interpolate between two text concepts using a CLIP model and FiftyOne Plugins!
Distributed computing framework for Multimodal data written in Python
SCOTCH is a Single-Cell multi-modal integration method leveraging the Optimal Transport algorithm and a cell matCHing strategy
Engage in a semantic segmentation challenge for land cover description using multimodal remote sensing earth observation data, delving into real-world scenarios with a dataset comprising 70,000+ aerial imagery patches and 50,000 Sentinel-2 satellite acquisitions.
This library provides packages on DoubleML / Causal Machine Learning and Neural Networks in Python for Simulation and Case Studies.
Utilizing a multimodal architecture to predict the appropriate speaker turn in a dialogue.
A multimodal pipeline to generate three tones of reviews [harsh, constructive, kind] for a given artwork using fine-tuned Flan-T5 models.
This project uses OCR to extract prescription information and search for related pharmacies
This is the official repository for the Vista dataset, a Vietnamese multimodal dataset containing more than 700,000 samples of conversations and images
Omni-Modality Processing, Understanding, and Generation
Visual Instruction Tuning for Qwen2 Base Model
An intentionally simple Image to Food cross-modal search. Created by Prithiviraj Damodaran.
AMR extension for the spatial domain, with grounded frame of reference tracking
AnyModality is an open-source library to simplify MultiModal LLM inference and deployment.