Topic: image-text Goto Github

Some thing interesting about image-text

👇 Here are 36 public repositories matching this topic...

akshaybura / character-recognition

image-text,Character Recognition system using CNN and Streamlit

User: akshaybura

cnn deep-neural-networks image-processing image-text preprocessing python recognizing-characters streamlit tensorflow

antonlukin / poster-editor

image-text,Wrapper for PHP's GD Library for easy image manipulation. Support for scaling multi-line text, shapes, filters and smart resize.

User: antonlukin

Home Page: https://packagist.org/packages/antonlukin/poster-editor

php-gd php-image intervention poster-editor composer image-text image-processing php php-class php-library

ask0ne / ocrator

image-text,Scan text from an image and convert into speech/audio of desired language.

User: ask0ne

natural-language-processing text-to-speech image-text image-recognition pytesseract

awsaf49 / flickr-dataset

image-text,Download flickr8k, flickr30k image caption datasets

User: awsaf49

captioning-images clip image image-text siglip dataset flickr flickr30k flickr8k

charlesyang030 / mta

image-text,MTA: A Lightweight Multilingual Text Alignment Model for Cross-language Visual Word Sense Disambiguation

User: charlesyang030

image-text language-vision multilingual multimodal visualwsd

charlesyang030 / polclip

image-text,PolCLIP: A Unified Image-Text Word Sense Disambiguation Model via Generating Multimodal Complementary Representations

User: charlesyang030

image-text multimodal-wsd

darkknightsgh / text-image-text

image-text,Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.

User: darkknightsgh

flickr8k-dataset image-text information-retrieval python semantic-embedding streamlit text-image transformers huggingface-transformers

dinhanhx / visualroberta

image-text,The first public Vietnamese visual linguistic foundation model(s)

User: dinhanhx

python python-3 python3 image-captioning image-text vietnamese-nlp visual-linguistic visual-question-answering

dinhanhx / vl-datasets

image-text,Some Python scripts to load Vietnamese visual linguistic data

User: dinhanhx

image-captioning image-text python python-3 python3 vietnamese vietnamese-nlp visual-linguistic visual-question-answering

dngo-io / cover-creator

image-text,Write texts on images with php

Organization: dngo-io

php image-manipulation image-text image-processing textview

dvlab-research / tagclip

image-text,

Organization: dvlab-research

Home Page: https://arxiv.org/abs/2304.07547

clip image-text segmentation zero-shot

fatemeh-mohseni-ai / most-repeated-vocabulary-ielts

image-text,This project is a FastAPI-based web application designed to analyze C a m b r i d g e I E L T S P D F s ( B o o k s 1 − 18 ) for the most and least repeated words. It can handle both regular text-based PDFs and scanned image-based PDFs by converting them to images and extracting text using OCR (Optical Character Recognition).

User: fatemeh-mohseni-ai

fast-api ielts image-text