Topic: image-text Goto Github
Some thing interesting about image-text
Some thing interesting about image-text
image-text,Character Recognition system using CNN and Streamlit
User: akshaybura
image-text,Wrapper for PHP's GD Library for easy image manipulation. Support for scaling multi-line text, shapes, filters and smart resize.
User: antonlukin
Home Page: https://packagist.org/packages/antonlukin/poster-editor
image-text,Scan text from an image and convert into speech/audio of desired language.
User: ask0ne
image-text,Download flickr8k, flickr30k image caption datasets
User: awsaf49
image-text,MTA: A Lightweight Multilingual Text Alignment Model for Cross-language Visual Word Sense Disambiguation
User: charlesyang030
image-text,PolCLIP: A Unified Image-Text Word Sense Disambiguation Model via Generating Multimodal Complementary Representations
User: charlesyang030
image-text,Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.
User: darkknightsgh
image-text,The first public Vietnamese visual linguistic foundation model(s)
User: dinhanhx
image-text,Some Python scripts to load Vietnamese visual linguistic data
User: dinhanhx
image-text,Write texts on images with php
Organization: dngo-io
image-text,
Organization: dvlab-research
Home Page: https://arxiv.org/abs/2304.07547
image-text,This project is a FastAPI-based web application designed to analyze C a m b r i d g e I E L T S P D F s ( B o o k s 1 − 18 ) for the most and least repeated words. It can handle both regular text-based PDFs and scanned image-based PDFs by converting them to images and extracting text using OCR (Optical Character Recognition).
User: fatemeh-mohseni-ai
image-text,Raster graphics package for Fōrmulæ, in JavaScript
Organization: formulae-org
image-text,The largest multilingual image-text classification dataset. It contains fashion products.
Organization: glami
image-text,Data release for the ImageInWords (IIW) paper.
Organization: google
Home Page: https://google.github.io/imageinwords/
image-text,WWDC22: Enabling Live Text interactions with images in SwiftUI
User: huangrunhua
image-text,lmmtoolkit is a toolkit for Multi-Modal Learning
User: jianzhnie
Home Page: https://jianzhnie.github.io/llmtech/
image-text,Deep Cross-Modal Projection Learning for Image-Text Matching
User: labyrinth7x
image-text,caption generator using lavis and argostranslate
User: leeyunjai
image-text,Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment
Organization: miccunifi
image-text,10000-Image-caption-data-of-diverse-scenes
Organization: nexdata-ai
Home Page: https://www.nexdata.ai/datasets/llm/1283?source=Github
image-text,10000-Image-caption-data-of-gestures
Organization: nexdata-ai
Home Page: https://www.nexdata.ai/datasets/llm/1287?source=Github
image-text,10000-Image-caption-data-of-vehicles
Organization: nexdata-ai
Home Page: https://www.nexdata.ai/datasets/llm/1284?source=Github
image-text,10100-Image-caption-data-of-human-face
Organization: nexdata-ai
Home Page: https://www.nexdata.ai/datasets/llm/1286?source=Github
image-text,11000-Image-Video-caption-data-of-human-action
Organization: nexdata-ai
Home Page: https://www.nexdata.ai/datasets/llm/1289?source=Github
image-text,20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes
Organization: nexdata-ai
Home Page: https://www.nexdata.ai/datasets/llm/1288?source=Github
image-text,Image Captioning With MobileNet-LLaMA 3
User: reshalfahsi
image-text,Code for ALBEF: a new vision-language pre-training method
Organization: salesforce
image-text,Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
Organization: sense-gvt
image-text,A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
User: theocoombes
Home Page: http://crawling.at
image-text,A server powering LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
User: theocoombes
Home Page: http://crawling.at
image-text,mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
Organization: x-plug
Home Page: https://arxiv.org/abs/2205.12005
image-text,Keras implementation of ImageBERT from Microsoft
User: zabir-nabil
image-text,ocr文字识别算法服务
User: zhangming8
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.