Topic: simhash Goto Github
Some thing interesting about simhash
Some thing interesting about simhash
simhash,基于simHash的Web作业查重系统
User: alushu
simhash,A rewrite of Bookmate's simhash gem, which is an implementation of Moses Charikar's simhashes in Ruby.
User: armchairtheorist
simhash,Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.
User: bbalet
simhash,Lab solutions for Analysis of Massive Datasets ("Analiza velikih skupova podataka") course at FER 2020/21
User: dbrcina
simhash,Code plagiarism system based on Simhash and Nicad.
User: derek-wds
simhash,Dynatrace hash library for Java
Organization: dynatrace-oss
simhash,a Golang implementation of Simhash Algorithm
User: hengfeiyang
simhash,Elixir SimHash NIFs written in Rust
User: holsee
Home Page: https://hex.pm/packages/spirit_fingers
simhash,A fast python implementation of the SimHash algorithm.
Organization: hybridtheory
simhash,Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
User: james-bowman
simhash,text de-duplication 文本去重
User: jiangnanboy
simhash,基于Java的多线程爬虫框架
User: jinshuai86
Home Page: https://jinshuai86.github.io/Spider
simhash,A barebones implementation of the simhash data sketching algorithm.
User: justinfargnoli
simhash,semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
User: keremzaman
simhash,爬虫内容处理服务(自用)
User: lifefloating
simhash,Datasets Euclidean to Hamming Conversion
User: long-gong
Home Page: https://github.com/long-gong/datasets
simhash,Rust jieba
User: luozijun
simhash,Implementacija algoritama predstavljenih na predmetu Analiza velikih skupova podataka (AVSP)
User: majajuri
simhash,:feet: Create a behavioral fingerprint based on your zsh command line history
User: manmolecular
simhash,Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.
User: marcnuth
simhash,Open Source Implementation of Simhash in Python
User: memosstilvi
simhash,基于springboot和Google开源simhash算法实现的作业查重/抄袭检测/文本相似度分析可视化系统,,集成jplag、MOSS、singleCloud工具套件进行多方位查重 Ref: https://github.com/ALuShu/checksystem
User: mokeeqian
simhash,event coding using spark and stanford-core-nlp
User: nemosharma6
simhash,Knowledge extraction through Data Analysis, including Locality Sensitive Hashing (LSH).
User: nepiskopos
simhash,A text similarity by simhash
User: netkiddy
simhash,SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex
User: nnnet
simhash,This system evaluates a collection of mementos (archived web pages) to determine which are off topic. The collection can be part of an Archive-It collection, a single TimeMap, or stored in a WARC file.
Organization: oduwsdl
simhash,Implemented simhash technique to estimate duplicated pages in a given dataset. University project for Information Retrieval (Spring 2015)
User: pnikitakis
simhash,A library for cosine similarity & simhash calculation
User: preciz
Home Page: https://hex.pm/packages/similarity
simhash,Implementation for the attacks of the paper "Locality-Sensitive Hashing Does Not Guarantee Privacy! Attacks on Google's FLoC and the MinHash Hierarchy System".
User: privacy-lsh
Home Page: https://arxiv.org/abs/2302.13635
simhash,Simhash algorithm using Jcseg for word segment, jenkins-hash for hash. Written in Scala
User: qingniufly
simhash,The extended version of simhash supports fingerprint extraction of documents and images.
User: qyokizzzz
simhash,documents my master's level thesis work on building continous, topical web crawler based on mercator 1999
User: rihenperry
simhash,Interesting (non-cryptographic) hashes implemented in pure Python.
User: sean-public
simhash,Locality Sensitive Hashing
User: serega
simhash,A simple implementation of simhash algorithm by java.
User: sing1ee
simhash,Analysis of Massive Datasets FER labs
User: sskender
simhash,Simhash implementation in Javascript
User: vkandy
simhash,In this repository you can find an implementation of LSH (Local | Sensitive Hashing) and Finesse algorithms, designed to find similar data based on their hashes
User: xah30
simhash,⌨️ User Verification based on Keystroke Dynamics / Two-factor Authentication technology based on Key-Stroke
User: xenia101
simhash,Find duplicate text files.
User: zyocum
simhash,Proof-of-concept for measuring similarity of phoneme sequences using locality sensitive hashing (LSH).
User: zyocum
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.