Git Product home page Git Product logo

Professor of Data Mining

My research is to make data analysis algorithms better and faster.

In particular, I am interested in:

  • cluster analysis
  • outlier detection
  • indexing for similarity search
  • text mining

Most of my research is available as open-source, e.g., in the ELKI data mining toolkit and the kmedoids Rust crate and its Python wrapper "kmedoids" available on pypi/pip and conda-forge.

Open for work

In academia, you are expected to continuously apply for better positions and move on - that is the only way to advance your career. But I am not set on staying in academia. I would like to have more time for programming, and less administrative work, and universities cannot offer such positions.

I am good at:

  • programming since 1989 in dozens of languages (Java, Python, Rust, Perl, ...)
  • helping and teaching others resolve issues and speed up their code
  • teaching machine learning at a university level
  • improving algorithms
  • architecting Java projects with 200.000+ lines of code
  • optimization and refactoring (my Java refactoring work were even analyzed in scientific studies)
  • Python, Rust: experienced enough to publish the packages mentioned above and to contribute fixes to scikit-learn
  • automating with scripts (shell and Python, a long time ago in Perl)
  • Linux system administration (I am a member of the Debian Linux project, although not very active right now at package maintainance)

You can contact me if you have a competitive offer (no junior positions), with the following constraints:

  • software development and research are my favorite, educational roles are fine (but I'd prefer less administrative work)
  • Germany, open for relocation to central and well-connected cities such as Munich or Hannover, Zürich, Luxembourg would also be okay, but moving to the US or UK is currently not an option because of family reasons. Remote is not my favorite, I am more productive at the office, but a possibility.
  • no more sales or consulting positions, I am interested in a senior developer or senior researcher position

Erich Schubert's Projects

adbench icon adbench

Official Implement of "ADBench: Anomaly Detection Benchmark".

beamer icon beamer

A LaTeX class for producing presentations and slides

cervidae icon cervidae

Cervidae - Low-Level Data Structures and Algorithms

chroma icon chroma

the AI-native open-source embedding database

colossalai icon colossalai

Making large AI models cheaper, faster and more accessible

corenlp icon corenlp

Stanford CoreNLP: A Java suite of core NLP tools.

csrankings icon csrankings

A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.

dbscan icon dbscan

Density Based Clustering of Applications with Noise (DBSCAN) and Related Algorithms - R package

decker icon decker

A markdown based tool for slide deck creation.

diffusers icon diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

elki icon elki

ELKI Data Mining Toolkit

fastutil icon fastutil

fastutil extends the Java™ Collections Framework by providing type-specific maps, sets, lists and queues.

goxmeans icon goxmeans

An implementation of the x-means algorithm in Go.

heideltime icon heideltime

A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University.

hppc icon hppc

High Performance Primitive Collections for Java

jackson-databind icon jackson-databind

General data-binding package for Jackson (2.x): works on streaming API (core) implementation(s)

jongo icon jongo

Query in Java as in Mongo shell

jsoup icon jsoup

jsoup: Java HTML Parser, with best of DOM, CSS, and jquery

lektor icon lektor

The lektor static file content management system

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.