carlosal1015 / variability-classifier-latex

LaTeX files for the variability classifier project (a thesis project).

On the (Predictive) Performance of Photometric Variability Classifiers

Automated, Supervised Machine Learning Approaches Using Support Vector Machines, Random Forests & Gradient Boosted Trees.

Author: Markus Beuckelmann
Supervisors: PD Dr. Coryn Bailer–Jones, Dr. Kester Smith, Dr. Dae–Won Kim
Examiners: PD Dr. Coryn Bailer–Jones, PD Dr. Christian Fendt
Institution: Max Planck Institute for Astronomy (MPIA)
Submission: July 2015
Abstract: This work assesses the predictive performance of different automated, supervised Machine Learning approaches for the classification of astrophysical variables based on their photometric variability. We extract 64 ad–hoc features in R_E, B_E and R_E − B_E from 32683 EROS–2 light curves of known periodic, semi–periodic and aperiodic sources in the Large Magellanic Cloud (LMC). To characterize the periodicity of the signals, we use both the Lomb–Scargle and the conditional entropy (CE) algorithms for period–finding. In this context, we present a fast Python/Cython implementation of the CE algorithm. To provide further separation of quasars in feature space, we implement the structure function to quantify a source’s intrinsic stochastic variability. Using a training set containing labels for 9 superclasses and 25 subclasses provided by Kim et al. [2014], we train three different models on the extracted features, namely Support Vector Machines (SVMs), Random Forests (RF) and Gradient Boosted Trees (GBT), and optimize each model’s hyperparameters for the average, weighted F1–score for superclasses and subclasses by performing a grid search using 5–fold cross–validation. We find that the decision–tree–based models, RF and GBT, outperform the SVM in both superclass and subclass classification. The highest scores are achieved by the GBT classifier, with an average, weighted F1–score of (98.43 ± 0.07) % for superclass classification and (86.30 ± 0.37) % for subclass classification.
This report documents a Bachelor of Science (B.Sc.) thesis in Physics at Heidelberg University (Germany).
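
The abstract reports an "average, weighted F1–score" without spelling out the weighting. The sketch below shows one common reading, assumed here rather than taken from the thesis: the per-class F1 scores are averaged with weights proportional to each class's number of true instances (the same convention as scikit-learn's `average="weighted"`). The function name `weighted_f1` and the toy labels are hypothetical and are not thesis data.

```python
from collections import Counter


def weighted_f1(y_true, y_pred):
    """Support-weighted mean of per-class F1 scores.

    Assumption: "weighted" means each class's F1 is weighted by its
    share of true instances; the thesis may define it differently.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    support = Counter(y_true)  # number of true instances per class
    total = len(y_true)
    score = 0.0

    for label, n_true in support.items():
        # True positives: predicted as `label` and actually `label`.
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        n_pred = sum(1 for p in y_pred if p == label)

        precision = tp / n_pred if n_pred else 0.0
        recall = tp / n_true
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0

        # Weight each class by its fraction of the true labels.
        score += (n_true / total) * f1

    return score


# Toy labels for illustration only (hypothetical class names).
y_true = ["RRL", "RRL", "RRL", "QSO", "QSO", "CEPH"]
y_pred = ["RRL", "RRL", "QSO", "QSO", "QSO", "RRL"]
print(round(weighted_f1(y_true, y_pred), 4))
```

Under this convention a rare class that is never predicted correctly (here `CEPH`) lowers the score only in proportion to its support, which is one reason a subclass score can sit well below the superclass score when many small classes are involved.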

Selected figures

...
