assert-kth / codrep Goto Github PK
View Code? Open in Web Editor NEW58069 Java source code diffs. http://arxiv.org/pdf/1807.03200
Home Page: http://arxiv.org/pdf/1807.03200
58069 Java source code diffs. http://arxiv.org/pdf/1807.03200
Home Page: http://arxiv.org/pdf/1807.03200
Created for Nghi D. Q. Bui and Lingxiao Jiang from Singapore Management University for discussions. Welcome!
I noticed that in some task files there is an extra line appended at the end of the file.
I think this this is probably a compiler warning that you appended by accident.
An example of this is 3.txt in Dataset 2. https://github.com/KTH/CodRep-competition/blob/master/Datasets/Dataset2/Tasks/3.txt
Created for Team madPL from University of Wisconsin--Madison & Microsoft Research for discussions. Welcome!
Jordan Henkel, Shuvendu Lahiri, Ben Liblit, Thomas Reps
Created for Team COINSE (Gabin An, Shin Yoo) from KAIST, South Korea, for discussions. Welcome!
Watch the repo to get notified about important news!
Created for Team CSV(@cesarsotovalero) from the Universidad Central "Marta Abreu" de Las Villas for discussions. Welcome!
Machine used for evaluation: Ubuntu 18.04 LTS, CPU Intel 2299MHZ, 16 GB RAM
Don't hesitate to comment here about the process.
Created for Team Avmb from the Univerisity of Edinburgh for discussions. Welcome!
if several predictions, the loss function should be average of maximum loss
Team name: SPbAU, Bogomolov
Error on Dataset1: 0.164
Error on Dataset2: 0.1235
Created for Jesper and Olof from Ericsson and RISE for discussions. Welcome!
Hi,
Thanks a lot for organizing this :) Hope that you don't mind the drive-by issue submission: I would like to suggest three additional, strong, but reasonable, baselines:
The reason I am suggesting this, is that these baselines seem easy "hacks" to achieve reasonable performance without any machine learning.
Created for the source{d} team. We plan to keep track of our approaches and solutions in this issue.
Hello,
Thank you for a great competition!
May you add additional information about hidden dataset
from here.
Offtopic: I recommend using kaggle-style approach and publish test dataset without solutions, so you can receive predictions and publish public score and compute a private score.
Hi all,
I just did a quick naive solution based on string distance:
Dataset | Perfect Match | In Top 10 | Recall | Loss |
---|---|---|---|---|
Bench 1 | 3791 | 4322 98% | 0.86 | 0.13615878141899027 |
Bench 2 | 9910 | 10805 97% | 0.89 | 0.10263617900182995 |
Created for Marcelo Martins(@mrezende) from IPT Sao Paulo for discussions. Welcome!
Hi @mallamanis!
According to the interesting points discussed in #13, you may submit a proposal (and we do hope so :-), so here is your participant wall!
The idea is to post here findings that are specific to your solution and to tease with the corresponding scores on Dataset1 and Dataset2.
Note that it's also perfectly OK to open other issues.
Welcome!
Created for Team OttoRepairs from Otto-von-Guericke University Magdeburg for discussions. Welcome!
As stated in the README:
"Your program does not have to predict something for all input files, if there is no clear answer, simply don't output anything, the error computation takes that into account, more information about this in Loss function below."
This is a bit ambiguous, does it mean that the output should be skipped altogether or that one could output just the filename with no line number?
Either case, it should probably either be fixed or clarified in the README.
The latter does not work (full path omitted):
echo "CodRep-competition/Datasets/Dataset1/Tasks/2703.txt 35" | python evaluate.py
Total files: 34096
Average line error: 0.999970671046 (the lower, the better)
Recall@1: 2.93289535429e-05 (the higher, the better)
echo "CodRep-competition/Datasets/Dataset1/Tasks/2703.txt" | python evaluate.py
Traceback (most recent call last):
File "evaluate.py", line 183, in
main()
File "evaluate.py", line 168, in main
prediction = inputs[1]
IndexError: list index out of range
echo "CodRep-competition/Datasets/Dataset1/Tasks/2703.txt " | python evaluate.py
Traceback (most recent call last):
File "evaluate.py", line 183, in
main()
File "evaluate.py", line 168, in main
prediction = inputs[1]
IndexError: list index out of range
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.