Git Product home page Git Product logo

c4's People

Contributors

chenning-tao avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

c4's Issues

About CLCDSA

I would like to ask how you applied the dataset from C4 to CLCDSA. When I used the Antlr tool in CLCDSA to parse code, I was unable to correctly obtain the feature values required by the CLCDSA model (due to the lack of line breaks in the code in the dataset)
image

Can't download dataset files

Hi, I am trying to build a Python code clone classifier and your dataset is very useful. But it seems that those datasets files can't be downloaded anymore due to lack of data quota:

Downloading dataset/pair_test.jsonl (33 MB)
Error downloading object: dataset/pair_test.jsonl (1101894): Smudge error: Error downloading dataset/pair_test.jsonl (1101894e6b66d1dfe3ae9d889eac781a9a549a03d9c3466fb4b3bc9671d85358): batch response: This repository is over its data quota. Account responsible for LFS bandwidth should purchase more data packs to restore access.

It will be appreciated if you could fix this or provide any other access to the dataset!

Missing necessary files model.py and bleu.py?

Hi. I'm quite interested in your work and was trying to replicate the experiments with the released code, but something goes wrong :(

The script announces that model and bleu are not found. It seems that the necesarry files model.py and bleu.py are missing? Could you please supplement these two files (and any other files necessary yet missing)?

Btw, can you also tell the necessary python libraries and their versions to run the released codes?

Thx a lot.

Something may wrong in testing part

Dear author, when I read the code in the test part, I found the following problems:
there is a 'break ' at run_con.py 441rows, I am confused by this. I want to ask you if there is an error here?
Snipaste_2023-03-16_21-34-03
thank you very much!

Share the trained model

Is it possible for you to share the trained model so that it can be used on other datasets?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.