flagopen / taco

License: Apache License 2.0

Python 98.86% Shell 1.14%

taco's People

Contributors

cat30year eltociear mdwoicke rongaoli tonywhite11 iiji

taco's Issues

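Data contamination

Great work! TACO is the best code-generation dataset I have seen among open-source datasets.

I am curious whether you considered data contamination when building the dataset. In the LLM era, whether a test set is contaminated is an important point of reference.
I could not find any information about this in the paper.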

AttributeError: module 'inspect' has no attribute 'getargspec'. Did you mean: 'getargs'?

When I ran compute_metric.py to evaluate the generation results, the console reported "no module named pyext". Installing pyext with pip produced the following error:

Collecting pyext
  Using cached pyext-0.7.tar.gz (7.8 kB)
  Preparing metadata (setup.py) ... error
  error: subprocess-exited-with-error

  × python setup.py egg_info did not run successfully.
  │ exit code: 1
  ╰─> [9 lines of output]
      Traceback (most recent call last):
        File "<string>", line 2, in <module>
        File "<pip-setuptools-caller>", line 34, in <module>
        File "...\AppData\Local\Temp\pip-install-exx9gznl\pyext_28002897deae467da164cbba24ad8613\setup.py", line 6, in <module>
          import pyext
        File "...\AppData\Local\Temp\pip-install-exx9gznl\pyext_28002897deae467da164cbba24ad8613\pyext.py", line 117, in <module>
          oargspec = inspect.getargspec
                     ^^^^^^^^^^^^^^^^^^
      AttributeError: module 'inspect' has no attribute 'getargspec'. Did you mean: 'getargs'?
      [end of output]

  note: This error originates from a subprocess, and is likely not a problem with pip.
error: metadata-generation-failed

× Encountered error while generating package metadata.
╰─> See above for output.

note: This is an issue with the package mentioned above, not pip.
hint: See above for details.

After searching online, I found that this comes from a change in Python rather than from pip: inspect.getargspec was removed in Python 3.11, and pyext 0.7 still calls it. I am on Python 3.11 to support OpenAI's models.

The issue does not occur on Python 3.8, so I downgraded to Python 3.8.

Moreover, someone mentioned that the problem may be resolved in the second quarter of 2024: https://community.privacyidea.org/t/python-3-11-support/3115/2

I suggest documenting the required environment (Python version and dependencies) on the guide page.
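
Until the guide documents the environment, a quick interpreter check can tell in advance whether the failure in the traceback above is likely. The following is a minimal sketch, not part of the TACO repository; the function names are hypothetical, and the only thing it tests is whether inspect.getargspec exists, which is the attribute the traceback reports as missing.

```python
import inspect
import sys


def legacy_getargspec_available() -> bool:
    """Return True if this interpreter still exposes inspect.getargspec."""
    return hasattr(inspect, "getargspec")


def environment_report() -> str:
    """Summarize whether the pyext 0.7 failure from the traceback is expected here."""
    version = ".".join(str(part) for part in sys.version_info[:3])
    if legacy_getargspec_available():
        status = "inspect.getargspec is present"
    else:
        # Hypothetical wording; the traceback shows pyext.py line 117 reading this attribute.
        status = "inspect.getargspec is missing, so pyext 0.7 is expected to fail on import"
    return f"Python {version}: {status}"


if __name__ == "__main__":
    print(environment_report())
```

Running this before pip install pyext shows whether the current interpreter is affected; on an affected interpreter, switching to an older Python (the poster used 3.8) is the workaround reported in this thread.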

How to construct the appropriate output?

I am trying to evaluate codellama-7b on the easy difficulty level of the TACO dataset. I find that the generated code has to follow a particular format to pass the test cases. A program that reads from standard input and prints the result passes:
s = input()\nprint(s.swapcase())
while a bare function definition does not:
def solve(s):\n return s.swapcase()
How should I construct the output so that it is in the expected format?
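
The difference between the two forms can be reproduced locally. The sketch below is an assumption-laden illustration, not TACO's evaluation code: run_stdin_program and wrap_function_solution are hypothetical helpers, and the sample input is made up. It runs a candidate as a script with redirected standard input and shows that a bare function definition prints nothing unless a small driver is appended.

```python
import contextlib
import io
import sys

STDIN_STYLE = "s = input()\nprint(s.swapcase())"
FUNCTION_STYLE = "def solve(s):\n    return s.swapcase()"


def run_stdin_program(source: str, stdin_text: str) -> str:
    """Execute `source` as a script with `stdin_text` on stdin; return what it prints."""
    old_stdin = sys.stdin
    sys.stdin = io.StringIO(stdin_text)
    captured = io.StringIO()
    try:
        with contextlib.redirect_stdout(captured):
            exec(source, {"__name__": "__main__"})
    finally:
        sys.stdin = old_stdin  # always restore the real stdin
    return captured.getvalue()


def wrap_function_solution(source: str, func_name: str = "solve") -> str:
    """Append a driver that feeds one input line to the function and prints the result."""
    return f"{source}\nprint({func_name}(input()))\n"


if __name__ == "__main__":
    sample = "Hello World\n"  # made-up test input
    print(repr(run_stdin_program(STDIN_STYLE, sample)))
    print(repr(run_stdin_program(FUNCTION_STYLE, sample)))
    print(repr(run_stdin_program(wrap_function_solution(FUNCTION_STYLE), sample)))
```

Under these assumptions, the function-style answer produces empty output because nothing calls solve, which would explain why it fails an input/output comparison. Either prompting the model for a complete program or appending a driver like the one above brings the output into the standard-input form; which form a given problem expects is not stated in this thread.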

taco's Issues

Data contamination

AttributeError: module 'inspect' has no attribute 'getargspec'. Did you mean: 'getargs'?

How to construct the appropriate output?

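Are there any evaluation results on the test set?

Does the TACO dataset include all problems from APPS and code_contest?

compute_metric.py seems to have a problem

code-llama-7b-python accuracy does not match

How is the difficulty level obtained?

The updated evaluation framework seems to have a major bug?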

Finetuned Models

specific performance of gpt-4
