Comments (6)
EasyEdit can be easily applied to any supported model. For example, if you want to edit GPT-J through ROME
, you can run:
python run_zsre_llama2.py \
--editing_method=ROME \
--hparams_dir=../hparams/ROME/gpt-j-6B \
--data_dir=./data
- The script is called
run_zsre_llama2
because currently only key status on llama2 is open for MEMIT methods.- We will provide more status in the future and rename this script to
run_zsre
- We will provide more status in the future and rename this script to
Thank you for your reminder.
If it helps you solve the problem, please close this issue.
from easyedit.
Hello,
Thank you for your response. I have tried with FT, IKE and ROME and it worked for GPT2-XL and GPT-J-6B but it doesn't work for MEMIT. It is showing "ValueError: BuilderConfig '20200501.en' not found. Available: ['20220301.aa....".
I have downloaded model.layers.4.mlp.down_proj_float32_mom2_100000.npz etc and put it in the folder ./data/stats/._hugging_cache_gpt2-xl/wikipedia_stats/. What else should I do so that I can run MEMIT as well. Please let me know.
Thank you!
from easyedit.
As you can see in README, MEMIT
cannot bypass the computation of second-order momentum, so it requires the npz related to Wikipedia. However, we have only opened the key status of llama2 at the moment.
- You can download the second-order momentum of gpt2-xl and gpt-j-6B used in ROME and MEMIT at website of MIT. https://rome.baulab.info/data/stats/
- If you are testing the performance on the llama2, you can simply use the
npz
provided by us.
from easyedit.
If it helps you solve this problem, please close the issue. And if you want to ask a new question, please open a new issue.
from easyedit.
Hello,
Thank you for your response. I have tried with FT, IKE and ROME and it worked for GPT2-XL and GPT-J-6B but it doesn't work for MEMIT. It is showing "ValueError: BuilderConfig '20200501.en' not found. Available: ['20220301.aa....".
I have downloaded model.layers.4.mlp.down_proj_float32_mom2_100000.npz etc and put it in the folder ./data/stats/._hugging_cache_gpt2-xl/wikipedia_stats/. What else should I do so that I can run MEMIT as well. Please let me know.
Thank you!
You can download the second-order momentum of gpt2-xl and gpt-j-6B used in ROME and MEMIT at website of MIT. https://rome.baulab.info/data/stats/
from easyedit.
Hello,
Thank you for your response. I have tried with FT, IKE and ROME and it worked for GPT2-XL and GPT-J-6B but it doesn't work for MEMIT. It is showing "ValueError: BuilderConfig '20200501.en' not found. Available: ['20220301.aa....".
I have downloaded model.layers.4.mlp.down_proj_float32_mom2_100000.npz etc and put it in the folder ./data/stats/._hugging_cache_gpt2-xl/wikipedia_stats/. What else should I do so that I can run MEMIT as well. Please let me know.
Thank you!You can download the second-order momentum of gpt2-xl and gpt-j-6B used in ROME and MEMIT at website of MIT. https://rome.baulab.info/data/stats/
Thank you for your reply, downloading these weights directly is the best solution, I have updated the original reply
from easyedit.
Related Issues (20)
- One bug for “examples/run_knowedit_llama2.py” HOT 2
- Question about the function edit_dataset in EasyEdit-main/easyeditor/editors/editor.py HOT 2
- Compiling code for WikiData_counterfact and Wikidata_recent HOT 8
- Question about SERAC HOT 2
- Error when runing function test_IKE_Blip2OPT_VQA() in multimodal_edit.py HOT 3
- Error when Running IKE for Wiki Counterfactual dataset HOT 10
- Builder config error when running MEMIT HOT 9
- Differences between ft_main.py and lora_main.py HOT 2
- Full datasets loading fix HOT 1
- There is a question about 'edited_model' in easyeditor/editor.py, why edit() function always return the last edited_model only? HOT 5
- Model edition does not take effects HOT 6
- 请问是否可以设置随机种子? HOT 8
- How to apply it to new models HOT 5
- Request for 'Editing GPU memory usage' update HOT 2
- MMEdit config description HOT 15
- MEMIT方法是否能使每次的权重更新相同(在代码完全一致的情况下) HOT 6
- Problems about the results of GRACE method on Llama-2-7b HOT 13
- 请问一下论文中提到的opencompass在评估时要用到吗 HOT 2
- T-Patcher support HOT 2
- Question about E-VQA HOT 8
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from easyedit.