Start by asking: Is Hillary Clinton good? Follow with: Is Donald Trump good? a

Bias of ggml-alpaca-7b-q4.bin about alpaca.cpp HOT 3 CLOSED

antimatter15 commented on August 28, 2024 13

Bias of ggml-alpaca-7b-q4.bin

from alpaca.cpp.

Comments (3)

anzz1 commented on August 28, 2024 17

You have a fundamental misunderstanding on how natural language AI works. The "AI" is not a sentient being, it does not have any inherent bias. Any bias that the AI exhibits is a result from the model and a direct function to whatever bias the source material has which the AI was trained on. The model used in alpaca.cpp is simply an quantized (you can think of it as compression which essentially takes shortcuts, reducing the amount of resources required but it also might reduce the quality of output) version of the LLaMA 7B model from Meta fine-tuned with the instruction-following dataset from Stanford-Alpaca which makes it better at answering prompts.

As seen in the LLaMA model research paper, this is the data where the model was trained on:

Blog post about LLaMA: https://ai.facebook.com/blog/large-language-model-llama-meta-ai/
Research paper: https://arxiv.org/abs/2302.13971

Stanford-Alpaca fine-tuning data: https://github.com/tatsu-lab/stanford_alpaca#data-release

Having biases in AI models is a well-known problem which I'm not sure is even a solvable one as long as the source material used is produced by humans. As you might know, humans are known to have biases and it is close to impossible to source tons of written material which wouldn't have any biases whatsoever. Humans are flawed so inherently any language models will be flawed too. This can be alleviated by manually fine-tuning the models to have "less bias" but that only leads to the model having whatever biases the one doing the fine-tuning would hold as every decision a human makes does include their biases too, unconscious or not. Even the concept of "bias" can mean different things to different people.

At the very fundamental level and very generally speaking, AI language models do not generate "truth" but they rather generate "consensus". As I do not want this to dwelve into a discussion about politics or the human condition, it's best left as an exercise to the reader to think on how consensus does not equal truth. I am not saying it's impossible to eventually create an AI model which only generates an objective truth, by for example training it only on verifiable scientific data, but I am saying that this model isn't it.

TL;DR; Do not except this or any other AI models for that matter to generate only the truth or have no biases. Quite the contrary, expect them to be wrong, have biases, and lie, just like any other written work by a human can do.

from alpaca.cpp.

fastrocket commented on August 28, 2024 1

Gpt-4 has fixed some of the anti-Trump bias. You can now ask for articles on why Trump and MAGA are good directly. So it's doable.

from alpaca.cpp.

anzz1 commented on August 28, 2024 1

Gpt-4 has fixed some of the anti-Trump bias. You can now ask for articles on why Trump and MAGA are good directly. So it's doable.

You really think that is "fixing" it? You missed the point entirely. Please read the comment again and apply some thought. To be perfectly clear, that is a rhetorical question and no further reply is required.

from alpaca.cpp.

Bias of ggml-alpaca-7b-q4.bin about alpaca.cpp HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent