ded42r / google-rules-of-machine-learning-rus Goto Github PK

View Code? Open in Web Editor NEW

This project forked from thundergolfer/google-rules-of-machine-learning

15.0 15.0 1.0 120 KB

Перевод руководства Мартина Зинкевича "Правила машинного обучения"

License: MIT License

guideline machine-learning mirror translation

google-rules-of-machine-learning-rus's People

Contributors

Stargazers

Watchers

Forkers

musicdendy

google-rules-of-machine-learning-rus's Issues

Правило 43

Rule 43 - Your friends tend to be the same across different products. Your interests tend not to be.

Как перевести?
Проверить перевод всего правила -верно ли смысл передал

Правило 35

These approaches are all ways to favor data that your model has already seen.
Как перевести?

Thus, any good feature will be better than a feature that is “unknown”.
Смысле не ясен. Как перевести?

Перевод правила 20

The two most standard approaches are “discretizations” and “crosses”.
как перевести crosses?

Перевод правила 22

At the same time, some features may punch above their weight.
литературно нужно перевести.

Поправить перевод самого правила 17

Перевести более понятно фразы:

reported features
learned features.

Перевести правило 33

Since there might be daily effects, you might not predict the average click rate or conversion rate, but the area under the curve, which represents the likelihood of giving the positive example a score higher than a negative example, should be reasonably close.

Правило 30

Как корректно перевести "Importance weight"?

Правило 40

To keep things simple, each model should either be an ensemble only taking the input of other models, or a base model taking many features, but not both

Перевод: Чтобы сохранить простоту, каждая модель должна быть либо ансамблем, принимая на вход результаты других модели, либо базовой моделью, использующей множество признаков, но не оба варианта сразу.

проверить корректность перевода

You also want to enforce properties on these ensemble models.

Перевести корректно

Перевод правила 23

Consider the cost of 9 engineers sitting in a one hour meeting, and think of how many contracted human labels that buys on a crowdsourcing platform.

как перевести contracted human labels ?

Правило 27

turn their gripes into solid numbers
как перевести gripes

Перевод правила 19

Identifiers of documents being retrieved and canonicalized queries do not provide much generalization, but align your ranking with your labels on head queries.

Что такое "canonicalized queries"?
Что такое head queries?

Правило 36

Having the model be the sum of a function of the positional features and a function of the rest of the features is ideal.
не ясен смысл

Правило 25

If you predict the probability that a document is spam and then have a cutoff on what is blocked, then the precision of what is allowed through matters

поработать над переводом