Splink 4: <div class="highlight highlight-source-python notranslate position-relat

I'd possibly set the default value to 1e6</code

`linker.estimate_u_using_random_sampling` fails with default arguments, with no clear indication why about splink HOT 3 CLOSED

ADBond commented on June 23, 2024

`linker.estimate_u_using_random_sampling` fails with default arguments, with no clear indication why

from splink.

Comments (3)

ADBond commented on June 23, 2024 1

I'd possibly set the default value to 1e6 which will be quick to compute and will produce 'reasonably decent' estimates. And make sure we try to ensure in examples/tutorials we always explicitly set it to sensible values

My only concern with that is that we don't really have a mechanism for flagging that estimates may be unreliable - I'd worry some users might not look at the docs and just use the default, and then get (somewhat) unreliable estimates. I guess we could emit a warning if it's left at this value that they may want to increase the number for more serious estimation?

from splink.

RobinL commented on June 23, 2024 1

Agreed - i think a warning if it's left as the default should suffice

from splink.

RobinL commented on June 23, 2024

Tricky, especially since it's unlikely the user will have an intuition for the ''correct'/'best' parameter (it wouldn't be very good if they set it to like, 100 or something!)

I'd possibly set the default value to 1e6 which will be quick to compute and will produce 'reasonably decent' estimates. And make sure we try to ensure in examples/tutorials we always explicitly set it to sensible values

from splink.

`linker.estimate_u_using_random_sampling` fails with default arguments, with no clear indication why about splink HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent