
Comments (6)

HeikoPaulheim commented on May 29, 2024

Thanks @mberr! With these changes, I also got it to converge :-)


mberr commented on May 29, 2024

Hi @HeikoPaulheim,

first of all, thank you for your interest in knowledge graph embeddings and in our library in particular!

The result indeed looks unexpected, but I can see a few possible reasons for it:

  1. The model has not converged. To check this, could you please take a look at the loss history? You can do so, e.g., by
from matplotlib import pyplot as plt
plt.plot(result.losses)

# Or, if you want to use our builtin plot method for this
result.plot_losses() 

(Addition from @cthoyt: there's actually a convenience function for this already :) I added it to the above code block under the Or, ... comment.)

Does the loss decrease? If not, you may need to tune some parameters, e.g. the optimizer's learning rate or the number of epochs you train the model. You can do so by passing custom optimizer_kwargs and training_kwargs to the pipeline:

from pykeen.pipeline import pipeline

result = pipeline(
    training_triples_factory=tf,
    testing_triples_factory=tf,
    model='TransE',
    model_kwargs=dict(embedding_dim=2),
    random_seed=1,
    device='cpu',
    optimizer_kwargs=dict(lr=1.0e-01),
    training_kwargs=dict(num_epochs=1024),
)

As a general rule of thumb, increase the number of epochs until you observe some form of convergence. Start with a large learning rate; if you observe divergence, decrease it. Please note that there is some stochasticity in the training, since we sample negative examples for the positive ones, so the loss may fluctuate naturally. To see the trend more clearly, you can smooth the loss by averaging over a window of epochs, e.g. as in the sketch below.
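
For illustration, a minimal smoothing sketch (the window size of 10 is an arbitrary choice, not a recommendation):

import numpy as np
from matplotlib import pyplot as plt

losses = np.asarray(result.losses)
window = 10  # arbitrary smoothing window
# moving average over epochs; 'valid' drops the partially covered edges
smoothed = np.convolve(losses, np.ones(window) / window, mode='valid')
plt.plot(smoothed)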

  2. We use a margin-based loss with TransE by default. Thus, it suffices if the model predicts scores such that the scores of positive and negative triples are at least one margin apart. Once the model has reached this state, it will not improve further on these examples, as the embeddings are "good enough" (see the sketch after the code block below). Hence, an optimal solution under a margin-based loss might not look like the exact geometric solution. If you want to change that, you can switch to a loss function which does not use a margin, e.g. the softplus loss, by passing loss="softplus" to the pipeline:
from pykeen.pipeline import pipeline

result = pipeline(
    training_triples_factory=tf,
    testing_triples_factory=tf,
    model='TransE',
    model_kwargs=dict(embedding_dim=2),
    random_seed=1,
    device='cpu',
    loss='softplus',
)
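
To make the "good enough" intuition concrete, here is a minimal sketch of the margin-based idea (not PyKEEN's actual implementation; it assumes TransE's convention that lower scores mean more plausible triples):

def margin_loss(pos_score: float, neg_score: float, margin: float = 1.0) -> float:
    # Once the negative triple scores worse than the positive one by at least
    # the margin, the loss is zero and the gradient vanishes, so the optimizer
    # stops pushing these embeddings toward the exact geometric solution.
    return max(0.0, margin + pos_score - neg_score)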

When applying both changes, I obtain a result which is closer to what you would expect.

from pykeen.pipeline import pipeline

result = pipeline(
    training_triples_factory=tf,
    testing_triples_factory=tf,
    model='TransE',
    model_kwargs=dict(embedding_dim=2),
    random_seed=1,
    device='cpu',
    loss='softplus',
    optimizer_kwargs=dict(lr=1.0e-01),
    training_kwargs=dict(num_epochs=128),
)

[loss curve plot]
[embedding plot]


cthoyt commented on May 29, 2024

@mberr will you add a PR with the code for generating that plot? I'm sure other people will find it useful!


mberr commented on May 29, 2024

> @mberr will you add a PR with the code for generating that plot? I'm sure other people will find it useful!

The loss plot is already in the repository, although I re-implemented it 🙈

As for the second plot, its usage seems restricted to this toy example, as long as we do not use some dimensionality reduction strategy to project the embedding space into two dimensions; a sketch of that is below.
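
For higher-dimensional embeddings, a projection along these lines should work (a sketch using sklearn's PCA; untested):

from sklearn.decomposition import PCA

# project the entity embeddings to 2D before scattering/annotating them
e_emb = result.model.entity_embeddings.weight.detach().numpy()
e_emb_2d = PCA(n_components=2).fit_transform(e_emb)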

Moreover, the relation arrow heads somehow got lost in my plot compared to @HeikoPaulheim's version (see the aside after the code).

Here is the code used to reproduce my second plot:

from matplotlib import pyplot as plt

# draw entities
e_emb = result.model.entity_embeddings.weight.detach().numpy()
e_id_to_label = {
    eid: el
    for el, eid in tf.entity_to_id.items()
}
for i, e in enumerate(e_emb):
    plt.scatter(*e, color='black')
    plt.annotate(e_id_to_label[i], e)

# draw relations
r_emb = result.model.relation_embeddings.weight.detach().numpy()
r_id_to_label = {
    rid: rl
    for rl, rid in tf.relation_to_id.items()
}
for i, r in enumerate(r_emb):
    plt.arrow(0, 0, *r)
    plt.annotate(r_id_to_label[i], r)
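
(As an aside, a likely cause for the missing heads: plt.arrow scales the head with its width parameter, which is tiny by default, so passing explicit head sizes should make them visible. The concrete values here are guesses for this toy scale:)

plt.arrow(0, 0, *r, head_width=0.05, head_length=0.05)  # head sizes are assumptions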


mberr commented on May 29, 2024

As additional information: the default value of num_epochs is 5, which might be more appropriate for larger datasets than for smaller ones.

training_kwargs.setdefault('num_epochs', 5)


cthoyt commented on May 29, 2024

In #99, I've built on @mberr's code and implemented the functionality for generating this plot for arbitrarily large entity and relation embeddings, using a choice of the dimensionality reduction techniques that come with sklearn (e.g., PCA, KPCA, Gaussian Process). It also comes with a user-facing function, PipelineResult.plot(), that does all the hard work and produces this:

[embedding plot]

There's a demo notebook with the toy example here: https://github.com/pykeen/pykeen/blob/add-entity_plot/notebooks/Pipeline%20Plots%20Demo.ipynb. @mberr @HeikoPaulheim, could you suggest tweaks to make it converge and get the desired results? Update: I had forgotten to copy loss='softplus', even after @mberr wrote up such a nice explanation of why it is important. Now it's looking much nicer as of 18abc44.

[updated embedding plot]
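
For reference, based on the description above, usage should be as simple as:

result = pipeline(...)  # as in the earlier examples
result.plot()  # 2D entity/relation plot, with dimensionality reduction where needed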

