hazyresearch / HypHC
Hyperbolic Hierarchical Clustering.
The paper says that a top-down greedy approach is used to decode the binary tree from the embedding.
However, from the code it looks like a bottom-up approach based on the angle similarity matrix is used instead — is that correct?
If I want to obtain a hierarchical clustering of the nodes from the final Poincaré embeddings, is single-linkage agglomerative clustering on the angle similarity matrix between the normalized embedding points equivalent to the tree-decoding algorithm implemented in this code?
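For concreteness, the single-linkage variant being asked about can be sketched with SciPy; note this is just the questioner's proposed stand-in (cosine similarity of normalized points as the merge score), not necessarily the decoding HypHC actually implements:

```python
import numpy as np
from scipy.cluster.hierarchy import linkage
from scipy.spatial.distance import squareform

def single_linkage_from_angles(embeddings):
    # Normalize points to the unit sphere and use cosine similarity as the
    # merge score -- an assumed proxy for the angle-based similarity matrix,
    # not a claim of equivalence to HypHC's decoder.
    x = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sim = x @ x.T
    dist = np.clip(1.0 - sim, 0.0, None)  # similarity -> dissimilarity, clip float noise
    np.fill_diagonal(dist, 0.0)
    # squareform converts the square matrix to the condensed form linkage expects
    return linkage(squareform(dist, checks=False), method="single")

Z = single_linkage_from_angles(np.random.randn(10, 2))
```

`Z` is a standard SciPy linkage matrix with one merge per row, so it can be cut or plotted with the usual `scipy.cluster.hierarchy` tools.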
Hi!
Before moving to more flexible learning, I'd like to learn hyperbolic embeddings for the leaves of an existing tree (one produced by an agglomerative image-region-merging procedure), such that applying HypHC's tree-decoding algorithm to those embeddings recovers my original agglomerative clustering tree.
Can I do this within the HypHC framework and losses?
I thought of using tree shortest-path distances to build the similarity matrix.
The tricky part is that the similarity matrix obtained this way is extremely sparse, so randomly sampled triplets almost always have zero similarities and the model learns nothing.
Would you have any advice?
Thank you :)
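One possible workaround for the sparsity (an assumption on my part, not something from the HypHC paper): turn the tree's shortest-path distances into a dense similarity matrix via a monotone decay, so every sampled triplet carries a nonzero signal. A minimal sketch with NetworkX:

```python
import networkx as nx
import numpy as np

def dense_tree_similarities(tree, leaves):
    # Dense similarities from tree shortest-path distances: closer leaves get
    # higher similarity via exp(-d). The decay exp(-d) is an arbitrary choice;
    # any strictly decreasing function of the path length would do.
    n = len(leaves)
    dist = dict(nx.all_pairs_shortest_path_length(tree))
    sim = np.zeros((n, n))
    for i, u in enumerate(leaves):
        for j, v in enumerate(leaves):
            sim[i, j] = np.exp(-dist[u][v])
    return sim

# toy example: a depth-2 binary tree; its leaves are the degree-1 nodes
T = nx.balanced_tree(2, 2)
leaves = [v for v in T.nodes if T.degree[v] == 1]
S = dense_tree_similarities(T, leaves)
```

Because `exp(-d)` is strictly decreasing in the path length, sibling leaves always score higher than leaves merged further up the tree, which is the ordering the triplet loss needs.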
Hi,
Could you please share example parameter configurations for the mid- and large-scale datasets — for example, one for Segmentation and one for CIFAR-100?
Thank you!
Hi,
Thanks for the great work and well documented code.
Currently, it seems the hyperbolic embedding lives in a Poincaré disk/ball of curvature -1. In my own application, I would like to work with arbitrary curvature -c, for c > 0.
Can I do this by modifying only utils/poincare.py, or do I also need to make changes in optim and elsewhere?
Thanks in advance for your valuable time.
Best,
Eli
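For reference, the curvature-c generalization of the Poincaré distance via Möbius addition (the standard formulation from the hyperbolic-neural-networks literature, not HypHC's own code) can be sketched in PyTorch as follows — at c = 1 it reduces to the curvature -1 distance the repo uses:

```python
import torch

def mobius_add(x, y, c):
    # Mobius addition on the Poincare ball of curvature -c
    x2 = (x * x).sum(dim=-1, keepdim=True)
    y2 = (y * y).sum(dim=-1, keepdim=True)
    xy = (x * y).sum(dim=-1, keepdim=True)
    num = (1 + 2 * c * xy + c * y2) * x + (1 - c * x2) * y
    denom = 1 + 2 * c * xy + c * c * x2 * y2
    return num / denom.clamp_min(1e-15)

def hyp_distance(x, y, c):
    # d_c(x, y) = (2 / sqrt(c)) * artanh(sqrt(c) * ||(-x) (+)_c y||)
    sqrt_c = c ** 0.5
    norm = mobius_add(-x, y, c).norm(dim=-1)
    norm = norm.clamp(max=(1 - 1e-5) / sqrt_c)  # stay strictly inside the ball
    return (2.0 / sqrt_c) * torch.atanh(sqrt_c * norm)

d = hyp_distance(torch.zeros(1, 2), torch.tensor([[0.5, 0.0]]), c=1.0)
```

If you swap such a distance into utils/poincare.py, the Riemannian optimizer's retraction/exponential map would also need the matching curvature-c scaling, which is why changes in optim are likely needed as well.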
Hi @ines-chami!
At https://github.com/HazyResearch/HypHC/blob/master/model/hyphc.py#L42 :

```python
init_size = 1e-3       # in config.py also "init_size": 1e-3
max_scale = 1. - 1e-3  # in config.py also "max_scale": 1 - 1e-3
self.scale = nn.Parameter(torch.Tensor([init_size]), requires_grad=True)
...
min_scale = 1e-2  # self.init_size
max_scale = self.max_scale
return F.normalize(embeddings, p=2, dim=1) * self.scale.clamp_min(min_scale).clamp_max(max_scale)
```
So self.scale (always initialized to init_size = 1e-3) starts outside the clamp range [min_scale = 1e-2, max_scale = 1 - 1e-3], and since clamp passes zero gradient outside its range, self.scale always receives zero gradient.
Is this expected / by design, or was min_scale = 1e-2 a debug setting that was left in by mistake?
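The zero-gradient claim is easy to check in isolation — a minimal PyTorch snippet showing that clamp blocks the gradient when the parameter sits below the clamp minimum, as with init_size = 1e-3 and min_scale = 1e-2:

```python
import torch

# A parameter initialized below the clamp minimum, mirroring the issue above.
scale = torch.tensor([1e-3], requires_grad=True)

out = scale.clamp_min(1e-2).clamp_max(1.0 - 1e-3)
out.sum().backward()

# clamp_min's gradient is zero wherever the input is below the minimum,
# so the parameter can never move back into the active range.
print(scale.grad)
```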
Hi, all,
After I run the scripts examples/run_zoo.sh and examples/run_glass.sh, the trained models and log files are saved under ./embeddings as follows:
```
root@Lab-PC:/data/code14/HypHC/embeddings# tree .
.
├── glass
│   └── bc24ee553d3178d956bbb7a68d6058f5a48edb408c82a49e613a344d71602abf
│       ├── config.json
│       ├── model_0.pkl
│       └── train_0.log
└── zoo
    └── ca286289368cbe66abdfe32406cffed3ff5e6102252c0fc6502519713db04b83
        ├── config.json
        ├── model_0.pkl
        └── train_0.log
```
How can I visualize the hyperbolic embedding, similar to the demo file HypHC.gif?
Thanks~
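A basic static version of such a visualization can be sketched with matplotlib, assuming you can extract the leaf embeddings from model_0.pkl as an (n, 2) array of points in the unit disk (the loading step itself depends on the checkpoint format, so random points stand in here):

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # render without a display
import matplotlib.pyplot as plt

def plot_poincare(embeddings, path="embedding.png"):
    # Scatter the 2-D embeddings inside the boundary circle of the Poincare disk.
    fig, ax = plt.subplots(figsize=(5, 5))
    ax.add_patch(plt.Circle((0, 0), 1.0, fill=False, color="black"))
    ax.scatter(embeddings[:, 0], embeddings[:, 1], s=10)
    ax.set_xlim(-1.05, 1.05)
    ax.set_ylim(-1.05, 1.05)
    ax.set_aspect("equal")
    ax.axis("off")
    fig.savefig(path, bbox_inches="tight")
    plt.close(fig)

# stand-in data: random points well inside the disk
plot_poincare(np.random.uniform(-0.5, 0.5, size=(50, 2)))
```

Drawing the decoded tree edges as geodesic arcs between merged points (as in the gif) would go on top of this scatter, but requires the decoded tree as well.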
Thank you for sharing your fantastic work! I am wondering whether you have any plans to develop a sklearn-like interface, for example:

```python
alg = HypHC(*args)
alg.fit(X)
labels = alg.predict(X)
```

Thank you!
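The shape of such a wrapper is easy to sketch; the internals below use SciPy's agglomerative clustering purely as a stand-in for HypHC's training-plus-decoding pipeline (the class name and methods are hypothetical, not part of the repo), just to illustrate the requested fit/predict interface:

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster
from scipy.spatial.distance import pdist

class HierClusterer:
    """Sklearn-style wrapper skeleton. SciPy average-linkage stands in for
    HypHC training + tree decoding; only the interface is the point here."""

    def __init__(self, n_clusters=2):
        self.n_clusters = n_clusters
        self.labels_ = None

    def fit(self, X):
        # In a real wrapper, this is where embeddings would be trained and
        # the tree decoded; here we just build a linkage and cut it.
        Z = linkage(pdist(np.asarray(X)), method="average")
        self.labels_ = fcluster(Z, t=self.n_clusters, criterion="maxclust")
        return self

    def fit_predict(self, X):
        return self.fit(X).labels_

# two well-separated blobs should land in two different clusters
X = np.vstack([np.zeros((5, 2)), 10.0 * np.ones((5, 2))])
labels = HierClusterer(n_clusters=2).fit_predict(X)
```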