Git Product home page Git Product logo

mask-attention-free-transformer's People

Contributors

x-lai avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

mask-attention-free-transformer's Issues

Question about mask in cross attention component

Hi @X-Lai , Thanks for sharing this great work!

What is the purpose of attn_masks in your transformer decoder? In your paper, you mentioned that mask-attention-free-transformer.

# get mask
pred_masks = torch.einsum('nbd,mbd->bnm', output_norm, mask_feats_batched) #[bsz, num_queries, max_length]
attn_masks = (pred_masks.sigmoid() < self.attn_mask_thresh).bool() #[bsz, tgt_len, src_len]
for b in range(lengths.shape[0]):
length = lengths[b]
attn_masks[b, (attn_masks[b, :, :length].sum(-1) == length)] = False
attn_masks[b, :, length:] = True
attn_masks = attn_masks.unsqueeze(1).expand(-1, self.nhead, -1, -1).contiguous().flatten(0,1)

Thank you.

What is the license for the code?

Hi,

I'm interested in potentially building on some of your code. Can you please clarify what usage license this repository is released under?

Question about the pretrained weights in training

I observed that you utilized the pretrained weights from SSTNet during the training process. However, the SSTNet model's original input does not encompass normal information. In the context of incorporating normal information into the model, the question arises: which pretrained weights should be employed post the integration of normal information?

Reproduction of the experimental results in Fig. 1

Hi, how can I reproduce the experimental results shown in Fig.1? I tried modifying the epoch parameter in the configs directly, but the results were significantly different. Can you assist me with this issue?

Questions about the training loss and backbone configuration

In loss.py, I think a score loss is also used to train the model but it is not mentioned in the paper.
Can you provide some insight into this? I might be overlooking something...

Additionally, is there a specific reason for using the Minkowski engine as a backbone in the S3DIS dataset?"

How to generate superpoints for S3DIS

Dear authors,

Thanks for your job. My question is that are there any methods or repos can be referred to generate superpoints for S3DIS. And is there any plan to release the training code for S3DIS?

Best

The normal information in training

Hi, XinLai, thank you for you great work.

  1. I found that the code supports the sstnet pretrain weight, if I want to train with the normal information, could the pretrain backbone be released, or the backbone is trained the same as sstnet, just need to change the input channel into 9?
  2. I'd like to know more about the S3DIS training too.
    Thank you!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.