You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Why is the initial control point queries the sum of shared control point query embedding and coarse bounding box coordinate embedding, while the initial character queries are the sum of shared character query embedding and 1D positional encoding? What I mean is, if coarse bounding box coordinate embedding is useful, why not add it to initial character queries? The same confusion also exists for 1D positional encoding. If the coarse bounding box coordinate embedding and 1D positional encoding are removed at the same time, will the performance of the model decrease? Looking forward to your reply, thanks!
The text was updated successfully, but these errors were encountered:
Why is the initial control point queries the sum of shared control point query embedding and coarse bounding box coordinate embedding, while the initial character queries are the sum of shared character query embedding and 1D positional encoding? What I mean is, if coarse bounding box coordinate embedding is useful, why not add it to initial character queries? The same confusion also exists for 1D positional encoding. If the coarse bounding box coordinate embedding and 1D positional encoding are removed at the same time, will the performance of the model decrease? Looking forward to your reply, thanks!
The text was updated successfully, but these errors were encountered: