You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Jul 26, 2019. It is now read-only.
I used the following notebook to fine tune it using BERT-keras.
(As described in the paper, I just added a classification layer using the CLS token of the output of BERT).
However, there is great differences in performance and results between this notebook and the script used in the official version for fine tuning :
BERT-keras
Official BERT
Pearson
0.0254
0.8956
Spearman
0.0289
0.7942
MSE
2.2691
0.5456
Training time
9h
10min
Note : Pearson / Spearman and correlation metrics used to evaluate the accuracy on the STS-B dataset
Why there is such a difference between the 2 approach ?