prithivida/bert-for-patents-64d简介
发布时间:2026-05-17 01:00:09
文章来源:www.cxwl.com
访问次数:5
Motivation
This model is based on anferico/bert-for-patents – a BERTLARGE model (See next section for details below). By default, the pre-trained model’s output embeddings with size 768 (base-models) or with size 1024 (large-models). However, when you store Millions of embeddings, this can require quite a lot of memory/storage. So have reduced the embedding dimension to 64 i.e 1/16th of 1024 using Principle Component Analysis (PCA) and it still gives a comparable performance. Yes! PCA gives better performance than NMF. Note: This process neither improves the runtime, nor the memory requirement for running the model. It only reduces the needed space to store embeddings, for example, for semantic search using vector databases.
BERT for Patents
BERT for Patents is a model trained by Google on 100M+ patents (not just US patents).
If you want to learn more about the model, check out the blog post, white paper and GitHub page containing the original TensorFlow checkpoint.

Projects using this model (or variants of it):
- Patents4IPPC (carried out by Pi School and commissioned by the Joint Research Centre (JRC) of the European Commission)
标签:漫画下载,pdf漫画下载,跨境电商,媒体,独立站,百度文库,站联,影音网站,PanDownload,其它网站
关于文章《prithivida/bert-for-patents-64d简介》特别声明
《prithivida/bert-for-patents-64d简介》更新日期为:2026-05-17 01:00:09;目前浏览的小伙伴达到5,初夏导航所有作品(图文、音视频)均由用户自行上传分享,仅供网友学习交流。若您的权利被侵害,请联系

