Wikipedia talk:Size of Wikipedia

Latest comment: 2 months ago by Blackballnz in topic Is this number correct?

Embeddings Size

edit

Would be nice to get a summary on vector embeddings with all-MiniLM-L6-v2, as well as discussion of potential tradeoffs regarding partial decompression, and compression algorithms... Wesxdz (talk) 02:33, 4 October 2023 (UTC)Reply

It's about 120GB, roughly the same size as Wikipedia text currently.
https://huggingface.co/datasets/Cohere/wikipedia-22-12-en-embeddings?ref=txt.cohere.com Wesxdz (talk) 23:34, 16 October 2023 (UTC)Reply
Thank you so much. Johnny Au (talk/contributions) 02:13, 1 August 2024 (UTC)Reply

Is this number correct?

edit

Hi, am I missing something here?

Wikipedia continues to grow, and the number of articles on Wikipedia is increasing by about 140,000 a month (as of January 2024). The number of articles added to Wikipedia every month reached its peak in 2006, at over 50,000 new articles a month.

Is this a typo & 140,000 should be 14,000? Blackballnz (talk) 02:23, 21 September 2024 (UTC)Reply