RITCH - R Parser for the ITCH-Protocol
Allows to efficiently parse, filter, and write binary ITCH Files (Version 5.0) containing detailed financial transactions as distributed by NASDAQ to an R data.table.
Last updated 7 months ago
cpp
4.32 score 18 stars 23 scripts 303 downloadsrtiktoken - A Byte-Pair-Encoding (BPE) Tokenizer for OpenAI's Large Language Models
A thin wrapper around the tiktoken-rs crate, allowing to encode text into Byte-Pair-Encoding (BPE) tokens and decode tokens back to text. This is useful to understand how Large Language Models (LLMs) perceive text.
Last updated 4 months ago
rustcargo
4.22 score 11 stars 3 scripts 547 downloadsdataverifyr - A Lightweight, Flexible, and Fast Data Validation Package that Can Handle All Sizes of Data
Allows you to define rules which can be used to verify a given dataset. The package acts as a thin wrapper around more powerful data packages such as 'dplyr', 'data.table', 'arrow', and 'DBI' ('SQL'), which do the heavy lifting.
Last updated 1 years ago
verification
4.13 score 27 stars 7 scripts 325 downloadsrbm25 - A Light Wrapper Around the 'BM25' 'Rust' Crate for Okapi BM25 Text Search
BM25 is a ranking function used by search engines to rank matching documents according to their relevance to a user's search query. This package provides a light wrapper around the 'BM25' 'rust' crate for Okapi BM25 text search. For more information, see Robertson et al. (1994) <https://trec.nist.gov/pubs/trec3/t3_proceedings.html>.
Last updated 2 months ago
rustcargo
3.90 score 4 stars 210 downloads