tokenizers â ð¥ Fast State-of-the-Art Tokenizers optimized for Research and Production
å®è£ é£æåºŠ
Easy
æšè«ã»åŠç¿ã³ã¹ã
Medium
æ³å®çšé
ããŒã¯ãã€ã¶ãŒã®é«éå
æŠèŠ
ããŒã¯ãã€ã¶ãŒã®é«éåãç®çãšããŠããã©ã€ãã©ãªã
äœãæ°ããã
ããŒã¯ãã€ã¶ãŒã®é«éåãç®çãšããŠããã©ã€ãã©ãªã
äœã«äœ¿ããã
ããŒã¯ãã€ã¶ãŒã®é«éå
å®è£ æ å ±
- GitHub URL
- ãã
å®è£ ãã§ãã¯ãªã¹ã
å®è£ ãŸãã¯é åžããŒãž
OKã³ãŒããŸãã¯ã¢ãã«é åžããŒãžããæ€èšŒãå§ããããŸãã
äžæ¬¡æ å ±ãªã³ã¯
OKGitHub
æ€èšŒãããã
OKå®è£ ãŸãã¯ã¢ãã«é åžããŒãžãã詊ããå¯èœæ§ãé«ãã§ãã
èšç®è³æº
æªååŸæšè«äžå¿ãªã軜ãã§ãããååŠç¿æã¯GPUãå¿ èŠã«ãªãå¯èœæ§ããããŸãã
ã©ã€ã»ã³ã¹
æªååŸé åžå ã®LICENSEãã¢ãã«ã«ãŒããPaperã®å©çšæ¡ä»¶ã確èªããŠãã ããã
åçšå©çš
æªååŸç ç©¶å©çšéå®ãããŒã¿ã»ããç±æ¥å¶éãAPIèŠçŽã®æç¡ã確èªããŠãã ããã
èªç€ŸããŒã¿ã§è©Šããªã
è£œé æ¥ã»ææéçºã®Excel/CSVããŒã¿ã«èœãšã蟌ãããã®æåã®æé ã§ãã
- 1ãŸãèªç€ŸããŒã¿ããå ¥åæ¡ä»¶ãç®ç倿°ãè©äŸ¡ãããææšã«åããŠæŽçããŸãã
- 2LightGBMãRandom Forestãªã©ã®ããŒã¹ã©ã€ã³ãå ã«äœãããã®ææ³ãšæ¯èŒããŸãã
- 3è©äŸ¡ææšã¯R2/RMSEãAUCãç°åžžæ€ç¥ã®åçŸçãå®éšåæ°åæžçãªã©ãçŸå Žã®æææ±ºå®ã«è¿ããã®ãéžã³ãŸãã
- 4SHAPãç¹åŸŽééèŠåºŠã§ãå¹ããŠããå åãç©çã»ååŠã»å·¥çšç¥èãšççŸããªãã確èªããŸãã
å®è£ é£æåºŠ
Easy - å®è£ ãŸãã¯ã¢ãã«é åžããŒãžãã詊ããå¯èœæ§ãé«ãã§ãã
å¿ èŠãªãœãŒã¹
- GPUç®å®: Medium
- ããŒã¿ã»ãã: è«æã»ãªããžããªåŽã®æå®ã確èªããŠãã ããã
- åŠç¿èŠåŠ: æšè«ã ãã§è©Šããå¯èœæ§ããããŸãã
- æšè«äžå¿ãªã軜ãã§ãããååŠç¿æã¯GPUãå¿ èŠã«ãªãå¯èœæ§ããããŸãã
å®åã§äœ¿ãå Žåã®æ³šæç¹
- ã©ã€ã»ã³ã¹ãšåçšå©çšæ¡ä»¶ã¯ãPaper / GitHub / Hugging Face ã®é åžå ã§ç¢ºèªããŠãã ããã
- 粟床ãåçŸæ§ãèšç®ã³ã¹ãã¯ããŒã¿ã»ãããè©äŸ¡æ¡ä»¶ã«äŸåããŸãã
- å人æ å ±ãæ©å¯ããŒã¿ãæ±ãå Žåã¯ãå ¥åããŒã¿ã®ä¿åå ãšå€éšAPIå©çšæ¡ä»¶ã確èªããŠãã ããã
é¢é£èšäº
vllm â A high-throughput and memory-efficient inference and serving engine for LLMs
ãã®ãªããžããªã§ã¯ãç§çãªAIãã©ãããã©ãŒã ã§ããDocGPTãæäŸããŠããŸãã
transformers â ð€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
ð€ Transformersã¯ãããã¹ãã»ããžã§ã³ã»é³å£°ãªã©è€éãªã¢ãã«å®çŸ©ããµããŒããããã¬ãŒã ã¯ãŒã¯ã§ãã€ã³ãã§ã¬ã³ã¹ã¿ãŒããã¬ãŒãã³ã°ã«äœ¿çšã§ããã
system_prompts_leaks â Extracted system prompts from Anthropic - Claude Fable 5, Opus 4.8, Claude Code, Claude Design. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex. Google - Gemini 3.5 Flash, 3.1 Pro, Antigravity. xAI - Grok, Cursor, Copilot, VS Code, Perplexity, and more. Updated regularly.
æ¬è«æã¯ãèšèªã¢ãã«ã®æé©åã«äœ¿çšããã Hyperparameter Transfer ãéåãããã¬ãŒã ã¯ãŒã¯ãéçºããŸãããã®ãã¬ãŒã ã¯ãŒããŒã¯ã¯ã3 ã€ã®ã¡ããªãã¯ã¹ã䜿çšãããã®ãã¡ã® 1 ã€ã¯ãhyperpa
openvino â OpenVINO⢠is an open source toolkit for optimizing and deploying AI inference
ãªãŒãã³ãœãŒã¹ã®AIæšè«æé©åãšå±éçšããŒã«ãããã§ãã