Hawk and Griffin: Mastering Long-Context Extrapolation in AI Post date January 14, 2025 Post author By Gating Post categories In ai-extrapolation, deep-learning, efficient-ai, griffin-model, hawk-model, language-models, long-context-ai, token-prediction
Griffin Model: Advancing Copying and Retrieval in AI Tasks Post date January 14, 2025 Post author By Gating Post categories In ai-extrapolation, copying-tasks, deep-learning, efficient-ai, griffin-model, language-models, retrieval-tasks, transformers
Hawk and Griffin Models: Superior Latency and Throughput in AI Inference Post date January 14, 2025 Post author By Gating Post categories In ai-inference, deep-learning, efficient-ai, griffin-model, hawk-model, high-throughput, low-latency, transformers
Efficient Training: Scaling Griffin Models for Large-Scale AI on TPUs Post date January 14, 2025 Post author By Gating Post categories In ai-model-scaling, ai-research, deep-learning, efficient-training, griffin-model, model-parallelism, scalable-ai, tpu-optimization
Hawk and Griffin Models: Superior NLP Performance with Minimal Training Data Post date January 13, 2025 Post author By Gating Post categories In ai-research, deep-learning, efficient-ai, griffin-model, hawk-model, llama-v2, nlp-performace, rnn-models
Griffin Models: Outperforming Transformers with Scalable AI Innovation Post date January 13, 2025 Post author By Gating Post categories In ai-research, chinchilla-scaling, deep-learning, efficient-ai, griffin-model, rnn-models, scalable-ai, transformers
Recurrent Models Scale as Efficiently as Transformers Post date January 13, 2025 Post author By Gating Post categories In deep-learning, efficient-ai, griffin-model, hybrid-ai, nlp-scaling, rnn-models, sequence-processing, transformers