Training and Testing Data Formats for AnLLM Models | Post date: October 11, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
Anchor-based Large Language Models: More Experimental Results | Post date: October 11, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
Practical LLMs for Real-World Applications | Post date: October 11, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
Anchor-based Large Language Models: Analysis | Post date: October 11, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
Benchmarking AnLLMs: Insights from OpenBookQA to BoolQ | Post date: October 10, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
Pre-Training AnLLMs: Leveraging RedPajama Data for Enhanced Performance | Post date: October 10, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
Anchor-based Large Language Models: Experiments and Implementation | Post date: October 10, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
Improving Real-Time Inference with Anchor Tokens | Post date: October 10, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
The Role of Anchor Tokens in Self-Attention Networks | Post date: October 10, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
Unlocking the Mechanics of Decoder-Only Transformers and Self-Attention | Post date: October 10, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
How Anchor Tokens Transform Sequence Information Compression in LLMs | Post date: October 10, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, decoder-only-architecture, gpu-memory-optimization, in-context-learning, natural-language-modeling, transformer-architecture
Anchor-based Large Language Models | Post date: October 10, 2024 | Post author: Anchoring | Post categories: anchor-based-llms, anchor-self-attention-network, anllms, gpu-memory-optimization, hackernoon-top-story, in-context-learning, natural-language-modeling, transformer-architecture
Where does In-context Translation Happen in Large Language Models: Where does In-context MT happen? | Post date: August 30, 2024 | Post author: Computational Technology for All | Post categories: context-masking-experiments, fine-tuning-llms, gpt-models, in-context-learning, large-language-models, machine-translation, supervised-neural-mt-models, translation-models
Where does In-context Translation Happen in Large Language Models: Conclusion | Post date: August 30, 2024 | Post author: Computational Technology for All | Post categories: context-masking-experiments, fine-tuning-llms, gpt-models, in-context-learning, large-language-models, machine-translation, supervised-neural-mt-models, translation-models
Where does In-context Translation Happen in Large Language Models: Further Analysis | Post date: August 30, 2024 | Post author: Computational Technology for All | Post categories: context-masking-experiments, fine-tuning-llms, gpt-models, in-context-learning, large-language-models, machine-translation, supervised-neural-mt-models, translation-models
Where does In-context Translation Happen in Large Language Models: Inference Efficiency | Post date: August 30, 2024 | Post author: Computational Technology for All | Post categories: context-masking-experiments, fine-tuning-llms, gpt-models, in-context-learning, large-language-models, machine-translation, supervised-neural-mt-models, translation-models
Where does In-context Translation Happen in Large Language Models: Abstract and Background | Post date: August 30, 2024 | Post author: Computational Technology for All | Post categories: context-masking-experiments, fine-tuning-llms, gpt-models, in-context-learning, large-language-models, machine-translation, supervised-neural-mt-models, translation-models
Where does In-context Translation Happen in Large Language Models: Characterising Redundancy in Layers | Post date: August 30, 2024 | Post author: Computational Technology for All | Post categories: context-masking-experiments, fine-tuning-llms, gpt-models, in-context-learning, large-language-models, machine-translation, supervised-neural-mt-models, translation-models
Where does In-context Translation Happen in Large Language Models: Data and Settings | Post date: August 30, 2024 | Post author: Computational Technology for All | Post categories: context-masking-experiments, fine-tuning-llms, gpt-models, in-context-learning, large-language-models, machine-translation, supervised-neural-mt-models, translation-models
Where does In-context Translation Happen in Large Language Models: Appendix | Post date: August 30, 2024 | Post author: Computational Technology for All | Post categories: context-masking-experiments, fine-tuning-llms, gpt-models, in-context-learning, large-language-models, machine-translation, supervised-neural-mt-models, translation-models
How To Create A Video Summarizer Powered By AI, In 20 Minutes | Post date: May 8, 2023 | Post author: Yeyu Huang | Post categories: ai, chatgpt, in-context-learning, llamaindex, openai