Meta LayerSkip Llama3.2 1B: Achieving Fast LLM Inference with Self-Speculative Decoding locally Post date October 31, 2024 Post author By Md Monsur ali Post categories In early-exit, layerskip, llm, meta, self-speculative