Textbooks Are All You Need: Examples for Section 5

In this study, researchers from Microsoft introduce phi-1, a new large language model for code, with significantly smaller size than competing models.


This content originally appeared on HackerNoon and was authored by Knapsack

:::info Authors:

(1) Suriya Gunasekar, Microsoft Research;

(2) Yi Zhang, Microsoft Research;

(3) Jyoti Aneja, Microsoft Research;

(4) Caio C´esar Teodoro Mendes, Microsoft Research;

(5) Allie Del Giorno, Microsoft Research;

(6) Sivakanth Gopi, Microsoft Research;

(7) Mojan Javaheripi, Microsoft Research;

(8) Piero Kauffmann, Microsoft Research;

(9) Gustavo de Rosa, Microsoft Research;

(10) Olli Saarikivi, Microsoft Research;

(11) Adil Salim, Microsoft Research;

(12) Shital Shah, Microsoft Research;

(13) Harkirat Singh Behl, Microsoft Research;

(14) Xin Wang, Microsoft Research;

(15) S´ebastien Bubeck, Microsoft Research;

(16) Ronen Eldan, Microsoft Research;

(17) Adam Tauman Kalai, Microsoft Research;

(18) Yin Tat Lee, Microsoft Research;

(19) Yuanzhi Li, Microsoft Research.

:::

C Examples for Section 5

In this section, we provide example pairs of codes captured with different AST match rates. Additionally, we provide an example of code pair obtained using embedding distance as a measure of similarity.

\ AST match rate = 1.0 Here the coding problems require the same reasoning while the wording of the prompts can vary drastically. Particularly, the prompt uses a real-world event, i.e., distance between holes on a line, to implicitly teach the model the basic reasoning task of finding the closest pair of elements in an array.

\

\ AST match rate = 0.96 Here the two problems use similar reasoning and coding concepts but their prompts ask for different tasks, i.e., returning a pair of numbers versus computing their average.

\

\ AST match rate ≤ 0.9 When the AST match rate ≤ 0.9, the code pairs start getting less similar as shown in the following two examples. Here, the AST match rate is 0.9 and 0.83, respectively.

\

\ Embedding Distance = 0.16 Here the two problems have similar Python Docstrings, function names, as well as the code structure which can be extracted with using the L2 distance between the normalized CodeGen-Mono 350M embedding for each of them.

\

\

:::info This paper is available on arxiv under CC BY 4.0 DEED license.

:::

\


This content originally appeared on HackerNoon and was authored by Knapsack


Print Share Comment Cite Upload Translate Updates
APA

Knapsack | Sciencx (2024-09-12T13:00:19+00:00) Textbooks Are All You Need: Examples for Section 5. Retrieved from https://www.scien.cx/2024/09/12/textbooks-are-all-you-need-examples-for-section-5/

MLA
" » Textbooks Are All You Need: Examples for Section 5." Knapsack | Sciencx - Thursday September 12, 2024, https://www.scien.cx/2024/09/12/textbooks-are-all-you-need-examples-for-section-5/
HARVARD
Knapsack | Sciencx Thursday September 12, 2024 » Textbooks Are All You Need: Examples for Section 5., viewed ,<https://www.scien.cx/2024/09/12/textbooks-are-all-you-need-examples-for-section-5/>
VANCOUVER
Knapsack | Sciencx - » Textbooks Are All You Need: Examples for Section 5. [Internet]. [Accessed ]. Available from: https://www.scien.cx/2024/09/12/textbooks-are-all-you-need-examples-for-section-5/
CHICAGO
" » Textbooks Are All You Need: Examples for Section 5." Knapsack | Sciencx - Accessed . https://www.scien.cx/2024/09/12/textbooks-are-all-you-need-examples-for-section-5/
IEEE
" » Textbooks Are All You Need: Examples for Section 5." Knapsack | Sciencx [Online]. Available: https://www.scien.cx/2024/09/12/textbooks-are-all-you-need-examples-for-section-5/. [Accessed: ]
rf:citation
» Textbooks Are All You Need: Examples for Section 5 | Knapsack | Sciencx | https://www.scien.cx/2024/09/12/textbooks-are-all-you-need-examples-for-section-5/ |

Please log in to upload a file.




There are no updates yet.
Click the Upload button above to add an update.

You must be logged in to translate posts. Please log in or register.