본문 바로가기

코딩, 유튜브,shorts

코드 라마(2023.11.10)

2023.11.10

 

Introducing Code Llama, a state-of-the-art large language model for coding (meta.com)

 

Takeaways


  • Code Llama is a state-of-the-art LLM capable of generating code, and natural language about code, from both code and natural language prompts.
  • Code Llama is free for research and commercial use.
  • Code Llama is built on top of Llama 2 and is available in three models:
    • Code Llama, the foundational code model;
    • Codel Llama - Python specialized for Python;
    • and Code Llama - Instruct, which is fine-tuned for understanding natural language instructions.
  • In our own benchmark testing, Code Llama outperformed state-of-the-art publicly available LLMs on code tasks

Code Llama는 코드와 자연어 프롬프트 모두에서 코드와 코드에 대한 자연어를 생성할 수 있는 최첨단 LLM입니다.
Code Llama는 연구 및 상업적 용도로 무료입니다.


Code Llama는 Llama 2를 기반으로 구축되었으며 세 가지 모델로 제공됩니다.
1)기본 코드 모델인 Code Llama
 2)Codel Llama - Python:      Python에 특화된 입니다.
3) Code Llama - Instruct는 자연어 지침을 이해하기 위해 미세 조정되었습니다.


자체 벤치마크 테스트에서 Code Llama는 코드 작업에 대해 공개적으로 사용 가능한 최첨단 LLM보다 성능이 

뛰어났습니다.

 

Today, we are releasing Code Llama, a large language model (LLM) that can use text prompts to generate code.

오늘 우리는 텍스트 프롬프트를 사용하여 코드를 생성할 수 있는 LLM(대형 언어 모델)인 Code Llama를 출시합니다.

 

Code Llama is state-of-the-art for publicly available LLMs on code tasks, and has the potential to make workflows faster and more efficient for current developers and lower the barrier to entry for people who are learning to code.

 

Code Llama has the potential to be used as a productivity and educational tool to help programmers write more robust, well-documented software.

 

Code Llama는 코드 작업에 대해 공개적으로 사용 가능한 LLM을 위한 최신 기술로, 현재 개발자의 워크플로를 더 빠르고 효율적으로 만들고 코드를 배우는 사람들의 진입 장벽을 낮출 수 있는 잠재력을 가지고 있습니다.

 

Code Llama는 프로그래머가 보다 강력하고 잘 문서화된 소프트웨어를 작성하는 데 도움이 되는 생산성 및 교육 도구로

사용될 수 있는 잠재력을 가지고 있습니다.

 

The generative AI space is evolving rapidly, and we believe an open approach to today’s AI is the best one for developing new AI tools that are innovative, safe, and responsible. We are releasing Code Llama under the same community license as Llama 2.

생성적 AI 공간은 빠르게 진화하고 있으며, 우리는 오늘날의 AI에 대한 개방형 접근 방식이 혁신적이고 안전하며 책임감 있는 새로운 AI 도구를 개발하는 데 가장 적합한 접근 방식이라고 믿습니다. 
우리는 Llama 2와 동일한 커뮤니티 라이센스에 따라 Code Llama를 출시하고 있습니다.

 

 

How Code Llama works 코드 라마 작동 방식

Code Llama is a code-specialized version of Llama 2 that was created by further training Llama 2 on its code-specific datasets, sampling more data from that same dataset for longer. Essentially, Code Llama features enhanced coding capabilities, built on top of Llama 2. It can generate code, and natural language about code, from both code and natural language prompts (e.g., “Write me a function that outputs the fibonacci sequence.”) It can also be used for code completion and debugging. It supports many of the most popular languages being used today, including Python, C++, Java, PHP, Typescript (Javascript), C#, and Bash.


Code Llama는 Llama 2의 코드 전문 버전으로, 코드별 데이터 세트에 대해 Llama 2를 추가로 훈련하고 동일한 데이터 세트에서 더 많은 데이터를 더 오랫동안 샘플링하여 생성되었습니다. 기본적으로 Code Llama는 Llama 2를 기반으로 구축된 향상된 코딩 기능을 제공합니다. 코드와 자연어 프롬프트 모두에서 코드와 코드에 대한 자연어를 생성할 수 있습니다(예: "피보나치 수열을 출력하는 함수를 작성해 주세요."). 코드 완성 및 디버깅에도 사용할 수 있습니다. Python, C++, Java, PHP, Typescript(Javascript), C# 및 Bash를 포함하여 오늘날 사용되는 가장 널리 사용되는 많은 언어를 지원합니다.

 

We are releasing three sizes of Code Llama with 7B, 13B, and 34B parameters respectively. Each of these models is trained with 500B tokens of code and code-related data. The 7B and 13B base and instruct models have also been trained with fill-in-the-middle (FIM) capability, allowing them to insert code into existing code, meaning they can support tasks like code completion right out of the box.

 

 

The three models address different serving and latency requirements. The 7B model, for example, can be served on a single GPU. The 34B model returns the best results and allows for better coding assistance, but the smaller 7B and 13B models are faster and more suitable for tasks that require low latency, like real-time code completion.

 

 

 

1. License Rights and Redistribution.

a. Grant of Rights. You are granted a non-exclusive, worldwide, non-transferable and royalty-free limited license under Meta’s intellectual property or other rights owned by Meta embodied in the Llama Materials to use, reproduce, distribute, copy, create derivative works of, and make modifications to the Llama Materials.

b. Redistribution and Use.

i. If you distribute or make the Llama Materials, or any derivative works thereof, available to a third party, you shall provide a copy of this Agreement to such third party.

ii. If you receive Llama Materials, or any derivative works thereof, from a Licensee as part of an integrated end user product, then Section 2 of this Agreement will not apply to you.

iii. You must retain in all copies of the Llama Materials that you distribute the following attribution notice within a “Notice” text file distributed as a part of such copies: “Llama 2 is licensed under the LLAMA 2 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved.”

 

 

1. 라이센스 권리 및 재배포.

ㅏ. 권리 부여. 귀하는 Llama 자료를 사용, 재생산, 배포, 복사하고 파생물을 생성하고 제작할 수 있도록 Meta의 지적 재산권 또는 Meta가 소유한 Llama 자료에 포함된 기타 권리에 따라 비독점적이고 전 세계적이며 양도 불가능하고 로열티가 없는 제한된 라이센스를 부여받습니다. 라마 재료 수정.

비. 재배포 및 사용.

나. 귀하가 Llama 자료 또는 그 파생물을 제3자에게 배포하거나 제공하는 경우, 귀하는 해당 제3자에게 본 계약의 사본을 제공해야 합니다.

ii. 통합 최종 사용자 제품의 일부로 라이센스 사용자로부터 Llama 자료 또는 그 파생물을 받는 경우 본 계약의 섹션 2가 귀하에게 적용되지 않습니다.

iii. 귀하는 배포하는 Llama 자료의 모든 사본에 해당 사본의 일부로 배포되는 "공지" 텍스트 파일 내에 다음 저작자 표시를 유지해야 합니다. "Llama 2는 LLAMA 2 커뮤니티 라이센스에 따라 라이센스가 부여됩니다. Copyright © Meta Platforms, Inc . 판권 소유."

'코딩, 유튜브,shorts' 카테고리의 다른 글

TTS(Text-to-Speech)/무료 폰트  (0) 2024.02.18
구글의 gemini(제미나이)(2023.12.14)  (0) 2023.12.14
netplwiz  (0) 2023.10.06
원드라이브 오류(2023.10.04)  (0) 2023.10.04
bigcode  (0) 2023.05.12