記事にした動機
チームでcopilot chatの使用するにあたってソースが学習に使用されないかを確認する必要があったので公式ドキュメントでの記載を探したのですが,意外と見つからなかったので記事にしました。
お役に立てば幸いです。
個人用リポジトリの場合
allow or prevent GitHub using your data, select or deselect Allow GitHub to use my code snippets from the code editor for product improvements.
やりかた
https://github.com/settings/copilot
自分のCopilotの設定でAllow GitHub to use my code snippets from the code editor for product improvements *
のチェックを外します。
Copilot for businessを使用している場合
What personal data is used by GitHub Copilot for Business and how? Copilot for Business collects three kinds of personal data: user engagement data, prompts, and suggestions. User engagement data is information about events that are generated when iterating with a code editor. A prompt is a compilation of IDE code and relevant context (IDE comments and code in open files) that the GitHub Copilot extension sends to the AI model to generate suggestions. A suggestion is one or more lines of proposed code and other output returned to the GitHub Copilot extension after a prompt is received and processed by the GitHub Copilot model.
Copilot for Business uses the source code in your IDE only to generate a suggestion. It also performs several scans to identify and remove certain information within a prompt. Prompts are only transmitted to the AI model to generate suggestions in real-time and are deleted once the suggestions are generated. Copilot for Business also does not use your code to train the Azure OpenAI model. GitHub Copilot for Individual users, however, can opt in and explicitly provide consent for their code to be used as training data. User engagement data is used to improve the performance of the Copilot Service; specifically, it’s used to fine-tune ranking, sort algorithms, and craft prompts.
個人用リポジトリとの違い
- copilotに適用するポリシーをorganaizationsメンバーにまとめて適用できる
-
デフォルトでコードが学習に使用されない
ので特に設定の必要はない。