GitHub Reverses Course: Will Train AI Models on User Data Starting April 24

Available in: 中文
2026-03-29T19:55:15.604Z·1 min read
GitHub will begin using customer interaction data — including inputs, outputs, code snippets, and context — to train its AI models starting April 24, marking a dramatic reversal of its previous pri...

GitHub will begin using customer interaction data — including inputs, outputs, code snippets, and context — to train its AI models starting April 24, marking a dramatic reversal of its previous privacy commitments.

The Policy Change

Who's Affected

What Data GitHub Collects

The Privacy Implications

Private Repositories Are No Longer Truly Private

The policy FAQ explicitly states: "If a Copilot user has their settings set to enable model training on their interaction data, code snippets from private repositories can be collected and used for model training while the user is actively engaged with Copilot while working in that repository."

This means private repos are "GitHub private*" — the asterisk denoting that GitHub's definition of "private" has limits.

Community Reaction

Context

GitHub cites similar policies at Anthropic, JetBrains, and parent Microsoft as justification. Chief Product Officer Mario Rodriguez claims using Microsoft employee data led to "meaningful improvements" including higher suggestion acceptance rates.

Source: The Register

↗ Original source · 2026-03-29T00:00:00.000Z
← Previous: Major Academic Conference Catches Illicit AI Use: Hundreds of Papers RejectedNext: Anthropic Implements Peak-Hour Throttling for Claude to Manage Growing Demand →
Comments0