Replies: 28 comments 28 replies
-
This is a nice clear example of how another project is handling it (HT @geakstr for pointing this out)
-
Credit and legal implications 🧵
The current approach in the Cilium project is that we rely heavily on the DCO and trust contributors to read that document and sign off that they are following its steps. Parts (a) and (b) seem quite difficult to honestly certify without either creating the contribution yourself or building on work from a reputable website like GitHub that clearly denotes the appropriate licensing. Part (c) is of course a delegation to someone else who certified one of (a) or (b). Do we know of any LLM tools that limit their training data to licenses compatible with the project? Is there an audit trail of the source material to provide a reasonable basis to state that the work is covered by an appropriate open source license?
-
Yes. One possible approach: Include a checkbox in the PR template with language that reflects a default assumption that someone is using LLM tooling. Ensure that contributors must take explicit action to certify that an LLM was not used.
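For illustration, here is a minimal sketch of how such a checkbox might look (the file path, wording, and tool names are assumptions for the sake of example, not proposed policy text):

```markdown
<!-- Hypothetical snippet for .github/PULL_REQUEST_TEMPLATE.md -->
<!-- Default assumption: AI/LLM tooling may have been used to produce this change. -->
<!-- Check the box below only if that is NOT the case. -->

- [ ] I certify that no AI/LLM tooling (e.g. GitHub Copilot, ChatGPT, or similar)
      was used to generate any part of the code or text in this pull request.
```

Leaving the box unchecked would then be the explicit signal that reviewers should apply whatever AI-usage guidance the project adopts.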
-
One idea is to require high context before reviewing a PR. That is to say, reviewers should do an initial two-minute triage of the motivation for the PR before reviewing the work. If the context is insufficient, request context and move on. In cilium/cilium we often go through this pattern anyway as a way to ease reviewer workload, since we already face similar problems with sharing understanding between contributors and reviewers.
-
I don't have a full answer here, but I have some thoughts. Your question highlights in particular a category of contributors: those who are unaware of the norms, expectations, and policies of the project. On that note, I think it would help if the project had a clear policy stating what is acceptable behavior with respect to the use of AI/LLM tooling, so that we can take action on an objective basis as situations arise.
As a broader discussion point, we should not assume any level of trust as a baseline from new contributors. I'll pull in references to the University of Minnesota debacle with the Linux kernel (TechCrunch coverage) and the xz-utils exploit (Ars Technica coverage) here as examples of misplaced trust in the open source world. These are not AI/LLM examples, but they provide some broader context. We could consider taking inspiration from how those projects handled these incidents.
-
Using LLMs to summarize code in order to make it easier to start working on Cilium 🧵
Someone shared with me a workflow where they interactively analyzed logic in the Cilium tree using some modern LLM-based tooling. This was not for drafting content changes for the project, but rather to inquire about how something works in Cilium and to locate the relevant code. For newer contributors who are not familiar with the Cilium architecture, how to navigate the codebase, or how to answer questions they have about how Cilium operates, it seems like this may provide a way to "rubber duck" and explore the codebase more easily.
-
I personally use Copilot regularly and I find it very helpful. I do think it is important to clarify how I use it, which is essentially as a better form of autocomplete (which I realize is the most basic way to use LLMs). It helps me type out verbose and predictable parts of my code such as function signatures, error handling, and similar additions to code that I want to write anyway; instead of typing it out manually I just accept the suggestion. I feel that the way I use tools like this does not affect the quality of my code, since I always end up with what I had in mind. The same goes for copyright concerns: the output is similar to, if not exactly the same as, what I would have typed manually, just faster. LLMs are a tool which, like any other, can be used for good or evil; it all depends on the user. I personally feel that outright banning LLMs is not the way to go. I would rather hold people accountable for all code they submit, no matter how they made it.
-
This is my suggestion for a policy: To maintain the high quality and trustworthiness of contributions to the Cilium project, we provide the following guidance on the use of large language models (LLMs), such as GitHub Copilot, ChatGPT, or similar tools:
Acceptable Use
We recognize that LLMs can be helpful tools, for example:
These uses are acceptable provided the contributor:
Unacceptable Use
It is not acceptable to:
Transparency & Attribution
We do not require contributors to declare when LLMs were used for basic productivity support. However, if LLMs were used to generate non-trivial text or code (e.g., new features, documentation pages, complex design proposals), contributors are encouraged to note that LLMs were used and explain the human review process applied.
DCO and Licensing
The Developer Certificate of Origin (DCO) still applies. AI tools often lack transparency into their training data and may produce copyrighted output. If you submit AI-assisted code, you are certifying that it does not infringe on any licenses and that you have the right to contribute it under the project's license. If you're unsure, don't submit it.
Guiding Principle
Cilium is a community-driven project that values expertise, clear communication, and personal responsibility. Whether or not an LLM was involved, contributors are fully accountable for the correctness, security, and clarity of their contributions.
-
Contributors must take responsibility for the full content they submit in a PR 🧵
The first and last points are a bit tricky here: I agree to some degree with the sentiment, but I think it's not a very practical position to take. For the sake of discussion, let's take LLM tooling out of the picture and just consider contributions more generally. It is somewhat common that someone does not fully understand the codebase or their own contribution, and they may not be able to fully explain or justify the code. Part of the point of the review process is to identify gaps where the contributor may lack context or may not fully understand the project, and to have a discussion about the abstractions, the shape of the problem, possible solutions, and so on. Although I think it would be ideal if everyone had a full picture of what changes need to be made and could explain the contribution all the way, this is a high bar to set. We need to recognize that there will be contributors with different experience levels, and we'll need to meet those contributors where they're at. Ideally that interaction will develop the contributions to the point that they generally improve the project and do not create significant problems, and, while we're at it, provide a positive experience for the contributors to help motivate them to continue to contribute in a healthy way.
The most practical response to my feedback above would be to loosen the language a little bit, to set the expectation that when you submit a change, you are trying to improve something in the project to the benefit of the community, and that you will honestly engage in the review process to the best of your ability. Something I would be happy to see written down as unacceptable is using LLMs as part of review discussion.
We've already seen people violate this without even knowing. We can state that this is the policy, but I think we have to recognize that if we set the expectation that LLM usage is generally OK, there will be submissions in this category that everyone will miss. Maybe if we're lucky that will not cause an incident, but I have very little interest in having to deal with such an incident because we set a policy that is too liberal. I guess to summarize: I agree with this sentence, but I think the only practical way to enforce it is to ban non-trivial use of LLMs (probably anything more than a line or two here or there).
-
Contributors should not use LLMs to engage in pull request discussion 🧵 via @xmulligan
I'd like to share another experience I've had in the Cilium community recently. A new contributor submitted a pull request to fix a "good first issue". When I reviewed the submission, I asked a specific technical question and provided a reference link to what I was talking about so that the contributor could understand the basis of my question. I was curious to learn about the technical solution, but I also wanted to check whether the contributor had experienced the issue they were fixing, and whether they were doing a drive-by contribution or whether there was potential to mentor them into becoming a regular contributor.
The contributor responded to my question with some text that was clearly generated by an LLM. The response did not make sense, and it did not match the communication style that the contributor used elsewhere in the PR. I asked the contributor whether they had used an LLM to generate the answer, as the response was very clearly not written by them. The contributor responded in their own voice that they had not used an LLM. That is to say, they directly lied in response to my question. We currently do not have a policy regarding LLM usage, so if the contributor had acknowledged the usage and engaged in discussion directly then I think we might have been able to navigate the pull request process. However, I think it is now a matter for the code of conduct.
I feel that there is a trend recently where developers try to generate a good-looking CV/resume by flooding OSS projects with low-quality submissions. These contributors attempt to get involved in the project not for the benefit of the community but only for their own enrichment. As we are starting to see abuse of the Cilium community of this kind, I would like to see clear guidance in place that highlights that such behavior is unacceptable and provides guidance to the maintainers and committers about how to respond to such incidents. I recognize that this incident was not caused by the use of an LLM, but LLM tooling lowers the cost of abusive behavior in OSS communities. From my perspective, if the project has a blanket rejection of content created using LLM tools, then it is crystal clear when such submissions cross the line, and I could point to the policy and close the PR.
-
Are we trying to define the "Turing Test"? I would be in favor of accepting anything regardless of its origin, as long as the intent of the PR is clear, there is no low-effort content (typos etc.), and there is no huge 'automatic' refactoring.
-
I honestly don't think this one is too bad either: https://github.com/agentgateway/agentgateway/blob/main/CODE_OF_CONDUCT.md
-
Additional project proposals around AI usage:
-
Airflow has an AGENTS.md: https://github.com/apache/airflow/blob/main/AGENTS.md "Somehow I have a feeling that the quality of the contributing docs is nowadays super important - precisely because the AI agents that people use actually read those instructions - as opposed to many humans who often did not read instructions before contributing."
-
I also quite like this one from OTel: https://github.com/open-telemetry/community/blob/main/policies/genai.md
-
Seems like Harbor has an https://github.com/goharbor/harbor/pulls?q=is%3Apr+author%3AArvindhworks+is%3Aclosed
-
The git Summer of Code AI guidelines are pretty well written and clearly outline many of the aspects of the problem space: https://git.github.io/General-Application-Information/
-
I quite like the way that Red Hat frames the issues here: https://www.redhat.com/en/blog/ai-assisted-development-and-open-source-navigating-legal-issues
-
Saw this elsewhere and I agree with the sentiment: "Our big concern right now is AI-led contributions, not the AI-assisted ones. We have tons of people who see our issues and think they can just delegate 99% of the work to the AI and arrive at a good result."
-
I kind of like this line: "Human review is required for all code that is generated or assisted by a large language model."
-
I like how Zulip focuses on the why rather than the how, and keeps it succinct: https://github.com/zulip/zulip/blob/main/CONTRIBUTING.md#ai-use-policy-and-guidelines
-
Python splits it clearly into "acceptable uses", "unacceptable uses", and "considerations for success":
-
In light of security incidents like camoleak, I think we should strongly discourage committers from using generative AI tools. The lack of determinism in the tooling, combined with classic security bugs, presents a risk that is greater than the benefits of the tooling. If committers take responsibility for those risks, they must have at least one secondary deterministic control to limit the risk. For instance, when granting GitHub API access, create a token with read-only permissions only on the required repositories. I note that while this would mitigate problems associated with write tokens, it could still leak the content of private repositories through read access. It would also be very easy to misconfigure this and grant too much access, so this should not be taken lightly.
If the tooling doesn't provide secondary controls, the tooling must not be used. Any actions undertaken by generative tooling must be reviewed by the user of those tools (for instance, if generating text, the contributor must review the content). While the camoleak exploit has been patched by GitHub, I don't currently see the ability to configure secondary controls in GitHub Copilot today, so I think we should avoid all Copilot use through the GitHub website.
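As a purely illustrative sketch of the kind of deterministic secondary control described above, assuming (hypothetically) that AI-assisted tooling were ever invoked from CI, the token handed to that job could be scoped to read-only permissions:

```yaml
# Hypothetical GitHub Actions workflow: the GITHUB_TOKEN granted to this job
# is limited to read-only scopes, so a misbehaving or prompt-injected tool
# cannot push commits, modify pull requests, or touch other repositories.
name: ai-assisted-review   # illustrative name, not an existing workflow
on: pull_request

permissions:
  contents: read        # read repository contents only
  pull-requests: read   # no write access to pull requests

jobs:
  review:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      # invoke the (hypothetical) AI tooling here, passing only the
      # restricted GITHUB_TOKEN from this job
```

The same principle applies to personal access tokens handed to locally run AI tooling: fine-grained tokens restricted to read-only access on specific repositories limit the blast radius if the tool is manipulated, though as noted above they can still expose the contents of anything they are able to read.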
-
Overall, I like a lot of the sentiment in this write-up.
-
Could be an interesting talk here too
-
Relevant issue: cilium/cilium#43656
-
Not a bad take either: ghostty-org/ghostty#10412
-
Seems like we are starting to coalesce on some themes. I'll ask for this to be added to the Cilium Dev Summit agenda so we can discuss it there. Maybe we can come up with a policy we can all agree on later this year.


-
We currently have no written policy around the use of generative AI in code contributions. We may want to establish a policy that, for example, requires contributors to disclose when AI tooling was used, holds AI-assisted submissions to the same standard of originality, quality, and attribution as any other contribution, and encourages contributors to learn, understand, and reason through the code they're submitting, rather than deferring entirely to the tooling.
AI tooling in general can be a useful accelerant, but if a contributor cannot explain the code they're submitting, or isn't able to reason through feedback, that's a red flag.
A few things we might want to consider when developing such a policy:
How do we ensure contributors appropriately credit prior work, whether human- or AI-generated?
Should we require contributors to disclose if AI tools were used? If so, how do we define acceptable usage?
How do we handle low-context, potentially AI-generated PRs that add to the review burden and may reflect limited understanding of the codebase?
How do we handle cases where trust is broken through copy-paste or AI-mediated submissions, particularly when the contributor appears early in their OSS journey?