-
Notifications
You must be signed in to change notification settings - Fork 395
Ensure processing is better for automatic LLM-based curinator.md to issue conversion #1893
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: gh-pages
Are you sure you want to change the base?
Conversation
|
Running this as a test after manually running curatinator today...
because the output (from this post) looks like this It's not clear to me why the code fence is there, but it should probably be stripped prior to attempting
|
|
It also misclassified this post as not R related - I can see why it had trouble with it, but misclassifications add work rather than reduce it. |
|
That's interesting. I recently switched over to a Claude sonnet model for our use case. I'll experiment with different models and with changing our system prompt. Maybe I need to do a better job teaching it how to decide the post category. Right now, I'm just using our wiki page material which can be confusing. We might even come up with maybe an eval dataset based on a previous issue or multiple issues to measure accuracy. Also, maybe the 1000 character limit isn't enough sometimes to determine if a post is R related. |
|
I just performed a run to kick off my issue curation for 2025-W40, and my experience was quite similar to @jonocarroll . My observations:
While I am very supportive of finding clever ways to reduce the manual effort in the process of curating an issue, it's pretty clear that more testing is needed to find the optimal combo of a prompt and appropriate LLM to make this practical for the rest of the team. But this is an excellent starting point. |
|
Sorry, for the delay. Finally had some free time today to revisit this. Improved the system prompt, addressed the json code chunks issue, and switched over to a Quarto report format. You'll see I'm now using a decision tree sort of format that Claude recommended to me for the system prompt. Increased the character limit to 1,500 too. The result seems to be much better now: process_curinator_report.html @rpodcast, I'd love to know how the report compares to your grouping for this upcoming issue. We'll probably need to continue tuning the system prompt to meet our needs. It's getting better though. |
Changes made to rweekly scripts folder for curation team use.
This pull request updates the
scripts/process_curinator.Rscript to enhance its data processing pipeline for extracting and summarizing R-related markdown links. The main improvements include reorganizing the order of library imports, expanding the set of libraries used, and significantly refactoring the data wrangling logic to produce grouped, formatted summaries.Dependency management:
ellmerafterdplyrand added new dependencies:tidyrandglue, to support improved data manipulation and string formatting.Data processing and summarization:
result_rawwith a newjson_metadatacolumn.glue.Limitations