-
Notifications
You must be signed in to change notification settings - Fork 26
Connect bluebox agent and browser agent #160
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Connect bluebox agent and browser agent #160
Conversation
cfefe46 to
ea6aa46
Compare
bluebox/agents/bluebox_agent.py
Outdated
| timeout_seconds: int = 300, | ||
| use_vision: bool = True, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the agent prob doesnt need to know about these. lets remove them from params and hardcode in the endpoint
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hardcoded in commit 11d8f90
bluebox/agents/bluebox_agent.py
Outdated
|
|
||
| try: | ||
| with requests.post( | ||
| f"{Config.VECTORLY_API_BASE}/buagent/execute/stream", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lets rename this endpoint to browser_agent instead of buagent
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
renamed in commit 6ae51ea
| continue | ||
|
|
||
| try: | ||
| data = json.loads(line[6:]) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can we have data models that mirror those in servers for this? so that we can use their fields and not .get(...) when parsing them below
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
added data models in commit c5478f3
| Progress is streamed in real time via SSE. | ||
|
|
||
| Args: | ||
| task: Detailed natural language instruction for the browser agent. Be specific and step-by-step. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lets add an example instruction to convey the level of detail we need. either here or in system prompt
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can we do it in the upcoming the bluebox "finalization" PR? because i don't yet have the big picture of the bluebox agent, and how it would pass information from the initial user conversations -> routine search -> converted task description? Also i think we need to play with the browser agent for a bit to give good examples. Added a TODO in commit 3ef2ed5
New tool:
_execute_browser_taskPOST {VECTORLY_API_BASE}/buagent/executewith the sameX-Service-Tokenauth pattern used by the existing routine toolstask(natural language), timeout_seconds (default 300, clamped to 30-1800), anduse_vision(default True)timeout_seconds + 30so the HTTP call doesn't expire before the agent doessuccess,final_result,errors,n_steps, etc.Updated system prompt
execute_browser_taskTODO