Skip to content

Conversation

@rayruizhiliao
Copy link
Contributor

@rayruizhiliao rayruizhiliao commented Feb 11, 2026

New tool: _execute_browser_task

  1. Calls POST {VECTORLY_API_BASE}/buagent/execute with the same X-Service-Token auth pattern used by the existing routine tools
  2. Accepts task (natural language), timeout_seconds (default 300, clamped to 30-1800), and use_vision (default True)
  3. HTTP timeout is set to timeout_seconds + 30 so the HTTP call doesn't expire before the agent does
  4. Returns a normalized result dict with success, final_result, errors, n_steps, etc.

Updated system prompt

  1. Changed workflow step 4: if no routines match, fall back to execute_browser_task
  2. Added rule: always prefer routines (faster, cheaper, more reliable) — browser agent is a fallback only
  3. Removed the "tell the user no routines were found" rule since the agent can now act on it

TODO

  • Client side implementation for SSE

@rayruizhiliao rayruizhiliao changed the title connect bb agent and browser anet Connect bluebox agent and browser agent Feb 11, 2026
@rayruizhiliao rayruizhiliao changed the base branch from main to agent-to-use-semantic-search February 11, 2026 22:52
@dimavrem22 dimavrem22 force-pushed the agent-to-use-semantic-search branch from cfefe46 to ea6aa46 Compare February 11, 2026 23:38
@rayruizhiliao rayruizhiliao marked this pull request as ready for review February 12, 2026 02:12
Comment on lines 308 to 309
timeout_seconds: int = 300,
use_vision: bool = True,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the agent prob doesnt need to know about these. lets remove them from params and hardcode in the endpoint

Copy link
Contributor Author

@rayruizhiliao rayruizhiliao Feb 12, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hardcoded in commit 11d8f90


try:
with requests.post(
f"{Config.VECTORLY_API_BASE}/buagent/execute/stream",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lets rename this endpoint to browser_agent instead of buagent

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

renamed in commit 6ae51ea

continue

try:
data = json.loads(line[6:])
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we have data models that mirror those in servers for this? so that we can use their fields and not .get(...) when parsing them below

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added data models in commit c5478f3

Progress is streamed in real time via SSE.

Args:
task: Detailed natural language instruction for the browser agent. Be specific and step-by-step.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lets add an example instruction to convey the level of detail we need. either here or in system prompt

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we do it in the upcoming the bluebox "finalization" PR? because i don't yet have the big picture of the bluebox agent, and how it would pass information from the initial user conversations -> routine search -> converted task description? Also i think we need to play with the browser agent for a bit to give good examples. Added a TODO in commit 3ef2ed5

@rayruizhiliao rayruizhiliao merged commit 7cb110f into agent-to-use-semantic-search Feb 12, 2026
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants