返回 Skill 列表
extension
分类: 内容与媒体无需 API Key

scienceworld-room-scanner

此技能执行一个'环顾四周'的动作,以扫描并描述当前房间内的内容,包括可见的物体、容器和门。当进入新房间或代理需要定位特定物品或评估环境状态时,应触发此技能。该技能输出详细的房间描述,这对于库存发现和情景意识至关重要。

person作者: jakexiaohubgithub

Skill: Room Scanner

Purpose

Execute a look around action to obtain a comprehensive description of the current room in the ScienceWorld environment. This description is the foundational step for any task requiring item location, environmental assessment, or navigation planning.

Core Instruction

When this skill is invoked, the agent must perform the look around action.

Trigger Conditions

Invoke this skill when:

  1. You first enter a new room via teleport or other movement.
  2. You need to locate a specific object or container mentioned in your task.
  3. The state of the room may have changed (e.g., after an interaction).
  4. You are formulating a plan and require an inventory of available resources.

Output Processing

The observation from look around will contain:

  • Room Name: The identifier of your current location.
  • Visible Objects & Agents: A list of all entities in the room.
  • Container Contents: For open containers, a nested list of items inside (e.g., a bowl (containing a red apple, a banana)).
  • Device States: The status of interactive objects (e.g., a stove, which is turned off).
  • Connections: All accessible doors and their destination rooms.

You must parse this output carefully. Use it to update your mental model of the environment before proceeding with other actions like pick up, examine, or use.

Integration Notes

  • This is a low-level, atomic skill. It should often be the first action in a sequence.
  • The observation it generates is critical context for subsequent decision-making. Refer back to it.
  • Do not overuse it. Once you have a recent description of a room, rely on that knowledge until you have reason to believe the state has changed.