admin管理员组

文章数量:1345134

Suppose we have the following scenario, for a quiz application that uses the OpenAI realtime API:

  • using the realtime API, a browser client is connected via WebRTC
  • a function tool named get_next_question is used by the agent to retrieve the next quiz question to ask, together with its correct answer and explanation
  • when the agent wants to retrieve the next question, it requests a function tool call the client reacts by sending an API request to a server (e.g. the same from which it retrieves the ephemeral API key)

Now, the server can’t simply answer to the client with the question and solution, as that would leak the data. Instead, what I would like to be able to do is the server would receive the session_id and call_id in the payload from the client, and it would send the output directly to the OpenAI API.

Is this possible?

This example from the docs shows a client sending a conversation item with the function call output using the data channel, but I haven’t been able to figure out whether there’s a way to simply send the item using the conversation API.

For example, I would like to do something like this:

// client
dc.addEventListener("message", (e: MessageEvent) => {
        // Realtime server events appear here!
        const data = JSON.parse(e.data);
        const type = data.type;
        console.log(data);
        switch (type) {
            case "response.function_call_arguments.done":
                const functionName = data.name;
                const callId = data.call_id;
                const functionArgs = JSON.parse(data.arguments);
                handleFunctionCall({ functionName, functionArgs, callId, dc });
        // ...
}

const handleFunctionCall = (args) => {
    // use axios to send a request to a server endpoint, passing all the arguments
}
# server
def receive_function_call(session_id, call_id, *args):
    # ... business logic ...
    # somehow send a conversation item to the session containing the function call output

本文标签: openai apiSend function call output from server in WebRTC connectionStack Overflow