diff --git a/docs/interference.md b/docs/interference.md
index b140f66a..1b4f0c11 100644
--- a/docs/interference.md
+++ b/docs/interference.md
@@ -54,7 +54,7 @@ Send the POST request to /v1/chat/completions with body containing the `model` m
 import requests
 url = "http://localhost:1337/v1/chat/completions"
 body = {
-    "model": "gpt-3.5-turbo-16k",
+    "model": "gpt-3.5-turbo",
     "stream": False,
     "messages": [
         {"role": "assistant", "content": "What can you do?"}
@@ -66,4 +66,4 @@ for choice in json_response:
     print(choice.get('message', {}).get('content', ''))
 ```
 
-[Return to Home](/)
\ No newline at end of file
+[Return to Home](/)