GPT-OSS 2T API - Documentation

🎉 API is Live! Start making requests immediately — no API key required.

📡 Base URL

https://gpt.yzz.me/api

⚡ Quick Start

JavaScript (Fetch API)

const response = await fetch('https://gpt.yzz.me/api', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    messages: [{ role: 'user', content: 'Hello!' }],
    stream: false,
    temperature: 0.8
  })
});

const data = await response.json();
console.log(data.choices[0].message.content);

Python

import requests

response = requests.post(
    'https://gpt.yzz.me/api',
    json={
        'messages': [{'role': 'user', 'content': 'Hello!'}],
        'stream': False,
        'temperature': 0.8
    }
)

data = response.json()
print(data['choices'][0]['message']['content'])

cURL

curl -X POST https://gpt.yzz.me/api \
  -H "Content-Type: application/json" \
  -d '{"messages":[{"role":"user","content":"Hello!"}],"stream":false}'

📝 Request Parameters

Parameter	Type	Required	Description
`messages`	array	✅ Yes	Array of message objects
`stream`	boolean	❌ No	Enable streaming (default: true)
`temperature`	number	❌ No	Sampling temperature 0.0-2.0 (default: 0.8)
`max_tokens`	number	❌ No	Maximum tokens (default: -1)
`top_p`	number	❌ No	Nucleus sampling (default: 0.9)

🔒 Rate Limits

⚠️ Per-IP Rate Limiting: Max 2 requests/second, 500ms minimum delay

🧪 Test the API

Health Check: GET https://gpt.yzz.me/api

Interactive Test: Test Page

🛠️ Technical Details

Infrastructure:

Fully server-side PHP application
Automatic retry with exponential backoff (3 attempts)
Per-IP rate limiting and request queuing
Server-Sent Events (SSE) streaming
System prompt auto-injection