Ajanlar ve Orkestrasyon Orta

MAX_THINKING_TOKENS Ayarı

Öngörülebilir CI maliyetleri için genişletilmiş düşünme token bütçesini sınırlayın

Komut

$ MAX_THINKING_TOKENS=8000 "color:#7C5CFC">claude -p "Refactor auth.py" "color:#d97757">--output-format json

Yanıt

{
  "result": "Refactored auth.py...",
  "usage": { "output_tokens": 8000 },
  "total_cost_usd": 0.65
}

Ayrıştırma Kodu

059669">">// Extended thinking at Opus pricing ($75/M output):
059669">">// 10K tokens = $0.75 per response just 059669">">for thinking
059669">">// 20K tokens = $1.50 per response
059669">">// Cap with MAX_THINKING_TOKENS 059669">">for predictable costs
059669">">//
059669">">// Combine: --effort high + MAX_THINKING_TOKENS=8000
// = full reasoning within a cost ceiling

Tuzaklar

! Genişletilmiş düşünme, yanıt başına 10K+ token tüketebilir (Opus fiyatlarıyla $0.75+)

! Maksimum kontrol için --effort ile birleştirin: --effort kaliteyi ayarlar, MAX_THINKING_TOKENS maliyeti sınırlar

İlgili Tarifler

Çaba Seviyeleri Karşılaştırması Çağrı Başına Maliyet Takibi