@csb @adam "the company believes that about 7% of users will now hit session limits they wouldn’t have before"
-
Based on how some people describe their use case, it seems they are probably trying to curb the extreme power users who open multiple agents at once in multi-hour Ralph loops. That 7% tracks with the share of *abusive* users a service like that would probably encounter.
That being said, I expect we will see price increases in the future. Hefty ones. Drug dealer model.
-
Dave, I disagree that prices for cloud LLM usage will rise, because of competition from China and because of techniques like TurboQuant: https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression/
In the meantime, we all need to keep an eye on the URL that shows our current usage.
-
@csb @adam TurboQuant is just math. Now your local model has 4X the context window, but the cloud models will too. Yes, local models will become more capable, but the cloud models will improve one-for-one, so I don't think it's going to change the equation at all on the larger business side of things.
Companies like Anthropic are bleeding cash. They will have to raise prices. I don't see how higher token compression changes that.
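(For anyone curious where a "4X" figure like that comes from: it's just the precision ratio. A minimal back-of-envelope sketch, not TurboQuant itself, with hypothetical model dimensions — quantizing a KV cache from 16-bit to 4-bit cuts its memory by 4x, so the same memory budget holds roughly 4x the tokens.)

```python
# Back-of-envelope quantization arithmetic (illustrative sketch only;
# the model dimensions below are hypothetical placeholders).

def kv_cache_bytes(seq_len, layers, kv_heads, head_dim, bits):
    # K and V tensors (factor of 2) per layer, one entry per token.
    return 2 * layers * kv_heads * head_dim * seq_len * bits // 8

fp16 = kv_cache_bytes(8192, layers=32, kv_heads=8, head_dim=128, bits=16)
int4 = kv_cache_bytes(8192, layers=32, kv_heads=8, head_dim=128, bits=4)
print(fp16 // int4)  # 4 -> same memory holds ~4x the context
```

The catch, as noted above, is that this ratio applies to everyone: the same trick shrinks cloud serving costs too, so relative economics don't move.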
