I’m currently deploying a serverless API on Runpod and running into an unexpected latency spike.
The setup is simple: a script runs on a serverless endpoint with concurrency set to 32 workers. When I send a batch of 8 parallel requests (repeated 4 times, for 32 requests total), the first batch averages around 0.5 seconds of latency, as expected. Subsequent batches, however, jump to around 5 seconds and stay there. I’m unsure what causes this behavior.
I've attached the sample inference/ping code. If you have experience with similar issues, please share your hypothesis on the cause and how you would approach solving it.
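Since the attached code isn't shown here, a minimal sketch of the measurement pattern described above (4 sequential batches of 8 parallel requests, averaging latency per batch) might look like the following. The `fn` callable is a placeholder for the actual Runpod endpoint call; the function and parameter names are illustrative, not from the attachment.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def timed_call(fn):
    """Run one request and return its wall-clock latency in seconds."""
    start = time.perf_counter()
    fn()
    return time.perf_counter() - start


def batch_latencies(fn, batch_size=8, num_batches=4):
    """Fire `batch_size` parallel requests, `num_batches` times in sequence,
    and return the average latency of each batch."""
    averages = []
    with ThreadPoolExecutor(max_workers=batch_size) as pool:
        for _ in range(num_batches):
            latencies = list(pool.map(timed_call, [fn] * batch_size))
            averages.append(sum(latencies) / len(latencies))
    return averages
```

In practice `fn` would wrap an HTTP POST to the serverless endpoint (e.g. via `requests`); comparing the per-batch averages returned here should reproduce the 0.5 s → 5 s jump described above.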
Location: Anywhere
Posted: Oct. 8, 2024, 7:53 p.m.