Astria Fine-tuning API - Overview
The API uses account balance just like the web interface. See the pricing page for more details.
The API allows creating a mock
Tune object with attribute
branch=fast. See Create a tune documentation]. API call for creating
fast tune is free, and subsequent prompt calls will return mock images without incurring any charges. This is useful for testing your integration with the API.
Once you're ready for production, you can purchase credits and adjust the quantity on the checkout page to how much you need.
Auto top off
To allow your account to work without disruption, top-off feature can be enabled to refill account each time account balance reaches zero. Auto top-off can be enabled in the billing page. It is recommended to set the top-off amount to at least a week worth of API calls to avoid hitting the rate limit on top-off - Frequent top-offs can cause credit card charges to fail or bank declines.
Generated images, training pictures and models, will be automatically deleted 30 days after the training has ended. You may delete the fine-tune object including the trained model, training images, and generated images at any time before using the delete API calls. You may opt-in to automatically extend model storage
The API uses bearer
Authorization headers scheme. API calls should include the header:
Authorization: Bearer sd_XXXXXX
Find your API key on the API settings under your account settings.
Astria API is built as a REST API. Main resources are
prompt and each one has all the CRUD operations. Prompts are usually used as a nested resource on top of tunes (i.e:
Pay attention to 504 error below and turn on idempotency for your account to avoid duplicate objects.
422 - Validation error - Log these errors and use an exception tracking system such as Rollbar or Sentry to get notified. Do not retry these requests.
500 - Internal server error - Such requests should be retried with exponential backoff and wait start time of 30 seconds.
504 - Gateway timeout - This error indicates that the request processing took more than the 30 seconds which is set as the max request timeout. In most cases the request would probably have been processed and you should avoid retrying it. In order to get the response see Idempotency section
Idempotency is a principle in programming which means that the same operation (a request in our case) can be made multiple times without changing the result. This can be useful when a request is interrupted and you want to make sure it is processed only once, and avoid duplicate objects.
Idempotency can be enabled for the account in the API settings.
For tunes request with the same
title, idempotency will return the existing tune object and will not create a new one. This is useful for cases where you want to make sure you don't create duplicate tunes. In this case set the tune title to a unique value such as a UUID which identifies the transaction.
For prompts request with the same attributes, idempotency will return the existing prompt object and will not create a new one. Prompt attributes considered for the idempotency are
text, tune_id, cfg_scale, steps, seed, callback, negative_prompt, super_resolution, face_correct, ar, num_images, controlnet, denoising_strength, use_lpw, controlnet_conditioning_scale, w, h, hires_fix, scheduler, controlnet_txt2img, inpaint_faces