Text generation inference

Generates text from a prompt using a language model. Sampling and output length can be controlled with parameters such as temperature, top_p, max_new_tokens, and repetition_penalty.
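
To make the parameters concrete, here is a minimal sketch of how they are conventionally applied during sampling. It assumes the common meanings of each parameter (temperature scaling of logits, nucleus filtering for top_p, a divide-or-multiply rule for repetition_penalty, and a hard cap for max_new_tokens); the source does not confirm that this service implements them exactly this way. The function names and the fixed-logit toy_logits "model" are hypothetical and exist only for illustration.

```python
import math
import random


def apply_repetition_penalty(logits, generated_ids, penalty):
    """Make already-seen tokens less likely (assumed divide/multiply rule)."""
    out = list(logits)
    for tok in set(generated_ids):
        # Positive logits are divided and negative ones multiplied, so the
        # token's score always moves down when penalty > 1.
        out[tok] = out[tok] / penalty if out[tok] > 0 else out[tok] * penalty
    return out


def softmax(logits, temperature):
    """Turn logits into probabilities; temperature must be > 0."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_p_filter(probs, top_p):
    """Keep the smallest set of tokens whose cumulative probability >= top_p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}  # renormalized distribution


def generate(next_logits, prompt_ids, max_new_tokens=5, temperature=1.0,
             top_p=1.0, repetition_penalty=1.0, seed=0):
    """Sample up to max_new_tokens token ids after the prompt."""
    rng = random.Random(seed)
    ids = list(prompt_ids)
    for _ in range(max_new_tokens):
        logits = apply_repetition_penalty(next_logits(ids), ids,
                                          repetition_penalty)
        dist = top_p_filter(softmax(logits, temperature), top_p)
        tokens = list(dist)
        weights = [dist[t] for t in tokens]
        ids.append(rng.choices(tokens, weights=weights)[0])
    return ids[len(prompt_ids):]


def toy_logits(ids):
    """Hypothetical stand-in for a model: same logits at every step."""
    return [2.0, 1.0, 0.5, -1.0]


if __name__ == "__main__":
    print(generate(toy_logits, [0], max_new_tokens=8, temperature=0.7,
                   top_p=0.9, repetition_penalty=1.2))
```

In this sketch, a lower temperature or a smaller top_p concentrates sampling on the highest-scoring tokens, a repetition_penalty above 1 lowers the score of tokens that have already appeared, and max_new_tokens bounds how many tokens are produced after the prompt.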