Skip to content

Commit

Permalink
Add RK4 method and sentence splitting for longform generation.
Browse files Browse the repository at this point in the history
  • Loading branch information
lucasnewman committed Dec 1, 2024
1 parent bb672ab commit 26d1a8c
Show file tree
Hide file tree
Showing 4 changed files with 331 additions and 145 deletions.
10 changes: 5 additions & 5 deletions f5_tts_mlx/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -40,9 +40,9 @@ Provide a caption for the reference audio.

`--output`

string, default: "output.wav"
string, default: None

Specify the output path where the generated audio will be saved. If not specified, the script will save the output to a default location.
Specify the output path where the generated audio will be saved. If not specified, audio will play as it's generated.

`--cfg`

Expand All @@ -52,13 +52,13 @@ Specifies the strength used for classifier free guidance

`--method`

str, default: "euler"
str, default: "rk4"

Specify the sampling method for the ODE. Options are "euler" and "midpoint".
Specify the sampling method for the ODE. Options are "euler", "midpoint", and "rk4".

`--steps`

int, default: 32
int, default: 8

Specify the number of steps used to sample the neural ODE. Lower steps trade off quality for latency.

Expand Down
Loading

0 comments on commit 26d1a8c

Please sign in to comment.