Support code generators #610

kubukoz · 2022-02-03T01:45:36Z

Is your feature request related to a problem? Please describe.

Build tools generally provide ways to generate source files before compilation. scala-cli doesn't, it was previously mentioned it potentially could.

Describe the solution you'd like

Some way to run an arbitrary script before every compilation, probably configured in directives or by convention (i.e. putting a script in a ./scala-scripts directory).

The interface would be basically Unit => Unit, maybe with extra context passed as environment variables.

Some possible things I'd like to see passed:

a way to gather all the sources used in the build (e.g. :-separated list of source paths)
the path to a directory where I can output files that'll be included in the compilation. Alternatively I could write to anywhere I want and these paths would be added manually in a directive. Not sure how this would work with cross-compilation.
pwd (or something like a workspace root, in case of bsp? I don't know how exactly that protocol works though)
build metadata (in theory I could read the directives myself, but having this passed would be much better), e.g. deps, resolved scala version, target platform)

Note that this would be triggered on IDE compilations as well.

There should be a way to customize the script beyond the runnable name. Ideas:

Passing args within the directive, e.g. //> build script echo foo bar
Wrapping the script with another, and using the other script in the directive. E.g. script echow runs echo foo bar, and I do //> build script echow

Unfortunately, any kind of scripts would mean that builds can't be shared via e.g. gists. Personally I'm fine with this.

Describe alternatives you've considered

Making my own build tool wrapping scala-cli :)

Additional context

Not much to write here.

The text was updated successfully, but these errors were encountered:

ckipp01 · 2023-04-24T11:12:01Z

Just to tie these together I just had a usecase where having something like this would have been super useful. I wrote about it in here, but to reiterate I essentially needed a BuildInfo that held the version of my app so that it could be displayed to users. Since I use a Makefile for the project I was able to make sure that any command ran a script before running the actual command to compile. This sort of works fine until you get out of the context of the Makefile. For example I just realized now that Scala Steward can no longer run on my project because of this. Having something like the approach outlined by @kubukoz would really help in situations like this.

slabuz · 2023-05-08T11:19:54Z

Hi @kubukoz

After a little cooperation with the scala-cli team, I've come up with a proposal on how to define code generators. You can find it here https://gist.github.com/slabuz/b66432d9c71dd100d193617754c79911. In my proposal, I focused on providing a code generator for protobuf. Let me extend it with a few words of commentary.

Where do generators come from?
The idea is to include some of the popular generators in scala-cli and keep extending the library. In the final version, users will be able to provide new generators locally or from gist.

How are they written?
They can be written using any version of scala, not necessarily the same as the main code version. They can use external dependencies, just like normal scala-cli code.

When do they run?
The code generation step will be an obligatory part of the compilation process, run before it to make sure that all generated sources are in place and up to date. In addition, code generation can be triggered automatically when using code editors such as IntelliJ. For those writing code without such tools, a new step is added, scala-cli generate, as shown in the example.

How is the code structured?
At this point, we have identified 2 main aspects of the generator API. The first is a way for the generator to describe itself, giving the most useful data. In my example it's just a JSON, but in the end there will be a case class definition that every generator will need to instantiate and return. The second part of the API is an actual method to generate source code, given the source file and output location.

We are open to any constructive criticism and suggestions on how to make this solution even better ;)

bishabosha · 2023-05-08T11:43:06Z

Note that bloop also has integrated support for execution of source generators - and it is aware of project dependencies and is cached: scalacenter/bloop#1774, scalacenter/bloop#1819, scalacenter/bloop#1784

tgodzik · 2023-05-08T12:10:55Z

Note that bloop also has integrated support for execution of source generators - and it is aware of project dependencies and is cached: scalacenter/bloop#1774, scalacenter/bloop#1819, scalacenter/bloop#1784

Yes, the plan would be to use that.

kubukoz · 2023-05-09T20:14:10Z

The plan looks great, would love to see it :) let me know once I can try integrating https://github.com/disneystreaming/smithy4s/

przemek-pokrywka · 2023-05-11T15:37:49Z

I think that code generation is clearly behind the ideal scope of scala-cli, because it makes it very difficult to define a clear feature set of the tool. Clear definitions are essential for anyone who comes to learn about stuff. An ocean of idiosyncrasies is the worst thing to confront.

Maybe if the tool allows for a hook, like in Cargo (https://web.mit.edu/rust-lang_v1.25/arch/amd64_ubuntu1404/share/doc/rust/html/cargo/reference/build-scripts.html#build-scripts - the script would need to be Scala to exclude OS differences etc) then the damage could be contained. But, again, what is the new clear definition of the scala-cli?
How do you explain it briefly to newcomers / other-lang-refugees?

przemek-pokrywka · 2023-05-11T15:52:44Z

To add some constructiveness to the criticism above, in my opinion it would be good to make scala-cli a well-behaving component of arbitrary systems larger than itself. I'm thinking of things like Bazel or Nix.

Luka-J9 · 2023-05-16T14:57:34Z

I'd love to see this feature. I would look at how Rust/Bleep designed their solution. Bleep is especially interesting due to the notion that it has some interop with sbt plugins. From a design perspective I also like how the configuration is also handled, as having a one liner with lots of configurations (what I understand the current proposal to be) can end up being cumbersome. However I also understand that file formats like yaml/toml would be a larger departure from how scala-cli currently functions (although it might be worth revisiting in this light?)

I disagree with the notion that this is outside the ideal scope of scala-cli, to me it seems like a natural progression. And wedging it into a larger system like Nix or Bazel raises the barrier to entry for newcomers in an unnecessary way in my opinion.

Currently scala-cli has the concept of exporting to Mill/Sbt for when build requirements become sufficiently complex that scala-cli no longer becomes the appropriate tool. Adding code generation would allow users to stay on scala-cli for longer before needing to resort to such an option. While ejecting into a different build tool is a fine option for those of us who are familiar with sbt or mill, thinking from the perspective of a newcomer I would think it would be frustrating to have to learn a completely different tool to achieve functionality like "I want to generate code from my protobuf" or "I want a access Buildinfo."

He-Pin · 2023-09-08T16:23:06Z

Will it support protobuf code generation?

tgodzik · 2023-09-18T09:23:54Z

Yes, this is the intention and basic feature we want to support

bishabosha · 2024-01-30T17:04:33Z

I would like to propose this as a GSOC project under Scala org, if anyone wants to object

Edit: It is now being worked on by Rizky Maulana @Perklone

przemek-pokrywka · 2024-05-23T18:35:24Z

Hi, seeing the "manual code-gen directive" in action changed my mind as it's much better to support the popular use cases in a standard way rather than forcing users to hack their workarounds.

It would be indeed very helpful to have the ability to depend on code that would be generated in the process of building the script/app.

The main question would be how to implement it in a sound, pragmatic, and ergonomic way. If we tried to formalize @WojciechMazur's proto-directive (naming mine), it might look like this:

//> generate --channel https://disneystreaming.github.io/coursier.json smithy4s generate --dependencies com.disneystreaming.smithy:aws-dynamodb-spec:2023.02.10 -o ./handlers/wildrides

so (provided the code generator exists somewhere as a binary) the main Scala-CLI script could even stay stand-alone / as a gist, easily copy-and-paste-able wherever necessary.

If we wanted the code generation to be sound, I'd propose to treat the generator's output directory as a dependency (writable by the generator only).

There are multiple questions about the interface exposed by the generator to Scala-CLI. How would Scala-CLI know what is the output directory etc?

bishabosha · 2024-07-15T10:02:47Z

as I understand, directives syntax is very limited, it would be nice to support a directive with multiple fields, not just "list of strings, where each string has its own custom dsl", and multiple directives all collapse into the same list of strings.

Edit, investigating this - the base parser itself does collapse repeated directives (e.g. 1 per line) into a single list of strings - so separation isn't possible without some custom logic

Perklone · 2024-07-17T11:59:42Z

This is a concept of supporting source generator that I had in mind, also discussed with @bishabosha and @kannupriyakalra aswell.

The idea is that use the source generator via directives that would look something like this:

//> using sourceGenerator "${.}/source-generator-input|glob:test.in|python3 ${.}/source-generator-1.py"

the format for the directive is

//> using sourceGenerator inputDirectory|glob|commandProcessor

This solves a few points that are addressed in the issue:

Some way to run an arbitrary script before every compilation, probably configured in directives or by convention (i.e. putting a script in a ./scala-scripts directory).

This will be using directives so it will be run before every compilation, but due to the nature of bloop caching mechanism, it will cache if you run an identical command after the first compilation.

a way to gather all the sources used in the build (e.g. :-separated list of source paths)

By using directives, we can use multi-line of those directives to gather all of the sources that is needed to compile what you need.

Let me know what you think, I have created a draft PR where you could try it out. Sorry that it's not fully fleshed out yet, but will be working on improving it 👍

bishabosha · 2024-07-17T12:12:37Z

This will be using directives so it will be run before every compilation, but due to the nature of bloop caching mechanism, it will cache if you run an identical command after the first compilation.

So this seems to be because bloop treats the command as not mutable - so if instead of arbitrary commands, we fix the command to be something that is checked every time - such as a scala-cli command on a scala source file - then that should go away.

But also it might be a design decision that actually source generators should be published in a library, so they have a version number and shouldn't change

Gedochao mentioned this issue May 10, 2023

Support for BuildInfo #2106

Closed

Gedochao added this to the Scala CLI 1.1.0 milestone May 24, 2023

kubukoz mentioned this issue Apr 5, 2024

Allow run with no sources / replacing cs launch #2840

Closed

Perklone mentioned this issue Jul 17, 2024

Add supports for using Source Generator using Directives #3033

Draft

Gedochao removed this from the Scala CLI 1.1.0 milestone Jul 22, 2024

Gedochao added this to Issue Board Jul 22, 2024

github-project-automation bot moved this to To do in Issue Board Jul 22, 2024

Gedochao moved this from To do to In progress in Issue Board Jul 22, 2024

Gedochao assigned bishabosha and Perklone Jul 22, 2024

Gedochao added the enhancement New feature or request label Jul 29, 2024

bishabosha mentioned this issue Jul 29, 2024

Source generator config should be able to track a script file as an input scalacenter/bloop#2386

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support code generators #610

Support code generators #610

kubukoz commented Feb 3, 2022

ckipp01 commented Apr 24, 2023

slabuz commented May 8, 2023

bishabosha commented May 8, 2023

tgodzik commented May 8, 2023

kubukoz commented May 9, 2023

przemek-pokrywka commented May 11, 2023

przemek-pokrywka commented May 11, 2023

Luka-J9 commented May 16, 2023

He-Pin commented Sep 8, 2023

tgodzik commented Sep 18, 2023

bishabosha commented Jan 30, 2024 •

edited

Loading

przemek-pokrywka commented May 23, 2024

bishabosha commented Jul 15, 2024 •

edited

Loading

Perklone commented Jul 17, 2024

bishabosha commented Jul 17, 2024 •

edited

Loading

Support code generators #610

Support code generators #610

Comments

kubukoz commented Feb 3, 2022

ckipp01 commented Apr 24, 2023

slabuz commented May 8, 2023

bishabosha commented May 8, 2023

tgodzik commented May 8, 2023

kubukoz commented May 9, 2023

przemek-pokrywka commented May 11, 2023

przemek-pokrywka commented May 11, 2023

Luka-J9 commented May 16, 2023

He-Pin commented Sep 8, 2023

tgodzik commented Sep 18, 2023

bishabosha commented Jan 30, 2024 • edited Loading

przemek-pokrywka commented May 23, 2024

bishabosha commented Jul 15, 2024 • edited Loading

Perklone commented Jul 17, 2024

bishabosha commented Jul 17, 2024 • edited Loading

bishabosha commented Jan 30, 2024 •

edited

Loading

bishabosha commented Jul 15, 2024 •

edited

Loading

bishabosha commented Jul 17, 2024 •

edited

Loading