24 Apr 16:30

tchaton

26119e3

Release 0.2.5

What's Changed

Remove condition on torch installation by @tchaton in #110
Bump version 0.2.5 by @tchaton in #111

Full Changelog: v0.2.4...v0.2.5

Contributors

tchaton

Assets 2

24 Apr 16:13

tchaton

v0.2.4

d0d19e6

Release 0.2.4

What's Changed

Update LitGPT references in README.md by @rasbt in #90
Don't raise a runtimeError if the downloader doesn't exist. by @tchaton in #98
Added call to setup function of serializer class to set data format by @vgurev in #96
Fix map() failing to create dataset when input_dir is None by @awaelchli in #100
Streamingdataset torch compatibility by @yhl48 in #108
Move to version 0.2.4 by @tchaton in #109

New Contributors

@rasbt made their first contribution in #90
@vgurev made their first contribution in #96
@awaelchli made their first contribution in #100
@yhl48 made their first contribution in #108

Full Changelog: v0.2.3...v0.2.4

Contributors

awaelchli, rasbt, and 3 other contributors

Assets 2

03 Apr 09:18

tchaton

v0.2.3

ee69581

Release 0.2.3

Full Changelog: v0.2.2...v0.2.3

Assets 2

08 Mar 15:17

tchaton

v0.2.2

c3f2278

Release 0.2.2

Couple of tiny fixes.

Assets 2

05 Mar 09:55

tchaton

v0.2.1

e89b5a2

Release 0.2.1

Release 0.2.1. Minor fixes.

Assets 2

26 Feb 13:33

tchaton

v0.2.0

a05495e

Release 0.2.0

⚡ Welcome to Lightning Data

We developed StreamingDataset to optimize training of large datasets stored on the cloud while prioritizing speed, affordability, and scalability.

Specifically crafted for multi-gpu & multi-node (with DDP, FSDP, etc...), distributed training with large models, it enhances accuracy, performance, and user-friendliness. Now, training efficiently is possible regardless of the data's location. Simply stream in the required data when needed.

The StreamingDataset is compatible with any data type, including images, text, video, audio, geo-spatial, and multimodal data and it is a drop-in replacement for your PyTorch IterableDataset class. For example, it is used by Lit-GPT to pretrain LLMs.

This release marks the first of the release from litdata. From now on, we will track all changes within a CHANGELOG.md file.

Thanks to all contributors.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's Changed

Contributors

What's Changed

New Contributors

Contributors

⚡ Welcome to Lightning Data

Releases: Lightning-AI/litdata

Release 0.2.5

What's Changed

Contributors

Release 0.2.4

What's Changed

New Contributors

Contributors

Release 0.2.3

Release 0.2.2

Release 0.2.1

Release 0.2.0

⚡ Welcome to Lightning Data