Skip to content

Latest commit

 

History

History
140 lines (99 loc) · 5.94 KB

index.rst

File metadata and controls

140 lines (99 loc) · 5.94 KB

Welcome to AWS Neuron

AWS Neuron is the software development kit (SDK) used to run deep learning and generative AI workloads on AWS Inferentia and AWS Trainium powered Amazon EC2 instances (Amazon EC2 Inf1, Inf2, Trn1 and Trn2 instances). It includes a compiler, runtime, training and inference libraries, and profiling tools. Neuron supports customers in their end-to-end ML development lifecycle including building and deploying deep learning and AI models.

For more information about the latest AWS Neuron release, see :ref:`latest-neuron-release` and check the :ref:`announcements-main` page.

For list of AWS Neuron model samples and tutorials on Amazon EC2 Inf1, Inf2, Trn1, and Trn2 instances, see :ref:`model_samples_tutorials`.

.. card:: Get Started with Neuron
      :link: neuron-quickstart
      :link-type: ref


.. card:: Neuron Quick Links
      :link: docs-quick-links
      :link-type: ref


.. toctree::
   :hidden:
   :maxdepth: 1
   :caption: Overview

   Quick Links </general/quick-start/docs-quicklinks>
   Ask Q Developer </general/amazonq-getstarted>
   Get Started with Neuron </general/quick-start/index>
   Samples and Tutorials </general/models/index>
   Performance </general/benchmarks/index>
   What’s new </release-notes/index>
   Announcements </general/announcements/index>

.. toctree::
   :hidden:
   :maxdepth: 1
   :caption: ML frameworks

   PyTorch Neuron </frameworks/torch/index>
   JAX Neuron </frameworks/jax/index>
   TensorFlow Neuron </frameworks/tensorflow/index>

.. toctree::
   :hidden:
   :maxdepth: 1
   :caption: NeuronX Distributed (NxD)

   NxD Training (Beta) </libraries/nxd-training/index>
   NxD Inference (Beta) </libraries/nxd-inference/index>
   NxD Core </libraries/neuronx-distributed/index>

.. toctree::
   :hidden:
   :maxdepth: 1
   :caption: Additional ML Libraries

   Third Party libraries </libraries/third-party-libraries/third-party-libraries>
   Transformers Neuron </libraries/transformers-neuronx/index>
   AWS Neuron reference for NeMo Megatron </libraries/nemo-megatron/index>

.. toctree::
   :hidden:
   :maxdepth: 1
   :caption: Developer flows

   Neuron DLAMI </dlami/index>
   Neuron containers </containers/index>
   AWS Workload orchestration </general/devflows/index>
   Amazon SageMaker </general/devflows/sagemaker-flows>
   Third-party solutions <general/devflows/third-party-solutions>
   Setup guide </general/setup/index>

.. toctree::
   :hidden:
   :maxdepth: 1
   :caption: Runtime & Tools

   Neuron runtime </neuron-runtime/index>
   Monitoring Tools </general/monitoring-tools>
   Profiling Tools </general/profiling-tools>
   Third-party solutions <tools/third-party-solutions>
   Other Tools </general/other-tools>

.. toctree::
   :hidden:
   :maxdepth: 1
   :caption: Compiler

   Neuron compiler </compiler/index>
   Neuron Kernel Interface (Beta) <general/nki/index>
   Neuron C++ Custom Operators </neuron-customops/index>

.. toctree::
   :hidden:
   :maxdepth: 1
   :caption: Learning Neuron

   Architecture </general/arch/index>
   Features </general/arch/neuron-features/index>
   Application notes </general/appnotes/index>
   FAQ </general/faq>
   Troubleshooting </general/troubleshooting>
   AWS Neuron glossary </general/arch/glossary>

.. toctree::
   :hidden:
   :maxdepth: 1
   :caption: Legacy Software

   Apache MXNet </frameworks/mxnet-neuron/index>


.. toctree::
   :hidden:
   :maxdepth: 1
   :caption: About Neuron

   Release details </release-notes/release>
   Roadmap </general/roadmap-readme>
   Support </general/support>