Azure Kubernetes Service#

This guide offers recommended configurations and settings unique to Azure Kubernetes Service (AKS). These should be used to augment the generic requirements offered on our primary requirements page.

Instance types#

  • Minimum: Standard_D8s_v4

  • Recommended: Standard_D16s_v4

Storage#

Unfortunately, we have found that Azure’s built-in, managed Network File System (NFS) service, Azure Files NFS, does not provide an acceptable performance level for use with Data Science & AI Workbench. We have not yet had the opportunity to evaluate Azure NetApp Files.

For this reason, Anaconda recommends creating a separate Virtual Machine for hosting NFS storage. Follow the recommendations offered in this document, with these Azure-specific recommendations:

  • The Standard_D4s_v3 machine type is suitable for this purpose.

  • Azure tightly couples disk size and IOPS performance. Anaconda recommends a minimum disk size of 1 TiB to ensure good performance.

Note

This server can be the administration server as well.

Network#

Azure offers two different networking options for AKS clusters. Both approaches are compatible with Workbench, so the determination depends upon your larger networking needs.

Note

AKS uses a LoadBalancer which by default sets TCP idle timeouts to 4 minutes, and enables TCP resets. This may affect any user workload that depends on a continuous TCP connection.

GPUs#

Please see this Azure guide for adding GPU resources to your AKS cluster.