Kubermates
  • Articles
  • Events
  • Docs
  • Releases
  • Contribute
  • Docs
    • SUSE and Tigera: Empowering Secure, Scalable Kubernetes with Calico Enterprise
    • How to Connect Nested KubeVirt Clusters with Calico and BGP Peering
    • KubeCon + CloudNativeCon North America 2025 Co-Located Event Deep Dive: Kubernetes on Edge Day
    • Classifying human-AI agent interaction
    • Evolving our ServiceNow integration: Sunsetting the Notification Service for more capable alternatives
    • Friday Five — October 3, 2025
    • Introducing Red Hat OpenStack VMware migration toolkit
    • VCF Breakroom Chats Episode 61: From VI Admin to Cloud Hero – Building a Multi-Tenant Cloud with VCF 9.0
    • Announcing cost-efficient storage with Network file storage, cold storage, and usage-based backups
    • Announcing per-sec billing, new Droplet plans, BYOIP, and NAT gateway preview to reduce scaling costs
    • Introducing DigitalOcean Organizations, a new and comprehensive account layer
    • Storage that thinks for itself: Introducing Storage autoscaling, the newest feature for Managed Databases
    • Build Smarter Agents with Image Generation, Auto-Indexing, VPC Security, and new AI Tools on DigitalOcean Gradient™ AI Platform
    • How Red Hat can support your journey to a standard operating environment
    • Red Hat and Sylva unify the future for telco cloud
    • Red Hat Learning Subscription: Expert chat for premium and standard users
    • Security update: Incident related to Red Hat Consulting GitLab instance
    • Kyverno vs Kubernetes Policies: How It Complements and Completes
    • VMware Cloud Foundation Automation -Infrastructure Resource Policy Overview
    • Fluentd to Fluent Bit: A Migration Guide
    • Llama Stack and the case for an open “run-anywhere” contract for agents
    • Red Hat Summit 2026 call for proposals is now open
    • Searching for the 2026 Red Hat Certified Professional of the Year
    • VCF Operations Management Packs: Announcing End of General Support
    • Set Your Implementation Up for Success with VCF Jumpstart Workshop
    • 🏆 How I Passed the Certified Argo Project Associate (CAPA) Exam — And Why It Was Worth It
    • Ireland’s next steps for effective AI delivery
    • Optimizing application architectures for AI: From monoliths to intelligent agents (2 of 2 blogs series)
    • What you don’t see could cost you: Why open source matters in enterprise AI
    • VCF Breakroom Chats Episode 60: Infrastructure Modernization, Health, and APIs for Private Cloud
    • Certifications in DevOps: Which Are Worth Your Time in 2025?
    • Diagnostics for VMware Cloud Foundation Operations – Newest Findings
    • Empowering Platform Engineers with native Kubernetes Multi-Cluster Management in VMware Cloud Foundation
    • KubeCon + CloudNativeCon North America 2025 Co-Located Event Deep Dive: BackstageCon
    • Hacktoberfest 2025: How to Participate
    • Bridging the gap: Secure virtual and container workloads with Red Hat OpenShift and Palo Alto Networks
    • Managing Red Hat Device Edge: Tools and strategies
    • Migrating to Red Hat OpenShift Virtualization with NetApp FlexPod
    • Red Hat Device Edge: Decision framework
    • Simplify virtualization deployments with the Assisted Installer
    • Announcing H1 2026 KCDs
    • VCF Breakroom Chats Episode 61: From VI Admin to Cloud Hero – Building a Multi-Tenant Cloud with VCF 9.0
    • KubeCon + CloudNativeCon North America 2025 Co-Located Event Deep Dive: Kubeflow Summit
    • Friday Five — September 26, 2025
    • Vodafone revolutionizes telco cloud with OpenShift, validated patterns, and GitOps
    • Autonomous Testing of etcd’s Robustness
    • Sydney Sovereign Cloud Day 2025: Where Compliance Meets Innovation
    • How to Upgrade to VMware Cloud Foundation 9.0
    • Announcing Changed Block Tracking API support (alpha)
    • From Chaos to Control: Achieving Network Policy Nirvana with Kyverno
    • Maximize your OpenShift investment: 6 reasons to upgrade to OpenShift Platform Plus
    • More than meets the eye: Behind the scenes of Red Hat Enterprise Linux 10 (Part 2)
    • Red Hat and O-RAN Alliance accelerating cloud adoption at the Edge
    • The flight plan for AI: How we’re building a culture of innovation at Turkish Technology
    • Kubernetes Observability: Your Q&A Guide to Calico Whisker
    • CNCF’s Helm Project Remains Fully Open Source and Unaffected by Recent Vendor Deprecations
    • Local Roots, Global Reach: CNCJ Reflects on KubeCon + CloudNativeCon Japan 2025
    • DxEnterprise operator for high availability now certified for RHEL 9.6
    • New Red Hat Ansible Certified Content Collections for HashiCorp Terraform and HashiCorp Vault
    • The evolution of Red Hat Ansible Lightspeed
    • Securing Your Infrastructure as Code: The Power of Nirmata and HashiCorp Terraform
    • Solving Kubernetes Multi-tenancy Challenges with vCluster
    • Analyst Insight Series: Virtualization Virtue #2: Stronger Cloud Security and Fault Tolerance
    • The FinOps Journey: From Visibility to Business Value
    • AI and Red Hat: Powering the future of cable providers
    • Building an adaptable enterprise: A guide to AI readiness
    • Introducing Headlamp Plugin for Karpenter - Scaling and Visibility
    • Kubernetes v1.34: Pod Level Resources Graduated to Beta
    • Build faster, debug smarter, and make AI safer with new DigitalOcean Gradient™ AI Platform features
    • KubeCon + CloudNativeCon North America 2024 Co-Located Event Deep Dive: CiliumCon
    • Blog: Spotlight on the Kubernetes Steering Committee
    • Worldpay's Platform as a Product: Revolutionizing development with Red Hat OpenShift
    • KubeCon + CloudNativeCon India 2025: A Transformative Experience in Hyderabad
    • Kubernetes v1.34: Recovery From Volume Expansion Failure (GA)
    • First VMmark Result Published Using VMware Cloud Foundation 9.0
    • Hacktoberfest 2025: Celebrate All Things Open Source!
    • Top Kubernetes (K8s) Troubleshooting Techniques – Part 2
    • KubeCon + CloudNativeCon North America 2024 Co-Located Event Deep Dive: Observability Day
    • Accelerate AI inference with vLLM
    • Friday Five — September 19, 2025
    • Top 10 must-reads: Open source innovation at Red Hat
    • Kubernetes v1.34: DRA Consumable Capacity
    • VCF Breakroom Chats Episode 57: Behind the Code – A Journey from Customer Pain to VCF 9.0
    • CNCF Expands Infrastructure Support for Project Maintainers Through Partnership with Docker
    • 10 VMware Cloud Foundation 9.0 Enhancements: Simplifying Your Day 2 Operations
    • Implementing granular failover in multi-Region Amazon EKS
    • Capacity Management: The IT Balancing Act You Can’t Ignore
    • Red Hat Customer Portal recognized by the Association of Support Professionals as one the Best Support Websites of 2025
    • Reducing bias in AI models through open source
    • Kubernetes v1.34: Pods Report DRA Resource Health
    • CNCF Welcomes 20 New Silver Members Reflecting Broader Cloud Native and AI Adoption
    • Use Raspberry Pi 5 as Amazon EKS Hybrid Nodes for edge workloads
    • Calico Whisker vs. Traditional Observability: Why Context Matters in Kubernetes Networking
    • Kubernetes v1.34: Moving Volume Group Snapshots to v1beta2
    • Best DevOps Courses in 2025: Learning Paths to Boost Your Career
    • Deploy Distributed LLM Inference with GPUDirect RDMA over InfiniBand in VMware Private AI
    • Distributed performance testing for Kubernetes environments: Grafana k6 Operator 1.0 is here
    • Fedora 43 Beta now available
    • Unlocking AI innovation: GPU-as-a-Service with Red Hat
    • Use the RHEL command-line assistant offline with this new developer preview
View page source Edit this page Create child page Create documentation issue
Tags
  • Announcement1
  • Automation2
  • Aws16
  • Azure13
  • Cicd1
  • Cloud28
  • Cloud-Foundation41
  • Cloudnative2
  • Cncf202
  • Community52
  • Containers2
  • Devops5
  • Docker1
  • Eks46
  • Event173
  • Falco1
  • Finops3
  • Github2
  • Githubactions3
  • Gitops1
  • Gke44
  • Grafana11
  • Jenkins-X50
  • K3s25
  • Karmada1
  • Kodekloud17
  • Kubeflow11
  • Kubernetes410
  • Networking4
  • Nirmata14
  • Openshift23
  • Opensource7
  • Productivity1
  • Programming1
  • Rancher50
  • Rke12
  • Rke226
  • Rss10
  • Security17
  • Servicemesh1
  • Terraform1
  • Terragrunt1
  • Tigera23
  • Trivy1
  • Vmware41
  1. Docs
  2. Amazon EKS enables ultra scale AI/ML workloads with support for 100K nodes per cluster

Amazon EKS enables ultra scale AI/ML workloads with support for 100K nodes per cluster

Link
2025-07-16 ~1 min read aws.amazon.com #eks #aws

Open the original post ↗ https://aws.amazon.com/blogs/containers/amazon-eks-enables-ultra-scale-ai-ml-workloads-with-support-for-100k-nodes-per-cluster/

Open Original ↗
Share on X LinkedIn

© 2025 Kubermates powered by Kapheira and Hamdi KHELIL.