AI infrastructure
Enterprises investing in deep learning platforms need AI infrastructure sufficient enough to synthesize a massive amount of data. Find the information you need to make decisions about AI-specific compute architectures -- from GPU-packed servers to highly scalable clustered computing systems built for big data and machine learning applications.
Top Stories
-
Tip
12 Aug 2024
10 benefits of containers for AI workloads
IT organizations face a daunting onslaught of new AI projects. Containers offer key benefits to ease the burden throughout an AI workload's lifecycle. Continue Reading
-
News
05 Aug 2024
AI inference startup Groq raises $640M
The startup develops language processing units and sells them as a service. While the vendor must deal with market giant Nvidia, it has an opportunity to stay competitive. Continue Reading
-
News
05 Aug 2024
AI inference startup Groq raises $640M
The startup develops language processing units and sells them as a service. While the vendor must deal with market giant Nvidia, it has an opportunity to stay competitive. Continue Reading
-
Video
05 Aug 2024
An explanation of generative design
Generative design transforms the creation process across many fields and excels in generating unbiased, efficient designs. Continue Reading
-
News
02 Aug 2024
Intel's weak position in AI chip market leads to mass layoff
Intel's failure to profit from the red-hot AI market is behind plans to cut 15,000 jobs. The workforce reduction is part of a $10 billion cut in capital expenses in 2025. Continue Reading
-
News
30 Jul 2024
Nvidia targets metaverse with OpenUSD NIM microservices
The vendor introduces new generative AI models that will be available as microservices. The new models show that the metaverse is still a part of Nvidia's strategy. Continue Reading
-
Feature
30 Jul 2024
What is regression in machine learning?
Regression in machine learning helps organizations forecast and make better decisions by revealing the relationships between variables. Learn how it's applied across industries. Continue Reading
-
Tutorial
29 Jul 2024
Natural language programming using GPTScript
GPTScript enables programmers to use natural language syntax and tap into OpenAI when building apps. Here's a basic GPTScript tutorial with examples for beginners. Continue Reading
-
News
25 Jul 2024
Noom CEO on the need for AI in wellness and health
CEO Geoff Cook discusses the benefits of AI in the wellness industry, the need for humans and the possibility of AI technology in remedying the doctor shortage. Continue Reading
-
News
24 Jul 2024
Google intros Mistral Codestral as a service on Vertex AI
The cloud provider differentiates its GenAI stance by offering the code-generating LLM as a service on the Vertex AI Model Garden, along with Mistral Large 2. Continue Reading
-
News
23 Jul 2024
Meta intros its biggest open source AI model: Llama 3.1 405B
The model is the biggest open source model yet, the tech giant claims. The social media company also upgraded its model context window to 128k and updated its AI assistant. Continue Reading
-
News
19 Jul 2024
OpenAI goes small with GPT-4o mini
The AI vendor launched a small language model that is priced at 15 cents per million input tokens. It also introduced new compliance tools for ChatGPT Enterprise users. Continue Reading
-
News
19 Jul 2024
Enterprises chasing AI confront a harsh reality
Recent reports from KPMG, McKinsey & Co. and Goldman Sachs indicate that generative AI is immature and carries a high price tag, with no clear path to ROI. Continue Reading
-
News
17 Jul 2024
Google launches Distributed Cloud edge hardware
The hardware for highly regulated industries runs the Google Cloud infrastructure stack, data security services and the Vertex AI platform for running pretrained AI models. Continue Reading
-
Feature
17 Jul 2024
25 machine learning interview questions with answers
Aspiring machine learning job candidates should be fluent in many aspects of machine learning, from statistical theory and programming concepts to general industry knowledge. Continue Reading
-
Podcast
15 Jul 2024
AWS GenAI strategy based on multimodel ecosystem plus Q
While offering its own large language models and AI chips, the tech giant gives customers access to other vendors' generative models, including open models. Continue Reading
-
Tip
12 Jul 2024
How to prevent deepfakes in the era of generative AI
Businesses must be ever vigilant in detecting the increasingly sophisticated nuances of deepfakes by applying security techniques that range from the simple to the complex. Continue Reading
-
Tip
11 Jul 2024
10 popular libraries to use for machine learning projects
Machine learning libraries expedite the development process by providing optimized algorithms, prebuilt models and other support. Learn about 10 widely used ML libraries. Continue Reading
-
News
11 Jul 2024
Moving GenAI from proof of concept to production on AWS
A prevailing critique of generative AI tools is that they can't be put into practice. However, some organizations now use vendor tools to produce capabilities for their clients. Continue Reading
-
News
10 Jul 2024
AWS intros GenAI app studio, updates Amazon Q and Bedrock
The cloud provider introduced new features that help enterprises create applications faster, while still applying responsible practices. It also unveiled new educational resources. Continue Reading
-
Video
10 Jul 2024
An explanation of AI model collapse
Generative AI creates content quickly and accurately but faces the risk of model collapse. Continue Reading
-
Video
09 Jul 2024
An explanation of foundation models
The core of every generative AI chatbot -- such as ChatGPT, Bard and YouChat -- is the foundation model. Continue Reading
-
Tip
09 Jul 2024
Learn how to create a machine learning pipeline
Well-considered machine learning pipelines provide a structured approach to AI development in modern IT environments, ensuring uniformity, speed and business alignment. Continue Reading
-
Feature
08 Jul 2024
Attributes of open vs. closed AI explained
What's the difference between open vs. closed AI, and why are these approaches sparking heated debate? Here's a look at their respective benefits and limitations. Continue Reading
-
Tip
08 Jul 2024
Generative models: VAEs, GANs, diffusion, transformers, NeRFs
Choosing the right GenAI model for the task requires understanding the techniques each uses and their specific talents. Learn about VAEs, GANs, diffusion, transformers and NerFs. Continue Reading
-
News
01 Jul 2024
SK Hynix's $75B investment in AI chips shows a growing trend
The South Korea-based chipmaker responds to the growing demand for AI memory chips with a big investment through 2028. Meanwhile, memory chipmakers step up production. Continue Reading
-
Opinion
01 Jul 2024
It's time to rethink enterprise storage for the AI era
Pure's platform-centric storage strategy will enable customers to scale their storage infrastructure to keep pace with the fast-evolving AI landscape. Continue Reading
-
Tip
28 Jun 2024
AI-focused storage choices, features and considerations
The list of GenAI-focused storage options grows as Pure, Dell, HPE and other major vendors innovate to win over IT infrastructure buyers. Continue Reading
-
News
27 Jun 2024
Google targets GenAI accuracy, speed, size, efficiency
The tech giant advances again in the generative AI competition that has rocked the tech industry over the last two years, with key updates to its Gemini and Imagen models. Continue Reading
-
Opinion
26 Jun 2024
Artificial intelligence, Nvidia took center stage at HPE Discover 2024
With many organizations developing or evaluating generative AI initiatives, HPE increased its commitment to the space through a broad partnership with Nvidia. Continue Reading
-
Feature
24 Jun 2024
Strategies for simplifying network complexity
Experts at the Cisco Live 2024 conference discussed the future of AI in networks and how its use can help simplify network and data center operations. Continue Reading
-
News
20 Jun 2024
Arista, Cisco, HPE answer AI infrastructure demand
Arista, Cisco and HPE are racing to seize a share of the promising GenAI infrastructure market. Cisco and HPE have computing, networking and software; Arista focuses on networking. Continue Reading
-
News
18 Jun 2024
Trump could significantly alter U.S. climate priorities
The Biden administration's regulatory efforts have defined the U.S. approach to climate over the last four years. That could change if Trump wins the 2024 election. Continue Reading
-
News
18 Jun 2024
HPE GreenLake adds GenAI capabilities as on-premises PaaS
HPE GreenLake debuts a new PaaS offering for enterprise GenAI development, co-created with Nvidia. HPE also updated OpsRamp and server hardware refreshes for AI workloads. Continue Reading
-
News
14 Jun 2024
Security pros grade Apple Intelligence data privacy measures
Apple has built a Private Cloud Compute server to process and then delete data sent from Apple Intelligence running on an iPhone, iPad or Mac. Apple says it won't store any data. Continue Reading
-
News
12 Jun 2024
New Databricks tools target AI quality, cost and security
The vendor's latest features aim to help customers improve model accuracy by securely developing compound systems that include multiple language models and RAG pipelines. Continue Reading
-
Tip
10 Jun 2024
Gemini vs. ChatGPT: What's the difference?
ChatGPT took early lead among AI-generated chatbots before Google answered with Gemini, formerly Bard. While ChatGPT and Gemini perform similar tasks, there are differences. Continue Reading
-
News
05 Jun 2024
Cisco commits to GenAI with HyperFabric, $1B fund
Cisco partners with Nvidia on HyperFabric as customers try to understand how generative AI will help them manage their network infrastructure. Continue Reading
-
News
04 Jun 2024
AMD, Intel and Nvidia's latest moves in the AI PC chip race
Major chip makers used Computex 2024 to launch their aggressive AI chip strategies and stake a claim in the burgeoning AI PC market. Continue Reading
-
News
04 Jun 2024
Snowflake demonstrates shift to AI with newest features
After a slow start to building an environment for developing AI applications, the vendor unveils new features that show it is catching up to its peers. Continue Reading
-
News
04 Jun 2024
Intel launches Xeon 6 for AI data centers
This month, Intel will launch the first of its Xeon 6 data center silicon that offers two microarchitectures, one for performance and the other for power efficiency. Continue Reading
-
Podcast
03 Jun 2024
The importance of open source in GenAI
Lightning AI, the creator of PyTorch Lightning, was a pioneer of open source technology. The vendor has built on many platforms, but CTO Luca Antiga doesn't see one as best. Continue Reading
-
Definition
03 Jun 2024
What is generative AI? Everything you need to know
Generative AI is a type of artificial intelligence technology that can produce various types of content, including text, imagery, audio and synthetic data. Continue Reading
-
Opinion
30 May 2024
Dell Technologies World was all about AI; what about security?
At Dell Technologies World 2024, Dell made it crystal clear that it is all-in on AI, but the company must also emphasize the importance of cybersecurity. Continue Reading
-
Opinion
29 May 2024
Dell AI Factory takes the spotlight at Dell Technologies World
Dell has aligned its entire portfolio behind AI and has expanded critical partnerships to support customers adopting artificial intelligence technologies. Continue Reading
-
Guest Post
29 May 2024
How network engineers can prepare for the future with AI
The rapid rise of AI has left some professionals feeling unprepared. GenAI is beneficial to networks, but engineers must have the proper tools to adapt to this new change. Continue Reading
-
News
23 May 2024
AI companies losing public trust in safety
Researchers find that more than half of Americans polled believe AI companies aren't considering ethics when developing the technology, and nearly 90% favor government regulations. Continue Reading
-
Feature
22 May 2024
Latest AI network trends signal future of network automation
AI will revolutionize all aspects of network operations, from management to security. Experts discussed AI's potential and the challenges of readiness at ONUG Spring 2024. Continue Reading
-
News
22 May 2024
New IBM Watsonx GenAI focuses on enterprises, governance
The veteran tech giant, with deep roots in AI, bases its new AI strategy on open source, multimodel support and helping businesses modernize their code and IT operations. Continue Reading
-
Feature
22 May 2024
The benefits of AI in network operations
Sessions at ONUG Spring 2024 highlighted how AI will transform networking. Experts discussed how AI can help optimize, automate and secure networks, despite some skepticism. Continue Reading
-
News
21 May 2024
IBM moves ahead with open source, multi-model AI strategy
The 113-year-old tech giant expanded its open source, open ecosystem, multi-model approach with support for more models and updates to its Watsonx generative AI assistants. Continue Reading
-
News
20 May 2024
Dell refines AI factory, expands Nvidia partnership
The Dell AI Factory took center stage at Dell Technologies World with new infrastructure additions, a broader AI ecosystem and an expanded partnership with Nvidia. Continue Reading
-
News
14 May 2024
Google Gemini generative AI hits all products, including Search
The tech giant made its Gemini GenAI model more powerful and rolled out new Gemini-powered products for enterprises and consumers. Continue Reading
-
Definition
09 May 2024
information technology (IT)
Information technology (IT) is the use of computers, storage, networking and other physical devices, infrastructure and processes to create, process, store, secure and exchange all forms of electronic data. Continue Reading
-
News
09 May 2024
OpenAI deepfake detector 'belated but welcome'
The GenAI vendor's image-identifying tool has been warmly received, though some say it's belated, with elections around the world this year. The vendor also joined C2PA. Continue Reading
-
News
01 May 2024
Backup vendors embrace GenAI, but features remain immature
Data backup and disaster recovery vendors are keeping up with the GenAI hype by quickly releasing new features -- but the use cases are limited, and their value remains unclear. Continue Reading
-
Tip
01 May 2024
Embedding models for semantic search: A guide
Embedding models in semantic search are changing how we interact with information by going beyond keyword matching to capture meaning and relationships in text and other data. Continue Reading
-
News
25 Apr 2024
Microsoft's new Phi-3-mini AI language model runs on iPhone
Microsoft researchers contend the Phi-3-mini's performance is on par with the much larger ChatGPT 3.5 model and can run on an iPhone 14 powered by an A16 Bionic chip. Continue Reading
-
News
24 Apr 2024
Lenovo, AMD broaden AI options for customers
Lenovo is expanding its partnership with AMD to bring more options for servers and HCI devices aimed at AI. It also launched an AI advisory and professional services offering for customers. Continue Reading
-
Definition
23 Apr 2024
neuro-symbolic AI
Neuro-symbolic AI combines neural networks with rules-based symbolic processing techniques to improve artificial intelligence systems' accuracy, explainability and precision. Continue Reading
-
News
23 Apr 2024
AWS boosts Amazon Bedrock GenAI platform, upgrades Titan LLM
The cloud giant buttressed its GenAI platform with features to import, select and build safety guardrails for third-party LLMs from Meta, Cohere, Mistral and others more easily. Continue Reading
-
Definition
22 Apr 2024
host virtual machine (host VM)
A host virtual machine is the server component of a virtual machine the underlying hardware that provides computing resources to support a particular guest VM. Continue Reading
-
News
19 Apr 2024
Businesses confront reality of generative AI in finance
As large language models move from pilot projects to full-scale deployment in finance, the industry is facing a mixture of compliance and technological challenges in 2024. Continue Reading
-
News
18 Apr 2024
Meta releases two Llama 3 models, more to come
The social media giant's new open source LLM is telling of the challenges in the open source market and its future ambitions involving multimodal and multilingual capabilities. Continue Reading
-
Opinion
17 Apr 2024
Google Cloud Next 24 recap: GenAI tools for enterprises
New products introduced at Google Cloud Next stand to provide enterprises with GenAI tools for analytics, data management, AI model building and more. Continue Reading
-
News
17 Apr 2024
Lawmakers concerned about deepfake AI's election impact
Lawmakers want Congress to intervene and tackle AI manipulations that could affect U.S. elections. However, legislation has yet to advance to the House or Senate floor. Continue Reading
-
News
17 Apr 2024
Looking closer at Microsoft's investment in UAE AI vendor G42
The tech giant will own a minor stake, and G42's LLM will be on Azure. The move helps the cloud provider expand globally and helps the U.S. court the UAE away from China. Continue Reading
-
News
15 Apr 2024
Google Cloud customers disclose GenAI strengths, weaknesses
Troubling flaws remain a problem for enterprises, but Google Cloud customers, including Ford, Belk and Deutsche Bank, find the tech too compelling to pass up. Continue Reading
-
Tip
15 Apr 2024
Tips for planning a machine learning architecture
When planning a machine learning architecture, organizations must consider factors such as performance, cost and scalability. Review necessary components and best practices. Continue Reading
-
News
11 Apr 2024
Meta's new silicon shows a growing trend for AI hyperscalers
The new hardware highlights a growing trend of hyperscalers designing custom chips for internal use. This move will help vendors rely less on hardware providers such as Nvidia. Continue Reading
-
Tip
11 Apr 2024
Building networks for AI workloads
Conventional and high-performance computing networks cannot adequately support AI workloads, so network engineers must build specialized networks to accommodate their massive size. Continue Reading
-
Definition
10 Apr 2024
schema
In computer programming, a schema (pronounced SKEE-mah) is the organization or structure for a database, while in artificial intelligence (AI), a schema is a formal expression of an inference rule. Continue Reading
-
Tip
04 Apr 2024
How to build an enterprise generative AI tech stack
Generative AI tech stacks consist of key components like LLMs, vector databases and fine-tuning tools. The right tech stack can help enterprises maximize their generative AI ROI. Continue Reading
-
News
02 Apr 2024
Edge AI startup reveals GenAI accelerator, $120M fundraise
The startup introduced a new GenAI accelerator for PCs and smart vehicles. The vendor's growth highlights the shift to training GenAI workloads from the cloud to the edge. Continue Reading
-
News
28 Mar 2024
US AI policy for federal agencies requires transparency
The OMB's new policy calls for federal agencies to be transparent about AI use and designate chief AI officers to coordinate efforts. Continue Reading
-
Feature
27 Mar 2024
AI hardware vendors band together to challenge Nvidia
An industry group including Arm and Intel seeks to increase the number of options in the AI market and decrease developers' dependence on GPUs. Continue Reading
-
Podcast
25 Mar 2024
Security, bias risks inherent in GenAI black box models
Language models are stochastic models that generate output based on data upon which they have been trained. Often, these models are a closed black box. That leads to many problems. Continue Reading
-
News
22 Mar 2024
Nvidia partners, customers drive AI into data centers
Nvidia and its partners are providing the tools and infrastructure to build and deploy AI applications that companies say could transform their businesses. Continue Reading
-
News
21 Mar 2024
UN AI resolution marks global interest in rules, principles
While the United Nations' artificial intelligence resolution does not create legally binding rules, it might indicate which countries are thinking in that direction. Continue Reading
-
News
18 Mar 2024
Nvidia unveils new AI Blackwell chip, microservices and more
The vendor launched a barrage of AI tech including faster chips in its new Blackwell infrastructure and new microservices that enable enterprises to create custom applications. Continue Reading
-
Tip
18 Mar 2024
Compare enterprise generative AI deployment options
To pick the best generative AI deployment model for your organization, examine how cloud and on-premises approaches fit into your security, cost, infrastructure and network needs. Continue Reading
-
News
13 Mar 2024
Cerebras introduces next-gen AI chip for GenAI training
The new accelerator is for training large AI models. It powers the startup's CS-3 supercomputer, which is designed to train models that are 10 times larger than GPT-4 and Gemini. Continue Reading
-
Feature
13 Mar 2024
The need for common sense in AI systems
Building explainable and trustworthy AI systems is paramount. To get there, computer scientists Ron Brachman and Hector Levesque suggest infusing common sense into AI development. Continue Reading
-
News
12 Mar 2024
Meta intros two GPU training clusters for Llama 3
The Facebook parent company said the training clusters are part of its plans to grow its infrastructure and obtain 350,000 Nvidia H100 GPUs by the end of the year. Continue Reading
-
News
12 Mar 2024
Cohere tackles some generative AI challenges with Command-R
The startup's new large language model aims to address problems with factual accuracies in generative AI models. It also focuses on language problems, and cloud and API challenges. Continue Reading
-
News
11 Mar 2024
Elon Musk plans to take xAI chatbot Grok open source
The move comes nearly two weeks after the Tesla owner filed a lawsuit against OpenAI. It also comes as more vendors are providing open source options for enterprise users. Continue Reading
-
News
11 Mar 2024
Salesforce AI a work in progress for customers
While eager to embed AI in Salesforce software, customers acknowledge that planning, time and work are needed to make AI useful in the company's applications. Continue Reading
-
News
11 Mar 2024
Podcast: A look at SambaNova's open source AI strategy
Despite sometimes being seen as a direct competitor to Nvidia Systems, the AI hardware and software vendor tries to distinguish itself by focusing on training open source models. Continue Reading
-
News
07 Mar 2024
Microsoft whistleblower, OpenAI, the NYT, and ethical AI
The vendor has filed a memorandum to dismiss some of the arguments The New York Times made in its copyright lawsuit. However, it now faces criticism from its own software engineer. Continue Reading
-
News
05 Mar 2024
Box AI adds Microsoft Azure OpenAI Service integration
Box adds Microsoft Azure OpenAI Service to its lineup of AI tools for document summaries, joining Google's Vertex and OpenAI LLMs for users to choose from. Continue Reading
-
News
04 Mar 2024
AI race surges as Anthropic intros Claude 3
The new models have a larger context window and multimodal capabilities. They reflect the new level of normal in generative AI and the myriad model choices for enterprises. Continue Reading
-
News
29 Feb 2024
H2O.ai releases small language model: H2O-Danube-1.8B
The new model comes as the generative market continues to see the emergence of small language models. The models provide enterprises with better privacy and data controls. Continue Reading
-
News
29 Feb 2024
Collibra adds AI governance to data management platform
The data management vendor's new suite adds capabilities aimed at enabling enterprises to safely and securely use AI the same way data governance frameworks apply to data. Continue Reading
-
News
28 Feb 2024
Intel, Nvidia aim latest systems-on-a-chip at AI workstations
Nvidia's RTX 500 and RTX 1000 GPUs are for the lightest mobile workstations. Intel's Core Ultra with a built-in GPU can handle many AI-powered tasks on the workhorse computers. Continue Reading
-
News
26 Feb 2024
Microsoft allies with OpenAI rival Mistral AI
The tech giant is investing in the open source startup. The partnership means Mistral's premium models, including its new model, Mistral Large, will be available on Azure. Continue Reading
-
News
22 Feb 2024
Stability AI adopts new architecture in Stable Diffusion 3
The new version of the image model uses a different architecture than previous versions. It comes in different sizes and has better spelling capabilities. Continue Reading
-
News
22 Feb 2024
Intel Foundry launches as enterprise AI surges
If the trend continues, enterprises will need more AI chip suppliers to help stabilize prices and meet the demand for AI processing at the edge and the data center, experts said. Continue Reading
-
News
22 Feb 2024
AI vendor finds opportunity amid AI computing problem
The GPU cloud provider recently raised $320 million. It has found an opportunity as more enterprises seek to run generative models and the demand for infrastructure is high. Continue Reading
-
News
21 Feb 2024
Google releases new family of open models: Gemma
The cloud provider's new models compete with Meta's Llama 2 open source model. Google incorporates responsible AI standards that should appeal to enterprises. Continue Reading
-
Opinion
21 Feb 2024
AI news roundup: OpenAI video model, Nvidia chatbot and more
Explore last week's AI news highlights with analyst Mike Leone's roundup of top developments, including OpenAI's launch of video model Sora and Nvidia's locally running chatbot. Continue Reading
-
News
15 Feb 2024
Declining revenues lead to 4,000 job cuts at Cisco
A 12% drop in networking revenue contributed to the company's overall revenue decline and its decision to cut 5% of its workforce. Continue Reading
-
News
15 Feb 2024
Google updates AI model Gemini, adds 1M context window
The cloud provider's 1.5 Pro model has the largest context window seen in the market. Despite its innovation, it still needs to show the applicability of its model for enterprises. Continue Reading