Reading Level

Multimodal learning / multimodal AI

• Multimodal AI - or multimodal learning - mimics the human brain’s ability to simultaneously process textual, visual, and audio information, enabling a more nuanced understanding of reality.
• Transitioning from a unimodal model (like those specialized in text, images, or sounds) to a multimodal model presents technical challenges, particularly in creating shared representations for different types of data.
• Multimodal AI offers advantages such as capturing more comprehensive knowledge of the environment and enabling new applications, like merging data from various modalities for complex tasks.
Watch the video
Soft Robotics Lab – ETH Zürich (lab head: Prof. Robert Katzschmann (not in the picture). From left to right: Jose Greminger (Master student), Pablo Paniagua (Master student), Jakob Schreiner (visiting PhD student), Aiste Balciunaite (PhD student), Miriam Filippi (Established researcher), and Asia Badolato (PhD student).

When will we see living robots? The challenges facing biohybrid robotics

Read the article
A woman stands in a train, holding a phone. She is wearing a beige coat and a blue and brown scarf. The interior of the train is bright, with seats and metal support bars.

A mathematical model to help AIs anticipate human emotions

Read the article
person wearing bioelectronic fibre arrays for dual-ECG signal acquisition / credit: Wenyu Wang and Yuan Shui

Bioelectronics: disease monitoring sensors that can be printed directly onto human skin

Read the article

Artificial pollination: robotic solutions that aim to supplement the work of bees

Read the article
GettyImages - A woman is working on a computer, surrounded by two screens displaying code. The office environment is bright and modern, with stationery items visible on the desk.

Tom Chatfield: “AI could lead to a massive pollution of the world’s data and the erasure of trust.”

Read the article
GettyImages - A man in a gray vest is consulting a tablet on a construction site, with visible cables in the background.

Automated intervention reports for augmented technicians thanks to generative AI

Read the article