# When the model already knows the physics

Canonical HTML: https://dsti.school/techblog/data-assimilation-physics-models

An ocean model may obey the equations of fluid dynamics and still begin from the wrong ocean. A satellite may measure sea-surface height accurately and still see only fragments. Professors Jacques Blum and Didier Auroux have spent decades working on the mathematics that joins those incomplete truths.

Research spotlight: Pr Jacques Blum & Pr Didier Auroux. Optimisation, inverse problems, numerical simulation and data assimilation.

22 Jun 2026 · 12 min read · AI is maths

Tags: data-assimilation, optimisation, inverse-problems, physical-models, numerical-analysis, hybrid-ai

## Three ingredients, none sufficient alone

**A model.** Physical laws describe how the system can evolve.

**Observations.** Sensors reveal parts of what the system actually did.

**Assimilation.** Optimisation reconciles the model and the evidence into a usable state.

DSTI's central position: learn what is unknown, but do not spend data, time and energy relearning reliable knowledge that can be represented explicitly.

Artificial intelligence is often described as learning a model from data. That is one important family of approaches, not a universal definition of intelligence. In weather, oceanography, energy systems, industrial processes and many other physical domains, we already possess equations, conservation laws, boundary conditions and decades of scientific knowledge. The real challenge is often to combine that knowledge with observations that are sparse, noisy and incomplete.

This is not an argument against machine learning. It is an argument for choosing what actually needs to be learned.

## 01 A forecast can obey every equation and still start from the wrong world

A dynamical model describes how a system evolves from an initial state. If that initial state is wrong, the model can solve its equations perfectly and produce the wrong trajectory. In nonlinear systems such as the atmosphere or the ocean, small errors in the starting state can grow quickly.
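The growth of small initial errors can be seen in a few lines. The sketch below is an illustration chosen here, not taken from the article: it uses the classic Lorenz-63 system with its usual parameters, a hand-written Runge-Kutta step, and two starting states that differ by one part in a hundred million. The function names `lorenz63`, `rk4_step` and `integrate` are hypothetical.

```python
import numpy as np

def lorenz63(state, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    """Right-hand side of the Lorenz-63 equations (standard parameters)."""
    x, y, z = state
    return np.array([sigma * (y - x), x * (rho - z) - y, x * y - beta * z])

def rk4_step(f, state, dt):
    """One classical fourth-order Runge-Kutta step."""
    k1 = f(state)
    k2 = f(state + 0.5 * dt * k1)
    k3 = f(state + 0.5 * dt * k2)
    k4 = f(state + dt * k3)
    return state + dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0

def integrate(state, n_steps, dt=0.01):
    """Return the whole trajectory, including the initial state."""
    traj = [state]
    for _ in range(n_steps):
        state = rk4_step(lorenz63, state, dt)
        traj.append(state)
    return np.array(traj)

# Same model, same solver; only the starting state differs, by 1e-8.
truth = integrate(np.array([1.0, 1.0, 1.0]), 3000)
perturbed = integrate(np.array([1.0, 1.0, 1.0 + 1e-8]), 3000)
separation = np.linalg.norm(truth - perturbed, axis=1)
print("initial separation:", separation[0])
print("largest separation:", separation.max())
```

Both runs solve the same equations with the same accuracy. The only difference is the initial state, which is exactly the quantity assimilation tries to estimate.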

Observations do not solve the problem by themselves. Satellites, buoys, radars and sensors measure only selected variables, at selected places and times, with uncertainty. The complete model state may contain millions of values; the observation vector is usually much smaller.

**Model: coherent, but imperfectly initialised.** The equations encode structure and physical consistency, yet the starting state and parameters may be uncertain.

**Data: real, but partial and noisy.** Measurements anchor the model in reality, but they do not describe every variable or every point in space and time.

**Inverse problem: recover what cannot be observed directly.** Infer the hidden initial condition, parameters or trajectory that best explains the evidence while respecting the model.

Data assimilation is the mathematical field at that interface. It does not ask the data to replace the model, or the model to ignore the data. It asks for the state that makes the two as compatible as possible.

## 02 Data assimilation: make the model listen without making it forget

One useful way to understand assimilation is as a controlled negotiation. The model contributes dynamics: which evolutions are physically possible. The observations contribute correction: where the simulated trajectory departs from reality. Statistical assumptions and regularisation decide how strongly each source should be trusted.

### A small assimilation laboratory

This deliberately simple illustration blends a model trajectory with observations. The teal analysis moves as the assumed confidence in the observations changes. Real data assimilation uses richer covariance structures, dynamical constraints and optimisation.

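[Interactive figure: a model trajectory, the observations and the assimilated analysis, with a slider for the relative confidence in observations (shown at 55%), ranging from "trust model more" to "trust observations more".]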

The important point is not the weighted average in the toy display. It is the principle: the analysis is constructed from both knowledge and evidence. A good assimilation method must respect the model's time evolution, account for observation error and remain computationally possible at the scale of the system.
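For readers who want to reproduce the toy display, the weighted average itself is a one-liner. The sketch below is a minimal reading of it, with hypothetical names (`blend`, `variance_weight`) and invented sample values. The second function shows where a weight could come from in the simplest scalar case, assuming two independent, unbiased estimates with known error variances.

```python
import numpy as np

def blend(model, obs, obs_confidence):
    """Toy analysis: convex combination of model values and observations."""
    w = float(obs_confidence)
    if not 0.0 <= w <= 1.0:
        raise ValueError("obs_confidence must lie in [0, 1]")
    return (1.0 - w) * np.asarray(model) + w * np.asarray(obs)

def variance_weight(model_var, obs_var):
    """Weight on the observation that minimises the analysis error variance
    for two independent, unbiased scalar estimates."""
    return model_var / (model_var + obs_var)

model = np.array([10.0, 12.0, 14.0])   # illustrative model values
obs = np.array([11.0, 11.0, 16.0])     # illustrative observations
analysis = blend(model, obs, 0.55)     # the 55% setting of the slider
print(analysis)                        # approximately [10.55, 11.45, 15.1]
print(variance_weight(model_var=1.0, obs_var=1.0))  # 0.5: equal trust
```

Operational systems replace the single scalar weight with covariance matrices and a dynamical constraint, which is where the mathematics becomes demanding.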

## 03 Optimisation is the bridge between what we know and what we observe

Variational data assimilation turns the reconstruction into an optimisation problem. A cost function measures the discrepancy between the model trajectory and the observations, then adds prior information or regularisation. The unknown initial state and parameters are adjusted until that cost is reduced.

Conceptual 4D-Var objective:

```text
J(x₀, θ) = observation mismatch + background mismatch + parameter regularisation
```

The mathematical details determine how errors are weighted and how uncertainty propagates. The core idea is a constrained search for the trajectory most consistent with both the equations and the measurements.
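One common way to write the three terms explicitly is shown below. The notation is an assumption made here for illustration, not taken from the article: $x_i$ is the model state at time $i$, $y_i$ the observations, $H_i$ the observation operator, $M_i$ the model step, $x_b$ and $\theta_b$ the prior (background) estimates, and $R_i$, $B$, $P$ the corresponding error covariance matrices.

```latex
J(x_0, \theta) =
  \frac{1}{2} \sum_{i=0}^{N} \bigl(H_i(x_i) - y_i\bigr)^{\mathsf T} R_i^{-1} \bigl(H_i(x_i) - y_i\bigr)
  + \frac{1}{2} (x_0 - x_b)^{\mathsf T} B^{-1} (x_0 - x_b)
  + \frac{1}{2} (\theta - \theta_b)^{\mathsf T} P^{-1} (\theta - \theta_b),
\qquad \text{subject to } x_{i+1} = M_i(x_i, \theta).
```

The inverse covariances are what "how errors are weighted" means in practice: a confident observation (small $R_i$) pulls the trajectory harder than an uncertain one.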

1. **Forward model.** Simulate the system from the current estimate of the initial state.
2. **Adjoint or sensitivity information.** Determine how changing the unknowns changes the mismatch.
3. **Optimisation step.** Update the estimate, run again and continue until an acceptable solution is reached.

This is why continuous optimisation, inverse problems and data assimilation belong together in an AI curriculum. The machine does not merely fit a curve. It solves a structured problem under constraints, with a computational budget and a definition of what counts as an admissible answer.
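The forward, adjoint and update loop described above can be sketched on a deliberately tiny problem. Everything specific here is an assumption made for illustration: the model is a two-variable rotation, only the first variable is observed, the covariances are replaced by an identity weight and a scalar `beta`, and the optimiser is plain gradient descent with a fixed step. Real systems use far richer models and optimisers.

```python
import numpy as np

theta = 0.1
A = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])  # toy linear model: one rotation per step
H = np.array([[1.0, 0.0]])                        # only the first component is observed
n_steps, beta = 50, 1e-3                          # window length and background weight

def forward(x0):
    """Forward model: propagate the initial state across the window."""
    traj = [x0]
    for _ in range(n_steps):
        traj.append(A @ traj[-1])
    return traj

def cost_and_gradient(x0, obs, xb):
    """Return J(x0) and its gradient, the latter from one backward adjoint sweep."""
    residuals = [H @ x - y for x, y in zip(forward(x0), obs)]
    cost = 0.5 * sum(float(r @ r) for r in residuals)
    cost += 0.5 * beta * float((x0 - xb) @ (x0 - xb))
    lam = np.zeros(2)
    for r in reversed(residuals):      # adjoint recursion, last time first
        lam = A.T @ lam + H.T @ r
    return cost, lam + beta * (x0 - xb)

rng = np.random.default_rng(0)
x_true = np.array([1.0, 0.5])
obs = [H @ x + 0.05 * rng.standard_normal(1) for x in forward(x_true)]
xb = np.zeros(2)                       # deliberately poor background estimate

x_est = xb.copy()
step = 1.0 / (n_steps + 1 + beta)      # 1/step bounds the Hessian norm for this toy problem
initial_cost, _ = cost_and_gradient(x_est, obs, xb)
for _ in range(200):                   # optimisation loop: forward run, adjoint run, update
    _, grad = cost_and_gradient(x_est, obs, xb)
    x_est = x_est - step * grad
final_cost, _ = cost_and_gradient(x_est, obs, xb)
print("estimate:", x_est, "truth:", x_true)
print("cost:", initial_cost, "->", final_cost)
```

The second component is never observed, yet it is recovered, because the model dynamics carry information from the observed variable to the hidden one. The adjoint sweep delivers the full gradient at the cost of roughly one extra model run, whatever the size of the state.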

## 04 Back and Forth Nudging: correct, reverse, repeat

Jacques Blum and Didier Auroux introduced the Back and Forth Nudging algorithm in 2005. Standard nudging adds a feedback term to the model equations, pulling the simulated state towards observations. Their key move was to apply correction both forwards and backwards across the same assimilation window.

1. **Run forwards.** Start from the current estimate and integrate the physical model while nudging its trajectory towards the observations.
2. **Run backwards.** Use the corrected final state to integrate back through the same window, with a feedback term of the appropriate sign.
3. **Update and repeat.** The recovered state at the beginning becomes the next initial estimate. Iterate until the reconstructed trajectory stabilises.

The first paper proved convergence for a linear system. It also made the method attractive in practice: the core formulation does not require the model linearisation, adjoint construction or separate minimisation process used by 4D-Var. Later work developed the theory and tested the approach on Lorenz systems, transport equations, shallow-water and ocean models.
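The three steps can be tried on a small linear system of the kind covered by the convergence result. The sketch below is an illustration written for this article, not the authors' code: it assumes a harmonic oscillator whose position alone is observed without noise, the same scalar gain `K` in both directions, and explicit Euler time stepping. The names `forward_sweep`, `backward_sweep` and `bfn` are hypothetical.

```python
import numpy as np

A = np.array([[0.0, 1.0], [-1.0, 0.0]])  # harmonic oscillator: x' = v, v' = -x
H = np.array([[1.0, 0.0]])               # only the position is observed
K = 2.0                                   # nudging gain (assumed equal in both directions)
dt, n_steps = 0.002, 2000                 # assimilation window of length 4
times = dt * np.arange(n_steps + 1)

x_true0 = np.array([1.0, 0.5])
# Exact position of the true trajectory: x(t) = x0 cos t + v0 sin t.
obs = x_true0[0] * np.cos(times) + x_true0[1] * np.sin(times)

def forward_sweep(x):
    """Integrate from t=0 to t=T, nudging the state towards the observations."""
    for k in range(n_steps):
        innovation = obs[k] - (H @ x)[0]
        x = x + dt * (A @ x + K * H[0] * innovation)
    return x

def backward_sweep(x):
    """Integrate from t=T back to t=0; the feedback term changes sign."""
    for k in range(n_steps, 0, -1):
        innovation = obs[k] - (H @ x)[0]
        x = x - dt * (A @ x - K * H[0] * innovation)
    return x

def bfn(x0, n_iterations=10):
    """Alternate forward and backward sweeps; return successive initial states."""
    history = [x0]
    for _ in range(n_iterations):
        x_final = forward_sweep(history[-1])
        history.append(backward_sweep(x_final))
    return history

history = bfn(np.zeros(2))
errors = [float(np.linalg.norm(x - x_true0)) for x in history]
print("error on the initial state, per iteration:", np.round(errors, 4))
```

No cost function, gradient or adjoint model appears anywhere: each iteration is two ordinary model integrations with a feedback term. The remaining error in this sketch comes from the crude time stepping, and the stability of the backward sweep depends on the model and the gain, which is why the choice of `K` is listed as a challenge in the comparison that follows.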

| Method family | Central mechanism | Strength | Engineering challenge |
| --- | --- | --- | --- |
| 4D-Var | Minimise a cost over a time window. | Globally structured variational formulation. | Adjoint development and repeated model integrations can be demanding. |
| Kalman / ensemble filters | Alternate forecast and statistical correction. | Explicit treatment of evolving uncertainty. | Covariance propagation or large ensembles can be costly. |
| BFN / DBFN | Alternate forward and backward observers. | Direct feedback, comparatively light implementation and rapid convergence in studied settings. | Backward stability, gain selection and model suitability still require mathematical care. |

The Diffusive Back and Forth Nudging extension was designed for particular diffusive models. In experiments on a two-dimensional shallow-water model and a three-dimensional primitive-equation ocean model, it stabilised backward integration and reduced the impact of noisy observations. That is a research result, not a claim that one algorithm replaces every other method.
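Schematically, and in notation assumed here rather than quoted from the papers, the idea can be written as follows. $F$ stands for the non-diffusive part of the model, $\nu \Delta$ for the diffusion term, $H$ for the observation operator, and $K$, $K'$ for the forward and backward gains. To our understanding of the published formulation, the backward sweep changes the sign of the diffusion term as well as that of the feedback term.

```latex
\begin{aligned}
\text{forward:}\quad & \partial_t X_k = F(X_k) + \nu\,\Delta X_k + K\,(X_{\mathrm{obs}} - H X_k),
  & X_k(0) &= \tilde X_{k-1}(0), \\
\text{backward:}\quad & \partial_t \tilde X_k = F(\tilde X_k) - \nu\,\Delta \tilde X_k - K'\,(X_{\mathrm{obs}} - H \tilde X_k),
  & \tilde X_k(T) &= X_k(T).
\end{aligned}
```

The motivation is visible on a single Fourier mode of wavenumber $\kappa$: under $\partial_t u = \nu \Delta u$ it decays like $e^{-\nu \kappa^2 t}$ forwards in time, so the same equation integrated backwards amplifies it like $e^{+\nu \kappa^2 s}$, and small scales and noise grow fastest. With the sign reversed, the backward sweep is dissipative too.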

The mature scientific position: methods are chosen according to the structure of the model, the observation system, the uncertainty and the computational constraints. “Use AI” is not yet a method specification.

## 05 Two research lives, one culture of applied mathematics

The collaboration is especially powerful because it sits inside much broader research careers. The same mathematical language — partial differential equations, control, optimisation, numerical analysis and inverse problems — travels from plasma physics to ocean circulation, image processing, weather forecasting and industrial modelling.

![Portrait of Professor Jacques Blum](https://upload.wikimedia.org/wikipedia/commons/f/f5/Blum_Jacques.jpg)

Professor · numerical analysis, control and data assimilation

### Pr Jacques Blum

From the École normale supérieure and a doctorate under [Jacques-Louis Lions](https://en.wikipedia.org/wiki/Jacques-Louis_Lions), through CNRS research and professorships at Grenoble, École Polytechnique and Nice, Jacques Blum built a career around the simulation, identification and optimal control of physical systems governed by partial differential equations.

His work spans tokamak plasma equilibrium, real-time reconstruction, ocean circulation and data assimilation. Even the 2017 version of his CV records a research and teaching trajectory of remarkable breadth. At DSTI, he is a member of the Scientific Advisory Board and helped shape the school's approach to mathematical support across the student body.

- 1984: CNRS Bronze Medal
- 1990: Prix Blaise-Pascal
- 1998: Seymour Cray prize
- 2017: Grand Prix de la Ville de Nice

[University page](https://math.univ-cotedazur.fr/u/jblum/) · [Biography](https://fr.wikipedia.org/wiki/Jacques_Blum) · [DSTI faculty](https://dsti.school/faculty#academics)

![Professor Didier Auroux](https://msi.univ-cotedazur.eu/medias/photo/didierauroux_1705058244330-png?ID_FICHE=1197071)

Professor · Director of the Maison de la Modélisation, de la Simulation et des Interactions

### Pr Didier Auroux

Didier Auroux trained at the École normale supérieure de Lyon, completed a doctorate on data assimilation for environmental problems and an habilitation on fast algorithms for image processing and data assimilation. His research joins geophysics, observers, optimal control, inverse problems, numerical analysis and scientific computing.

He now directs Université Côte d'Azur's [Maison de la Modélisation, de la Simulation et des Interactions](https://msi.univ-cotedazur.fr/a-propos), a structure that supports research through modelling, simulation, high-performance computing and data science.

[University page](https://math.univ-cotedazur.fr/u/auroux/) · [Publications](https://math.univ-cotedazur.fr/u/auroux/prod_fr.php) · [MSI](https://msi.univ-cotedazur.fr/a-propos) · [DSTI faculty](https://dsti.school/faculty#academics)

## 06 Excellence matters most when students can reach it

At DSTI, the point of bringing distinguished mathematicians into the classroom is not to decorate a faculty list. It is to let students encounter the habits of mind behind serious modelling: define the state, expose assumptions, formulate the objective, identify what is observable, and understand the numerical consequences.

**Warm Up · Fundamentals of Mathematics.** Jacques Blum teaches the mathematical foundations in the Warm Up of every DSTI data MSc programme, working with cohorts whose prior mathematical preparation can vary widely.

**All data MSc programmes · Mathematics for Data Science.** Jacques Blum and Didier Auroux jointly teach the mathematical language needed to reason about data science rather than merely operate its tools.

**MSc Data Science & AI · Continuous Optimisation.** Students learn how objectives, gradients, constraints and algorithms turn a mathematical problem into a computable solution.

**Across the curriculum · Support Sessions.** Jacques proposed the creation of DSTI's Support Sessions, in the spirit of the recitation classes used at Ivy League and leading Californian universities. Didier regularly leads support sessions for mathematics-driven modules. The standard is high, and students are given additional structured teaching to help them reach it.

**MSc Data Science & AI · Inverse Problems & Data Assimilation.** Their research field enters the curriculum directly: reconstructing hidden states and parameters from models and incomplete observations. [Explore the curriculum](https://dsti.school/msc-in-data-science-and-ai#curriculum).

**BSc Computer Science & Engineering · Mathematics Harmonisation.** Didier teaches Mathematics Harmonisation with Dr Christine Malot, helping students establish a shared mathematical foundation before progressing to later quantitative work. [Explore the curriculum](https://dsti.school/bachelor-computer-science-and-engineering#curriculum).

**BSc Computer Science & Engineering · Energy – Climate – Sustainable IT.** Jacques teaches the physics component, connecting computation to the physical systems, energy limits and environmental questions it affects. [Explore the curriculum](https://dsti.school/bachelor-computer-science-and-engineering#curriculum).

Jacques and Didier are particularly attached to teaching across the full student population, including learners far from their own research level. That matters. Mathematical confidence is not created by lowering the intellectual ceiling; it is created by building a reliable route towards it.

## 07 DSTI's position: hybrid intelligence before fashionable uniformity

### Do not force the learner to rediscover what the domain already knows.

When reliable physical laws, constraints, taxonomies or relationships exist, represent them. Use data-driven learning for the residual uncertainty, unknown parameters, unresolved scales and patterns the explicit model cannot provide. Intelligence lies in the combination.

01 Respect known structure
Conservation laws, differential equations, causal constraints and domain knowledge are information. Discarding them is not neutrality; it is a design decision.

02 Learn the unknown part
Data are invaluable where parameters are uncertain, models are incomplete, sub-grid effects are unresolved or patterns cannot be specified analytically.

03 Optimise the interface
The difficult work is deciding how model error, observation error and learned components interact — and validating the resulting system.
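A minimal sketch of the principle "represent what is known, learn what is not", on an example chosen here for illustration: the structure of Newton's law of cooling is written down explicitly, and only the unknown rate `k` is estimated from noisy measurements. The function names, temperatures and noise level are all assumptions of this sketch.

```python
import numpy as np

def cooling_model(t, k, temp_env=20.0, temp_0=80.0):
    """Known structure (Newton's law of cooling); only the rate k is unknown."""
    return temp_env + (temp_0 - temp_env) * np.exp(-k * t)

def estimate_rate(t, temps, temp_env=20.0):
    """Least-squares fit of k: log(T - T_env) is linear in t with slope -k."""
    slope, _ = np.polyfit(t, np.log(temps - temp_env), 1)
    return -slope

rng = np.random.default_rng(1)
t = np.linspace(0.0, 10.0, 40)
k_true = 0.3                                   # hidden parameter used to simulate data
temps = cooling_model(t, k_true) + 0.05 * rng.standard_normal(t.size)

k_est = estimate_rate(t, temps)
print("true rate:", k_true, "estimated rate:", k_est)
```

Forty noisy points are enough here because the data only have to determine one number. A generic learner given the same points would also have to rediscover the exponential form, the ambient temperature and the behaviour outside the measured interval.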

Physical systems

### Data assimilation

Combine a dynamical model with observations so the reconstructed state respects both the evidence and the laws governing evolution.

≈

Knowledge systems

### Semantic Web

Represent known entities and relationships explicitly rather than asking every downstream system to infer them repeatedly from unstructured data.

The analogy is an engineering principle, not a claim that the mathematics is identical. In both cases, explicit knowledge and learning are complementary. Pr Fabien Gandon's teaching of Semantic Web technologies and the data-assimilation work of Jacques Blum and Didier Auroux point towards the same educational discipline: know what you know, learn what you do not, and make the boundary inspectable.

This also changes how efficiency is taught. A smaller, structured method can sometimes be preferable to a larger generic learner: less data movement, less training, stronger physical consistency and a clearer explanation of failure. Sometimes the learned model is the right answer. Sometimes it is one component inside a larger mathematical system.

## 08 The research trail behind the classroom

The article is grounded in a sequence of publications that traces the work from the introduction of an algorithm and its convergence proof through numerical comparison, theoretical development and geophysical applications.

Back and forth nudging algorithm for data assimilation problems
Didier Auroux & Jacques Blum · C. R. Acad. Sci. Paris, 2005

The founding note introduces BFN and proves convergence for a linear ordinary differential equation system.

[DOI](https://doi.org/10.1016/j.crma.2005.05.006)

A nudging-based data assimilation method: the Back and Forth Nudging algorithm
Didier Auroux & Jacques Blum · Nonlinear Processes in Geophysics, 2008

A fuller development and numerical study of the method in oceanographic data assimilation.

[Open article](https://doi.org/10.5194/npg-15-305-2008)

Diffusive Back and Forth Nudging algorithm for data assimilation
Didier Auroux, Jacques Blum & Maëlle Nodet · C. R. Mathématique, 2011

An extension designed to manage diffusion in backward integration.

[DOI](https://doi.org/10.1016/j.crma.2011.07.004)

Data Assimilation for Geophysical Fluids: The Diffusive Back and Forth Nudging
Didier Auroux, Jacques Blum & Giovanni Ruggiero · Mathematical Paradigms of Climate Science, 2016

Tests on shallow-water and full ocean models, including the method's behaviour with observation noise.

[Book chapter](https://doi.org/10.1007/978-3-319-39092-5_8)

### The lesson students should keep

AI is not a single class of models. It is the disciplined construction of systems that infer, optimise and act under uncertainty. Sometimes data should learn the model. Sometimes data should correct the model. Knowing the difference is part of becoming an engineer who understands the scientific foundations.
