Music Grading, Part 3: The Temp Track Illusion
Part 3 of a series on temporal and spatial placement. Read Part 1 | Read Part 2
Here's where it gets interesting.
The temp track for a recent cue I was working on was Veris leta facies from Orff's Carmina Burana — specifically the 1974 CBS Masterworks recording by Michael Tilson Thomas and the Cleveland Orchestra.
Start pulling the thread, and the three-variable model unravels something unexpected at every layer.
1. The Composer: Orff's original 1936 score calls for full orchestra, choir, and two pianos as a featured component. The text is medieval Latin and Middle High German — a springtime fertility poem from a 13th-century manuscript. Nothing liturgical. Nothing dark. A reduced version for two pianos and percussion was later authorized by Orff in 1956, to make the work accessible to smaller ensembles. No version calls for organ.
2. The Performers and the Vision: MTT's interpretation was explicitly not a straight reading. Critics described it as "Orff seen through the lens of Stravinsky's Les Noces" — mechanical, obsessive, rhythmically driven. Unusual and extreme by critical consensus of the time. This was not Cleveland playing Orff. This was MTT's argument about Orff, captured on tape.
3. The Technology — and the Real Revelation: Producer Andrew Kazdin — Glenn Gould's longtime collaborator and a pioneer of multi-channel orchestral recording — designed this session explicitly as a quadraphonic spatial showcase. His stated goal was to present "a different placement of each individual number" across the sound field. The spatial weight, the sense of vast liturgical space, the immersive surround of choir — that wasn't a room. It was an engineered construct, built in a studio and designed for a speaker format most people no longer own.
What you heard streaming in stereo in 2026 was a collapsed, two-channel shadow of a quadraphonic spatial experiment from 1974, performed by an orchestra interpreting a 1936 choral cantata set to a 13th-century springtime poem — cut against a violent baptism ritual.
And here's the honest admission: I wasn't actually hearing any of that clearly. I was hearing it through the filter of liturgical visuals, a director's feedback about "dusty chapels" and period correctness, and my own compositional intent already forming. The three-variable model wasn't just operating on the director — it was operating on me. What I perceived as the sonic identity of the temp was partly the recording, and partly everything I brought to the listening.
Which is exactly the point.
The Meaning of a Piece Is Not Fixed at Composition — It Accumulates
Orff didn't write a liturgical piece. He didn't write for organ. He didn't write for a stone chapel, a violent scene, or a 21st-century streaming platform.
And yet something in that compound — the springtime text, MTT's mechanical vision, Kazdin's spatial engineering, a music editor's cut, a director's instinct, a composer's interpretation of feedback — landed against a baptism scene and felt true.
None of those people were wrong. Each transformation added something the original couldn't have contained.
The meaning of a piece of music is not fixed at composition. It accumulates.
The temp track isn't a musical reference. It's a compound artifact — carrying the weight of every decision, accident, and transformation between the composer's first idea and the picture editor's timeline. When a director says "I want it to feel like this," they're responding to all of it at once, and they can't separate what came from where.
That uncertainty isn't a problem to solve. It's the starting point. Asking which of those three variables the director is actually responding to — that's the composer's job. You may need to honor one, translate another, and deliberately replace the third.
The IR chapel, the mic ratio work, the spectral shaping — none of that was recreating the temp. It was finding the sound the picture actually needed, independent of what the temp happened to be. The temp pointed a direction. The score finds the destination.
You are the next transformation. Score accordingly.