Anni di esperienza
0
+
Aziende e ONG
0
+
Dirigenti e leader
0
+
Most organisations communicate with words. Strategy documents. Email chains. Slide decks. Quarterly reports. The more important the message, the longer the text tends to be. This is the wrong instinct. And it is expensive.
Visual storytelling is the discipline of communicating through structured images, graphic forms, and any other component of the visual form not as a supplement to words, but as the primary medium of meaning. It is the practice of understanding how images encode and transmit significance, how visual choices shape perception before the conscious mind can intervene, and how the deliberate deployment of visual form produces comprehension that text alone cannot achieve. The visual form as employed in storytelling is expected to be always strategic, if not it remains purely decorative.
The numbers are not contested: over 90% of digital data is visual. The brain processes images approximately 60,000 times faster than text, and people retain roughly 80% of what they see versus 20% of what they read. These are not arguments for aesthetics. They are arguments for competitive advantage.
In the YESS methodology, visual form is the first ‘1’ in the equation 1+1=3. It is the operational hardware of communication: the shape that content takes. When form and content are misaligned, the story fails to land regardless of how well-constructed the narrative is. When form and content are integrated through what YESS calls Montaggio, i.e. the operative grammar of Strategic Storytelling, the result is the surplus value of 3: meaning that exceeds the sum of its components.
Visual storytelling is therefore not a department. It is not a deliverable. It is the medium through which strategy becomes perception, and perception becomes value for your enterprise.
Visual storytelling is not graphic design. Design is the execution layer. Visual storytelling is the strategic layer that precedes and governs it: what should be communicated, to whom, in what visual register, and to what end. Confusing the two is one of the most expensive structural errors in organisational communication.
Visual storytelling is not a rebrand. Changing your logo and colour palette is not visual storytelling. Visual storytelling is structural: it concerns the deep choices about what to show, what to omit, what to juxtapose, and how those choices construct meaning in the viewer’s mind before a single word is processed.
Visual storytelling is not social media content. Posting images is not a visual communication strategy. What is required is something harder and more valuable: an understanding of the visual grammar, as the rules by which images generate meaning, and the discipline to deploy that grammar consistently and purposefully across every stakeholder touchpoint.
And visual storytelling is absolutely not decoration. When visual communication is treated as the last step of a process, as ‘making it look nice,’ the result is communication that is memorable in form but hollow in meaning. Visual form must carry the argument. When it merely ornaments it, the 1+1=3 equation collapses to 1+1=2, or less.
Visual storytelling matters now because the cognitive environment your stakeholders inhabit has fundamentally changed, and most organisations are still communicating as if it had not.
Attention is the scarcest resource. In a landscape where the average person encounters thousands of images daily, and makes snap judgments in milliseconds, text-heavy communication does not fail slowly. It does not register at all. Your enterprise’s narrative needs to operate at the speed of visual perception, not the speed of reading.
The democratisation of visual production has raised the baseline. When anyone can produce visually polished content, visual quality is no longer differentiating. What differentiates your communication is visual meaning: the deliberate use of image, composition, and form, to construct specific understanding in a specific audience. This is a discipline truly few enterprises do have.
At the same time, the rise of AI-generated imagery has created a paradox that sophisticated enterprises are beginning to exploit. In a world flooded with synthetic visuals that look technically correct, but carry no authentic meaning, images that carry genuine narrative weight stand out precisely because they are legible as real. Visual authenticity is becoming a premium differentiator.
And, most importantly, in an era of ESG scrutiny and stakeholder activism, what you show says more than what you say. A sustainability report with carefully controlled imagery communicates something different from one with unmediated operational photography. The gap between the official visual narrative and the lived visual reality is increasingly the front line of reputational risk, not the backroom of marketing.
Massimedia’s approach to visual storytelling is built on the premise that most consulting practices still undervalue, if not thoroughly ignore: visual communication is not a craft skill to be learned on the side, but a strategic intelligence to be systematically developed.
At Massimedia, we prefer to start with a visual literacy training, such as SIS, to develop the team’s capacity to read, analyse, and deliberately produce images as meaning-making structures. Most professionals lack this, even when they have instincts about what looks good or feels right, there is no structured understanding of the crucial Why. Visual literacy training closes that gap by developing the analytical frameworks that make visual choices legible, so teams consciously make confident decisions rather than by lucky accident.
The following step is to move to visual narrative architecture: the application of the YESS methodology’s Montaggio grammar to the full scope of an organisation’s visual communication. This means understanding how juxtaposition (what is placed beside what), sequence (what follows what), composition (how elements are arranged within a frame), and omission (what is deliberately excluded) work together to construct intended meaning at every stakeholder touchpoint in a manner that is thorough strategic.
The result is not a style guide, but an organisation whose visual communication decisions are grounded in strategic intent rather than aesthetic preference: one that can answer the question ‘Why are we showing this?’ with precision, and one that can read the visual storytelling of competitors, media, and culture with the same analytical clarity.
Complementary to this practice is MIA (The Meta-Image App), the first self-contained and self-managed hands-on training on visual storytelling developed from Massimiliano Fusari’s academic research at the University of Westminster. MIA provides teams with an intentional and effective practical instrument for making visual analysis and developing visual intelligence in real time. It is the tool that takes visual storytelling out of the classroom and into the daily workflow of practitioners who need it where decisions actually happen.
Visual storytelling is where Massimedia’s advantage is most distinctive, and truly most difficult to replicate.
Dr. Massimiliano Fusari holds a PhD in Visual Anthropology from the University of Venice (2011). He then expanded his theoretical expertise with a consequential PhD research in Multimedia Storytelling (2014) that consolidated his findings, and translated them into a revised approach to communication, and more precisely to storytelling: Massimiliano finalised his notion of Strategic Storytelling as pivoting on the visual quality of communication. His entire consulting and academic practice has been built at the intersection of how images construct meaning, how visual communication drives strategic outcomes, and how organisations can develop genuine visual intelligence rather than merely commissioning visually competent output. The academic foundation is not background, but the engine that fuels his consulting activity in parallel to the senior faculty positions he held at the H-FARM College and the University of Westminster. As a result, Fusari is one of the few practitioners globally who can connect the academic theory of visual storytelling to the operational mechanics of enterprise communication, and build toolkits and frameworks that practitioners can actually use.
The result is a practice that does not subcontract visual thinking to a design agency, but develops visual intelligence as the key organisational capability: the ability to make strategic visual choices, read the visual communication of competitors and cultural contexts, and deploy visual form as the primary language of strategic persuasion.
The complementary MIA app, developed from Fusari’s research at the University of Westminster, makes this capability accessible to all and truly in real time. The SIS (Storytelling | Image | Strategy) training programme has delivered visual storytelling intelligence to government institutions, corporations, and academic programmes for over 15 years, from its origins in visual diplomacy training for the Italian Ministry of Foreign Affairs to its current form as a comprehensive consulting and training intervention. And the YESS methodology ensures that every visual choice an organisation makes is grounded not in aesthetic preference, but in strategic intent.
Over 25 years and 100+ transformations, from the Italian Ministry of Foreign Affairs to the International Labour Organization, from the Council of Europe to private enterprises, accelerator programmes, and startups across Asia and Europe, Massimedia has refined a visual storytelling practice where rigour and results are the same thing.
Within the YESS methodology, the centrality of visual form is not incidental. It is strategically structural.
YESS’s operative grammar, Montaggio, is a fundamentally visual practice. Drawn from Eisenstein’s cinematic theory, validated by the Kuleshov Effect, and extended through Roland Barthes’ semiotic analysis of images, Montaggio is the mechanism through which the juxtaposition of discrete visual and verbal units produces emergent meaning that neither unit contains independently. This is the mechanism of 1+1=3 in its most precise form: not a metaphor for vague synergy, but a description of how meaning is constructed between elements through the deliberate act of composition and editing.
Every Pillar of YESS has a visual dimension. This is not incidental but constitutive: because Strategic Storytelling operates at the intersection of form and content, every one of its nine structural Pillars generates, requires, or is tested by a visual logic. In YESS, the visual form is not the packaging of the narrative: it is its primary language.
Domanda | establishes the visual dimension before a single image is chosen. The diagnostic inquiry that constitutes Pillar 1 must be applied with equal rigour to the enterprise’s visual situation. An enterprise that has not asked the right question will not produce the right image; it will produce, at best, a beautiful answer to the wrong inquiry. |
Vision | is composed through image: the aspirational future state must be made visible before it can be believed. The entire philosophical etymology of Vision encodes this necessity. The essential visual logic of Vision as a Pillar is that any enterprise must first create the image of who it is becoming. |
Culture | is communicated through what an enterprise chooses to show and what it refuses to show, with the visual decisions constituting the stance itself. Culture as a Pillar demands visual decisions that are simultaneously moral positions. The visual choice is the cultural stance: there is no version of this Pillar achieved through words while remaining neutral in images. |
Creatività | disrupts not just through language but mostly through form: the unexpected image that violates category conventions is often more persuasive than the unexpected word. The most creative visual strategies are not merely different but differently meaningful, with the deviation from convention producing, rather than dissipating, meaning. |
Coherence | is, among its other dimensions, a visual discipline of the highest order. Visual incoherence is immediately perceptible to an audience, often before verbal incoherence becomes apparent. The enterprise’s visual system is a reservoir of accumulated meaning that is depleted each time it is abandoned, and compounded each time it is deepened. |
Authenticity | is largely a visual judgment: audiences evaluate whether the images an enterprise deploys align with its operational reality, or contradict it. Visual authenticity is not a stylistic choice, but the commitment to congruence between the seen and the real. |
Audience | is calibrated as a visual exercise: different cultural contexts read visual codes differently, and the enterprise that misreads those codes fails regardless of the precision of its verbal narrative. Audience demands that the enterprise approach its visual choices with the rigour of a cultural translator: visual codes do not travel neutrally across cultural boundaries. |
Product | is the Pillar in which the visual argument achieves its most consequential and irrefutable form. The Product is not the object about which a story is told: it is the medium through which the story is made visible. The product embodies its values visually to completely reduce the enterprise’s dependence on communication. |
Purpose | gives visual communication its direction and its capacity to mean something beyond immediate impact. The visual system without Purpose is a collection of images; the visual system governed by Purpose is an argument. And in Strategic Storytelling, it is arguments, not images, that build the durable conviction on which enterprise value ultimately rests. |
This is why visual communication is not a component of YESS. It is the medium through which YESS operates. The nine Pillars define what a strategic narrative must possess. Montaggio defines how that narrative is constructed, composed, sequenced, and deployed. Strip the visual dimension from YESS, and the methodology remains theoretically intact, but operationally inert. For professionals seeking to master Strategic Storytelling, visual intelligence is not optional enrichment. It is the difference between knowing what to say and knowing how to make it seen.
Do you fear that visual communication is decoration, the design team’s domain, or a stylistic concern subordinate to strategy? Do you suspect that “visual storytelling” is fashionable language for what every marketing team already does? If you are a sceptic about the strategic centrality of the visual in today’s communication, these FAQs are precisely for you!
Each answer is grounded in over thirty years of academic research and daily operational practice through the Massimedia YESS methodology (Your Enterprise Strategic Storytelling): its 1+1=3 axiom, its nine Pillars, its operative grammar of Montage, and the documented engagements that demonstrate the strategic consequences of visual governance across sectors and scales.
Because visual decisions are now P&L events, not stylistic preferences. In November 2024 Jaguar launched a rebrand whose flagship film contained no car, replacing its leaping-cat marque with a pastel wordmark. By April 2025, European sales had collapsed 97.5% year-on-year (49 cars sold against 1,961 in the prior April), Tata Motors stock declined roughly 10% year-to-date against a +6% benchmark, and Jaguar Land Rover’s CEO stepped down. Nor is this an isolated, consumer-brand artefact.
Blankespoor, Hendricks and Miller (Journal of Accounting Research, 2017) demonstrated empirically that investor perception of CEOs, derived from 30-second content-filtered video clips of IPO roadshows, is positively associated with offer price and first-day valuation, controlling for fundamentals. A visual decision is therefore a fiduciary one. The YESS methodology treats it as such: as architecture to be governed, not aesthetics to be admired.
Yes! And it is your most under-audited concern. Heritage is itself a visual proposition, whether the enterprise has formalised it or not. The patina of an industrial site, the typography of a century-old wordmark, the archival photograph in the annual report are each doing strategic work, often in directions the leadership has never realised. Heritage that goes unread visually is heritage that depreciates: ungoverned, it becomes nostalgia, which markets discount.
Patagonia, by contrast, has governed five decades of visual continuity into demonstrable pricing power and the calibre of capital it attracts. Veneto industrial groups and Italian family firms hold comparable visual capital and frequently leave it uncatalogued, therefore unmonetised. The strategic question is therefore inverted. Institutional, industrial, and heritage enterprises, especially those with decades of accumulated visual equity, are the ones with the most to lose from neglecting visual governance, and the most to recover from instituting it.
Precisely because technical sectors are where unaudited visual decisions cost the most. Every regulatory filing, financial dashboard, due-diligence pack, RFP response, engineering schematic and investor deck is a visual artefact. The placement of a chart, the legend on an axis, the colour assigned to a risk indicator decide what stakeholders see first and conclude fastest. Decades of research demonstrated that flawed visual encoding has changed real outcomes. Think of an IKEA assembly leaflet: a visual artefact engineered to govern stakeholder behaviour without a single sentence.
The Massimedia client roster, the International Labour Organization, the Council of Europe, national governments, industrial groups across the Veneto, is itself the evidence. Industrial complexity does not exempt an enterprise from visual rigour: it raises the cost of getting it wrong.
It commoditises production, not governance, and has therefore raised, not lowered, the strategic premium on visual architecture. Industry analyses from McKinsey, Gartner and BCG converge on the same diagnostic across 2024–2025 enterprise GenAI deployments: most fail to convert into measurable value, and the leading reason is the absence of upstream strategy. AI is a downstream amplifier. It multiplies output but does not decide which images, in what sequence, with what omissions, addressed to which decision-maker, with what consequence on perception. Generating a thousand images per hour solves nothing if the upstream architecture is absent.
Visual fluency is no longer about who can produce; it is about who can govern. The YESS framework, with its nine Pillars and its Montage grammar, supplies the upstream logic that makes any downstream production, human or generative, strategically usable. AI accelerates an architecture that exists. By design, it cannot supply one.
The cognitive evidence is robust, even where popular statistics are not. Mary Potter’s 2014 MIT study (Attention, Perception & Psychophysics) demonstrated that the brain can identify the meaning of an image after exposures of as little as 13 milliseconds, at the lower bound of perceptual processing. Allan Paivio’s Dual Coding Theory (1971) and Richard Mayer’s Cognitive Theory of Multimedia Learning, replicated across three decades of peer-reviewed research, establish the Picture Superiority Effect: visual and verbal information are processed through distinct cognitive channels, and well-designed visuals reliably outperform text-only encoding for comprehension and recall.
Combined with the dominance of video in current internet traffic, over 65% per the AppLogic / Sandvine 2023 Global Internet Phenomena Report, projected toward 80% on mobile networks, the convergent finding is unambiguous. The visual is the dominant comprehension channel of the people you need to persuade.
It cannot, and YESS does not propose to. What scales is the diagnostic discipline, not a single visual signature. Red signals celebration in Chinese consumer markets and warning in Western financial dashboards. A sans-serif wordmark that reads as authoritative in Northern European institutions can read as discount retail in Southern Italian heritage sectors. A confident campaign tone in Frankfurt can read as arrogance in Tokyo. The codes are non-negotiable, and they are not transferable.
This is precisely the kind of work academic foundations make possible. Visual anthropology, the discipline behind one of the founder’s two doctorates, is the systematic study of how meaning is produced visually within and across cultures. Massimedia’s engagements across UN agencies, European institutions, and enterprises in Asia, Europe and Africa are the operational proof that visual coherence and cultural specificity are reconcilable when the method is rigorous. The methodology produces a Visual Storytelling Architecture coherent across markets, not identical within them.
No, and the difference is operational, not stylistic. Graphic design produces artefacts; strategic visual storytelling produces governed meaning. A logo is a mark, but a Visual Storytelling Architecture is the structure that decides which images, which sequences, which deliberate omissions an enterprise delivers.
The 1+1=3 axiom captures the difference operationally. It is a direct inheritance from Sergei Eisenstein and Lev Kuleshov, who demonstrated in the 1920s that two images placed in sequence generate a third meaning that neither contained alone. Branding answers what something looks like; the YESS methodology asks what the visual is doing to stakeholder cognition, and what the strategic consequence of that effect is. That third meaning is where strategy lives, where graphic design ends, and where the YESS Montage grammar begins.
Aesthetics is not the opposite of strategy; it is one of its most consequential instruments, but only when it is governed. Decoration is what happens to a visual when no one has decided what it must do. Strategic form, by contrast, is the disciplined alignment of image, sequence, and meaning to a strategic intent.
Jaguar 2024 was not a bad-aesthetics story; it was an ungoverned-aesthetics story. The pastel palette, the cars-absent film, the lower-case wordmark were each aesthetic decisions with a balance-sheet consequence. My forthcoming monograph Aesthetics as Storytelling (Intellect, UK) makes the argument explicitly: aesthetic decisions are strategic decisions, whether the enterprise admits it or not. A typography choice signals risk posture. A photographic register signals cultural positioning. A composition signals authority, or its absence. The sceptic dismisses aesthetics because the cost of their current aesthetic decisions has not yet been measured.
Yes, which is why YESS is a methodology of selection, not multiplication. The discipline is not to produce more images, but to produce fewer images with higher meaning density. The Montage grammar exists precisely to engineer the cognitive lift of every visual decision so that one frame does the work of ten. This is the operational lesson Eisenstein and Kuleshov demonstrated a century ago: meaning is made at the cut, not inside the shot.
Attention economics confirms it. Patagonia, Apple, and Estetista Cinica, three otherwise incomparable enterprises, share a single discipline: visual restraint. Each authorises a fraction of the visual output their competitors produce, and each recovers a disproportionate share of stakeholder attention as a result. I n a saturated field, visual restraint is the highest form of visual strategy. Adding noise without architecture compounds an enterprise’s own irrelevance.
By the same metrics that govern any strategic discipline. Five are typically diagnostic for a board: deal velocity (the time from first stakeholder contact to signed agreement, where governed visuals demonstrably compress understanding); cost of capital (the perception premium quantified by Blankespoor et al., Journal of Accounting Research, 2017, in IPO pricing); talent acquisition (the cost-per-hire delta between firms with coherent visual identity and those without); crisis-response time (elapsed hours between an event and a defensible visual response, audited against peer benchmarks); and stakeholder-perception stability across cycles, not campaigns.
The unmeasured cost is the unmanaged one. Enterprises rarely audit what their ungoverned visual output is already costing them in lost contracts, mispriced talent, or repeated explanations that should have happened once. The Massimedia diagnostic phase quantifies precisely that gap. Once the gap is visible, the return on closing it is the cheapest decision the boardroom will make this year.
Most organisations communicate with images by accident and with words by design. The competitive advantage lies in making each communication form resonate by design and intentionality, and thus deploying visual storytelling with the same strategic rigour as financial planning, market entry analysis, or crisis management.
Book a complimentary 30-minute discovery session. No slides. No sales pitch. A direct conversation about how your organisation currently communicates visually, and what it is leaving on the table by not doing so strategically.
Si è verificato un problema nella segnalazione di questo post.
Confermare che si desidera bloccare questo utente.
Non sarà più possibile:
Si prega di notare: Questa azione rimuoverà anche questo membro dalle connessioni e invierà una segnalazione all'amministratore del sito. Si prega di attendere qualche minuto per il completamento di questo processo.