Skip to main content Skip to footer

Team & Thesis

Why Visonic AI exists

Visonic AI is being built by a team spanning machine learning, product, accessibility, and commercial delivery.

The company thesis is straightforward: long-form video understanding is the missing layer between generic multimodal AI demos and production-grade media workflows. We started with audio description because it is both commercially urgent and technically demanding.

What the team is building and why the problem matters

Serious long-form video understanding

Short clips and isolated captions are not enough for real media workflows. Audio description requires narrative context, scene awareness, character continuity, and timing judgment across full-length content.

A team that spans technology and deployment

The founding team combines research and engineering depth with product, accessibility, and customer-facing experience. That mix is essential when the job is not just to build models, but to build a workflow the market can actually use.

Starting where the market pain is sharpest

Audio description sits at the intersection of accessibility, compliance, and media operations. Solving it well creates immediate value for buyers and a strong technical foundation for broader video-intelligence products.

Meet the team

Ari Surana

Ari Surana

Founder and Technology Lead


Ari is a seasoned technologist and engineer with a proven track record of building high-performance machine learning systems across enterprise software, ecommerce, media, and geospatial intelligence.


Before founding Visonic AI, he led AI application work at Atlassian, BigCommerce, and Nearmap, with a particular focus on large datasets, robust AI infrastructure, and production-grade computer vision systems.


Aditi Raheja

Aditi Raheja

Co-Founder and Product Lead


Aditi has more than 15 years of experience across product marketing, customer engagement, and operational leadership in both the private and public sector.


She has delivered more than $800M in product revenue and focuses on translating technical capability into clear market value for buyers, operators, and accessibility stakeholders.


Daniel Suess

Daniel Suess

Co-Founder and R&D Lead


Daniel has spent years building AI products at leading Australian companies, with hands-on experience taking computer vision systems from research to deployed products.


His work spans television, retail, and asset management, and he brings deep technical judgment to long-form video understanding, model evaluation, and multimodal system design.


Virendra Raheja

Virendra Raheja

Co-Founder and Customer Success Lead


Viren has more than 47 years of experience across Australia, the United States, Europe, India, and the Middle East, with a long track record of turning complex technology into commercially viable products.


His background in accessibility technology, product leadership, and customer-facing delivery grounds Visonic AI in practical deployment, trust, and workflow fit.


Questions buyers and partners ask about the team

This page carries the same answer-first pattern as the commercial pages so Visonic's expertise is visible to both readers and search systems.

Who is behind Visonic AI?

Visonic AI is built by a founding team spanning machine learning, computer vision, product leadership, accessibility technology, enterprise software, and customer delivery. That mix matters because serious audio description requires both frontier technical work and a clear understanding of how media operations actually run.

Why does the team have credibility in AI audio description and long-form video understanding?

The team combines experience from large-scale AI and software environments with direct accessibility-market knowledge. That gives Visonic AI a more credible foundation than teams approaching audio description as a narrow workflow add-on without deep video-understanding or deployment experience.

Why did Visonic AI start with audio description?

Audio description is a technically demanding problem with immediate commercial and social value. It forces a system to handle narrative context, timing, character awareness, and delivery constraints, which makes it a strong proving ground for long-form video intelligence.

Is Visonic AI only an accessibility story?

Accessibility is the first major workflow and a strategically important one, but the company thesis is broader. Visonic AI is building production-grade long-form video understanding that can support multiple downstream media workflows over time.

What kinds of experience sit behind the Visonic AI team?

The team spans machine learning, computer vision, product leadership, enterprise software, accessibility workflows, and customer-facing operational delivery. That matters because market-leading output requires both strong models and a strong understanding of how the work is actually deployed.

What is the team thesis on where this market is going?

The thesis is that the market will move away from lightly assisted human-only workflows and toward platform-first systems that understand long-form video deeply enough to automate serious production tasks. Audio description is the first proving ground, not the final destination.

See how the team thesis shows up in the product

The strongest proof point is not abstract vision language. It is whether the workflow solves a real production, compliance, or accessibility problem on actual media content.