Asset Format & Rule Detection Guide

Modified on Fri, 17 Apr at 9:34 AM


OVERVIEW

Why your asset format matters for compliance reviews

Our platform uses AI to review every marketing asset for legal, regulatory and brand compliance. Before any rule can fire, the platform must first read and extract the content from your file, a step called content extraction. The way your file is structured directly affects how well this extraction works and therefore how reliably your compliance rules fire.



Key principle: A well-configured rule cannot detect something the extractor did not read correctly. Format changes are the most common cause of rules not firing as expected.




ASSET TYPES

How we read your assets 

Each file type is processed differently when uploaded to the platform or sent to IntelligenceBank via the Add-In. Understanding which method applies to your assets helps you anticipate where rule detection may be stronger or weaker.






PDF

IMAGE
PDF — text-based
Text is extracted directly from the PDF layer. Highest rule detection accuracy. Ensure your PDF is exported with a proper text layer, not flattened as an image.

Image / PNG / JPG
Text is read using OCR (optical character recognition). Accuracy depends on image resolution, font legibility, and contrast between text and background.






HTML

PPTX
Web page / HTML
Text is extracted from the page structure. Multi-column or complex layouts can affect the order in which content is read and matched against rules.

PowerPoint / PPTX
Slide text is extracted from text boxes. Images or diagrams embedded inside slides are processed via OCR and may have lower detection accuracy.






DOCX


Word / DOCX
Text is extracted from the document structure. Columns, tables, and floating text boxes affect reading order and can impact rule detection.








REAL-WORLD EXAMPLE

The layout change problem



What can happen: A dealer finance web page is set up as a two-column layout, with compliance rules trained and validated against that format. When the same campaign content is produced as a social image (PNG) or a PDF poster, the content extraction process is completely different and rules that fire reliably on the web page may not fire on the image.



Web page — 2 columns
Content is extracted column by column. Rules detect rate disclosures, comparison rates and end-date text correctly because the content is machine-readable.

Social image (PNG)
The same content must be read by OCR. If resolution is low or text overlaps a background image, OCR may miss the comparison rate disclosure  and the rule does not fire.



Web page — 3 columns (new layout)
Adding a third column changes the extraction order. Rules trained on the two-column output may not match the new structure.  Disclosures can appear in unexpected positions.

PDF poster
A text-layer PDF gives high accuracy. If the PDF was exported as a flattened image, it is treated as a PNG, and OCR is applied.  Detection accuracy drops significantly.




YOUR ACTION

Tell us when your asset format changes

Your compliance rules are calibrated to your specific asset types and layouts. When those change, rules may need to be reviewed or updated. Please notify your Customer Success Manager whenever any of the following occur:


  • You change the column layout of a webpage or document (e.g., from two columns to three)

  • You add a new asset type to your workflow (e.g., social images when you previously only produced web pages)

  • You change from a text-layer PDF to a flattened/image PDF, or vice versa

  • You change image dimensions, DPI, or resolution settings on image assets

  • You switch from one design template to another with different text placement

  • You add or remove disclaimer or disclosure sections from a standard layout




Good practice: Whenever you create a new campaign template or introduce a new asset format, share one example with your CSM before going live. A quick format review takes far less time than investigating missed rules after a campaign has launched.




WHEN IN DOUBT

What to do before uploading a new asset type


1

Identify the change

Note what is different — column count, file type, template design, or export settings.

2

Collect a sample

Gather at least three examples of the new asset type before reaching out to your CSM.

3

Notify your CSM

Share the samples and a brief description of what changed and which rules should apply.

4

Wait for confirmation

Do not use the new format in production until your CSM confirms rules have been reviewed.

5

Resume as normal

Once confirmed, upload freely.  Your rules now reflect the updated format.



Questions? Contact your Customer Success Manager. This guide should be reviewed whenever your standard production asset mix changes.



ADD-IN REVIEWS

What to expect from off-platform Add-In reviews

Add-In reviews are a great way to use your Marketing Compliance rules where you create the content, off the IntelligenceBank platform. . Risk reviews help authors refine their content before doing on-platform reviews and approvals. There are some nuances to keep in mind when using the Add-Ins.





POWERPOINT ADD-IN

WORD ADD-IN
PPT(X) Reviewed in PowerPoint
Text and images are reviewed, and OCR (optical character recognition) is applied to text embedded in images. Rules are annotated on a per-slide basis, and individual risk words are not directly highlighted.

DOC(X) Reviewed in Word
Document body text is reviewed. Headers, footers, images and text embedded in images are not reviewed. Rules are annotated in the document with risk word highlighting.






FIGMA

UNIVERSAL CONNECTOR ADD-IN, GENSTUDIO ADD-IN
FIG Files Reviewed in Figma
Selected Frames are converted to PDF for review.

Content Reviewed in Universal Connector

The user can review three types of content:


  • Upload files in the formats and sizes supported by the API. See the list here.

  • Selected text copy.

  • Web pages are automatically saved as PDFs for review.


Connected Apps generally send files for review, though some may use other content types (e.g., AEM sends pages or content fragment data for text review).







Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select at least one of the reasons
CAPTCHA verification is required.

Feedback sent

We appreciate your effort and will try to fix the article