This article explains how to create a Content Profile for Cato's DLP service. This profile includes one or more of the DLP Data Types which you can use in an Application Control policy or SaaS Security API Data Protection policy.
Cato’s Data Loss Prevention (DLP) service helps you monitor and control sensitive information across your network. You can add DLP content profiles to a Data Control rule to detect or block sensitive data and prevent potential exfiltration. DLP can scan text-based content, include data embedded in images by using OCR-based inspection, and documents embedded in files.
Content Profiles can include predefined data types or custom data types, including User-Defined Data Types and Sensitivity Labels. For more about data types, see the following articles:
The DLP Content Profile is a global object for the Cato Management Application which includes one or more Data Types.
You can configure a Content Profile so the DLP engine includes image files and images embedded in files in content matching for the profile. The engine uses OCR to extract text that appears in image files and sends the extracted text for content matching. The OCR scanning option appears when configuring a Content Profile. OCR image scanning includes:
Low-resolution and blurred mobile images
Warped, rotated, or crumpled images
Images that contain text in two languages
Predefined ML Classifiers identify sensitive data, for example, a resume, in over a hundred languages. For more information, see Working with Predefined Data Types for DLP.
Use the DLP Configuration page to create and edit Content Profiles. When you are adding Data Types to a profile you can filter the types according to a specific country or Universal (for all countries). In addition, you can sort the Data Types in ascending or descending alphabetical order according the the category or name, or according to the country.
When you add multiple Data Types to a profile, select the relationship between them:
- Any (OR) - Match only one of the Data Types in the profile
- All (AND) - Match all the Data Types in the profile (otherwise, the rule with this profile is ignored)
A Data Control rule can contain up to 20 Data Types across all Content Profiles.
When you configure a Content Profile, optionally enable OCR scanning for the profile.
To create a DLP Content Profile:
- From the navigation menu, select Security > Data Types & Profiles, and in the DLP Profiles tab select Content Profile.
-
Click New.
The Add Content Profile panel opens.
- Create the profile and add the Data Types.
- Optionally, select OCR Scan Enabled for the profile.
- Click Apply and then click Save.
The Data Types page shows all the Data Types that you can add to a profile. This lets you research and understand more about specific Data Types that you are using in your organization. The catalog also shows the Threshold for each data type, indicating the minimum number of occurrences to activate the data type. For more about data type thresholds, see Working with Custom Data Types for DLP.
Files up to 50 MB are supported. The supported file types are listed below (Audio, video, and binary files are not supported).
- CSV files:
.csv - Excel Template:
.xlt, .xltx - Excel Workspace:
.xlw - Microsoft Access Database:
.mdb - Microsoft Excel:
.xls, .xlsx, .xlsm, .xlam, .xlsb, .slk, .xltm - Microsoft PowerPoint:
.ppt, .pps, .pot, .pptx, .ppsx, .pptm, .ppsm, .potx, .potm - Microsoft Word:
.doc, .docx, .docm, .dotx - MS Access Project:
.ade - ODF Documents:
.odt, .ods, .odp - ODF Presentation Template:
.otp - ODF Spreadsheet Template:
.ots - ODF Text Template:
.ott - Outlook Form Template:
.oft - Portable Document Format:
.pdf - Rich Text Format:
.rtf - SQL files:
.sql - Text files:
.txt - XPS files:
.xps - XML files:
.xml
For PNG and JPEG files, scanning is only supported for the Upload action
- Bitmap:
.bmp - BMP Uncompressed:
.bmp-uncompressed - JFIF files:
.jfif - JPEG files:
.jpeg, .jpg - PBM files:
.pbm - PGM files:
.pgm - PNG files:
.png - PNM files:
.pnm - PPM files:
.ppm - Progressive JPEG:
.pjpeg, .pjp - TIFF files:
.tiff, .tif - WebP files:
.webp
Images embedded in these files types are scanned. Up to 5 images are scanned per file, if a file contains more than 5 images, only the 5 largest images are scanned.
- Microsoft Excel:
.xls, .xlsx - Microsoft PowerPoint:
.ppt, .pptx - Microsoft Word:
.doc, .docx - Portable Document Format:
.pdf
- Bash scripts:
.sh - Basic source code:
.bas - Batch files:
.cmd, .bat - C, C++, and C# source files:
.c, .h, .cc, .hh, .cs, .cpp, .hpp - Go files:
.go - HTML files:
.html - Include files:
.inc - Java files:
.java, .jav, .j - JavaScript files:
.js - Make files:
.mak, .mk, .pmk - Matlab files:
.mat - Perl files:
.pl, .pm, .plf - Python files:
.py, .pyi, .pyc, .pyd, .pyo, .pyw, .pyz - Ruby files:
.rb - Scripts / config files:
.ini, .json
0 comments
Please sign in to leave a comment.