Source Code Detection

IDOL KeyView can classify documents by their binary file type (such as PDF, Word Document, JPEG Image, Windows Executable, and over 1000 others). If configured to do so, it can also identify source code files, recognizing over 200 programming languages.

Detection runs in one of two modes. In ‘basic’ mode, it detects source files in the 48 most common programming languages. ‘Full’ mode adds the remaining languages, many of them rare, and so should be used only if a sizable number of files in those rarer languages is expected.
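As a rough illustration of the choice between the two modes, the sketch below estimates the share of rare-language files in a sample of file names and suggests a mode. It is not part of the KeyView API. The function name `suggest_mode`, the extension sets, and the 5% threshold are all hypothetical, and `ASSUMED_BASIC_EXTENSIONS` is a placeholder subset, not the actual list of 48 ‘basic’ languages.

```python
from collections import Counter
from pathlib import PurePath

# Placeholder subset for illustration only: NOT KeyView's actual 'basic' list.
ASSUMED_BASIC_EXTENSIONS = {".c", ".cpp", ".java", ".py", ".js", ".go", ".rs"}


def suggest_mode(filenames, known_source_extensions, rare_threshold=0.05):
    """Suggest 'basic' or 'full' from the share of rare-language source files.

    known_source_extensions: extensions the caller treats as source code.
    rare_threshold: hypothetical cut-off for a "sizable number" of rare files;
    the documentation gives no figure, so this value is an assumption.
    """
    # Count files per lower-cased extension.
    exts = Counter(PurePath(name).suffix.lower() for name in filenames)

    # Total number of files whose extension is treated as source code.
    source_total = sum(
        n for ext, n in exts.items() if ext in known_source_extensions
    )
    if source_total == 0:
        return "basic"  # no source files seen, so nothing argues for 'full'

    # Source files whose extension is outside the assumed 'basic' set.
    rare = sum(
        n
        for ext, n in exts.items()
        if ext in known_source_extensions and ext not in ASSUMED_BASIC_EXTENSIONS
    )
    return "full" if rare / source_total >= rare_threshold else "basic"
```

For example, a sample made up only of `.py` and `.c` files would suggest ‘basic’, while a sample in which half the source files are `.cob` would suggest ‘full’. In practice, the threshold should be tuned to the collection being processed.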

NOTE: The detection is designed to locate files that largely or entirely consist of source code, not files in other formats, such as word processor formats, that happen to contain snippets of source code.