Document Autoclassification: Source and Purpose

Before we start looking at classification techniques, a few concepts need to be defined.

Source data

The information we rely on to classify content automatically comes from three sources within a single document.

Purpose

There are several uses for our auto-classification efforts and your purpose in classifying documents will determine the appropriate method. These are the three most important for me.

First in the series

Next in the series