The dictionary defines cognition as the mental action or process of acquiring knowledge and understanding through thought, experience, and the senses. At Precognox, we follow this thread. Our systems think and learn through repeated analysis of large data repositories, and then present the results in an understandable format.
We confront huge stores of unstructured text data – often in several different languages. By unstructured, we mean written words or numbers in no particular or uniform format – such as doctors’ notes or comments on proposed new laws.
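Typically encountered stores of unstructured data include scientific articles, doctors’ notes, public comments on proposed laws, and social media posts such as Twitter entries.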
For example, anecdotal reports suggest that the tragic increase in autism may be related to Caesarean births. A researcher would need a cognitive system to work through all the scientific articles, doctors’ notes, and Twitter entries to find links between these seemingly unrelated topics.
Projects are almost always customized and include a consulting component. We assemble a specific solution from a toolbox of Precognox proprietary software and some open-source components.
In this process, the Precognox software learns and improves the results by recognizing entities such as companies, people, locations, and diseases. Human assistance can be valuable in identifying relationships, such as nicknames, which the system then uses to improve the results.
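To make the role of human-supplied nicknames concrete, here is a minimal sketch of dictionary-based entity recognition. It is not the Precognox software: the gazetteer contents and the function names `add_alias` and `find_entities` are assumptions chosen for illustration, and a production system would typically use statistical models rather than a fixed lookup table.

```python
import re
from collections import Counter

# Hypothetical gazetteer: canonical name -> (entity type, known aliases in lowercase).
GAZETTEER = {
    "International Business Machines": ("company", {"ibm"}),
    "New York City": ("location", {"nyc"}),
}

def add_alias(gazetteer, canonical, alias):
    """Record a human-supplied nickname for an existing entity."""
    gazetteer[canonical][1].add(alias.lower())

def find_entities(text, gazetteer):
    """Count whole-phrase mentions of each entity, under any of its names."""
    lowered = text.lower()
    counts = Counter()
    for canonical, (entity_type, aliases) in gazetteer.items():
        for name in aliases | {canonical.lower()}:
            pattern = r"\b" + re.escape(name) + r"\b"
            counts[(canonical, entity_type)] += len(re.findall(pattern, lowered))
    # Drop entities that were never mentioned.
    return {key: n for key, n in counts.items() if n}
```

In this sketch, a text that mentions "Big Blue" is only credited to the company after a person calls `add_alias(GAZETTEER, "International Business Machines", "Big Blue")`, which is the kind of improvement from human assistance described above.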
We take into account the type of information the client desires, and develop a heuristic approach that gives a good result with efficient use of computing resources. A heuristic approach to discovery is a practical method that is not guaranteed to be perfect, but is sufficient for the immediate goal. For example, in news articles one might predefine where to expect the “leader” section, and so avoid searching through all the text.
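The news-article heuristic can be sketched as follows. This is an illustration only, under the assumption that paragraphs are separated by blank lines and that the section of interest is the first paragraph; the function names `lead_section` and `mentions_topic` are hypothetical.

```python
def lead_section(article, max_paragraphs=1):
    """Return only the opening paragraph(s) of an article."""
    paragraphs = [p.strip() for p in article.split("\n\n") if p.strip()]
    return "\n\n".join(paragraphs[:max_paragraphs])

def mentions_topic(article, keywords, max_paragraphs=1):
    """Heuristic check: look for keywords in the opening section only.

    This is cheaper than scanning the full text, at the price of missing
    topics that first appear further down the article.
    """
    lead = lead_section(article, max_paragraphs).lower()
    return any(keyword.lower() in lead for keyword in keywords)
```

The trade-off is the one described above: the check may miss a topic buried in a later paragraph, but it reads only a fraction of each article.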
As an example of building structured information from unstructured text, we could analyze thousands of restaurant reviews for a city. For each review, we would extract the restaurant name, even if it was misspelled, and use sentiment and emotion analysis (see later) to determine whether the review was positive or negative, and whether its tone was angry or enthusiastic. Our software would then tabulate these results into an easy-to-read spreadsheet.
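A toy version of this pipeline might look like the sketch below. It is not the Precognox implementation: the restaurant names and word lists are invented for illustration, the misspelling tolerance comes from the standard library's `difflib.get_close_matches`, and the word-counting sentiment score is a deliberately crude stand-in for real sentiment analysis. Emotion analysis (angry versus enthusiastic) is left out.

```python
import csv
import difflib
import io
import re

# Hypothetical reference data, for illustration only.
KNOWN_RESTAURANTS = ["Blue Danube Bistro", "Golden Spoon"]
POSITIVE = {"great", "delicious", "friendly", "excellent"}
NEGATIVE = {"cold", "rude", "slow", "terrible"}

def match_restaurant(raw_name):
    """Map a possibly misspelled name to the closest known restaurant."""
    hits = difflib.get_close_matches(raw_name, KNOWN_RESTAURANTS, n=1, cutoff=0.6)
    return hits[0] if hits else None

def sentiment(text):
    """Label a review by counting positive and negative words."""
    words = re.findall(r"[a-z']+", text.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

def tabulate(reviews):
    """Turn (name, review text) pairs into CSV text, one row per review."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["restaurant", "sentiment"])
    for raw_name, text in reviews:
        writer.writerow([match_restaurant(raw_name) or "UNKNOWN", sentiment(text)])
    return buffer.getvalue()
```

Calling `tabulate([("Blue Danub Bistro", "The soup was cold and the waiter was rude.")])` produces CSV text that a spreadsheet program can open, with the misspelled name resolved to the known restaurant.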
Precognox never uses a “brute force” approach to data analysis. We refine and develop heuristics for each situation, and therefore economize on computing resources. However, cognitive systems do often require extensive computing. We optimize the time and cost through parallel processing and the Hadoop approach. Hadoop is an open-source framework for distributed storage and processing that runs on clusters of inexpensive hardware, with the expectation that there will be hardware failures. The Hadoop software detects and recovers from these expected failures, so that jobs complete reliably at low cost.
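The idea of splitting work into independent pieces, retrying a piece that fails, and merging the partial results can be sketched on a single machine. The example below does not use Hadoop or its API; it is a word count in the map/reduce style built on Python's standard `concurrent.futures`, and the names `map_chunk`, `run_with_retry`, and `map_reduce` are hypothetical. A thread pool keeps the sketch portable; a real deployment would distribute the chunks across many machines.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

def map_chunk(chunk):
    """Map step: count the words in one chunk of documents."""
    counts = Counter()
    for document in chunk:
        counts.update(document.lower().split())
    return counts

def run_with_retry(task, argument, attempts=3):
    """Re-run a failed task, loosely mimicking how a cluster reschedules work."""
    for attempt in range(attempts):
        try:
            return task(argument)
        except Exception:
            if attempt == attempts - 1:
                raise  # give up after the final attempt

def map_reduce(documents, chunk_size=2, workers=4):
    """Split the documents into chunks, count each in a worker, merge the counts."""
    chunks = [documents[i:i + chunk_size]
              for i in range(0, len(documents), chunk_size)]
    total = Counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = pool.map(lambda chunk: run_with_retry(map_chunk, chunk), chunks)
        for partial in partials:  # reduce step: merge the partial counts
            total.update(partial)
    return total
```

Because each chunk is counted independently, a failed chunk can be recomputed without redoing the rest, and the merged result is the same as counting all documents in one pass.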
By combining our consulting expertise and experience with our software toolkit, Precognox can bring order and insight to your large unstructured data stores.