Search

1 to 10 of 12
Sort by

Blog Entry
Dealing with Documents in other Languages than English

High-stake investigations and eDiscovery projects are not limited by national boundaries and no investigator can afford to miss relevant information because it is in a foreign language and the cost of translation is too high. Multi-lingual text collection hide more complexities than it...

Johannes Scholtes's profile image

Blog Entry
Automatic Fraud Triangle Analytics made possible with Text-Mining and Content Analytics

Economic crimes such as corruption and fraud are difficult to detect and prevent, but the financial and reputational consequences and the growing public and political demand for harsh action on corporates whose employees break the law are forcing companies to review their security and compliance...

Johannes Scholtes's profile image

Blog Entry
The Essential Role of Technology for Law Enforcement and Internal Investigations

Almost every crime leaves a proverbial “paper trail”, but when the probative information spans a vast volume of confiscated documents, electronic records, e-mail, wiretap transcripts, observation reports, cold cases, intelligence, information from telecom and internet providers,...

Johannes Scholtes's profile image

Blog Entry
The Impact of Incorrect Training Sets and Rolling Collections on Technology-Assisted Review (TAR) and Defensible Disposition

Last week, I participated in the DESI Workshop held as part of the International Conference on Artificial Intelligence in Law on June 14 in Rome, Italy. At the conference we submitted recent Technology Assisted Review (TAR) findings described in the paper “The Impact of Incorrect Training...

Johannes Scholtes's profile image


Blog Entry
Free report for download: How Content-Analytics can help Big-Data

The ongoing information explosion from the computer age gained significant momentum in the last decade (or so), finally reaching epic proportions and earning its own name: Big Data . The realities of Big Data encompass both Big Data challenges and opportunities. The challenges stem from the...

Johannes Scholtes's profile image

Blog Entry
Language is Not Just a Jumbled Bag of Words: Why Natural Language Processing Makes a Difference in Content Analytics

State-of-the art text analysis supports multiple languages, which is critical when investigations go global and involve collections of information in various languages. In such scenarios, the technology obviously adapts to differences in character sets and words, but the tools also need to...

Johannes Scholtes's profile image

Blog Entry
WikiLeaks Makes the Case for Intelligent and Automatic Redaction

The WikiLeaks scandal involving the release of 250,000 classified State Department cables has put automatic redaction software in the spotlight. Apart from this example of the unauthorized release of confidential information, there are many other use cases for data redaction and examples of...

Johannes Scholtes's profile image

Blog Entry
Handling Language Dependencies in eDiscovery, Records Management when using Content Analytics

Foreign language texts contain a lot of hidden information, making multilingual information extraction tools – and applications that allow cross-lingual information access – particularly useful. Only a few system developers offer their products for more than two or three languages....

Johannes Scholtes's profile image

Blog Entry
What is the Term ‘Early Case Assessment’ Really About?

In the last couple of months, I have seen many different definitions of Early Case Assessment (ECA). Some vendors mold the concept of Early Case Assessment to support their own offerings--which isn’t surprising, but it does cloud the issue. Some of the definitions have merit, but not all. There...

Johannes Scholtes's profile image