Search

1 to 2 of 2
Sort by

Blog Entry
Technology Assisted Review, Concept Search and Predictive Coding: The Limitations and Risks

Several risk factors are listed here, but there are more depending on the specific machine learning technology that is used: technology that is based on Bayes classifiers (falsely) presumes statistical independence between measured features (e.g. word occurrences) and Latent Semantic Indexing (LSI) and its variants such as Probabilistic Latent Semantic Analysis (PLSA) effectively use a lossy information compression algorithm (SVD) that may result in more (irreversible) information loss than required

Johannes Scholtes's profile image