343
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Detecting novel associations in large data sets.

      Science (New York, N.Y.)
      Algorithms, Animals, Baseball, statistics & numerical data, Data Interpretation, Statistical, Female, Gene Expression, Genes, Fungal, Genomics, methods, Humans, Intestines, microbiology, Male, Metagenome, Mice, Obesity, Saccharomyces cerevisiae, genetics

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Identifying interesting relationships between pairs of variables in large data sets is increasingly important. Here, we present a measure of dependence for two-variable relationships: the maximal information coefficient (MIC). MIC captures a wide range of associations both functional and not, and for functional relationships provides a score that roughly equals the coefficient of determination (R(2)) of the data relative to the regression function. MIC belongs to a larger class of maximal information-based nonparametric exploration (MINE) statistics for identifying and classifying relationships. We apply MIC and MINE to data sets in global health, gene expression, major-league baseball, and the human gut microbiota and identify known and novel relationships.

          Related collections

          Most cited references16

          • Record: found
          • Abstract: not found
          • Article: not found

          Testing for nonlinearity in time series: the method of surrogate data

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            How does multiple testing correction work?

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Consistent Nonparametric Regression

                Bookmark

                Author and article information

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content147

                Cited by638

                Most referenced authors498