3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: not found
      • Article: not found

      ChatGPT Chemistry Assistant for Text Mining and the Prediction of MOF Synthesis

      Read this article at

      ScienceOpenPublisherPubMed
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Related collections

          Most cited references55

          • Record: found
          • Abstract: not found
          • Article: not found

          Revised effective ionic radii and systematic studies of interatomic distances in halides and chalcogenides

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Random forest: a classification and regression tool for compound classification and QSAR modeling.

            A new classification and regression tool, Random Forest, is introduced and investigated for predicting a compound's quantitative or categorical biological activity based on a quantitative description of the compound's molecular structure. Random Forest is an ensemble of unpruned classification or regression trees created by using bootstrap samples of the training data and random feature selection in tree induction. Prediction is made by aggregating (majority vote or averaging) the predictions of the ensemble. We built predictive models for six cheminformatics data sets. Our analysis demonstrates that Random Forest is a powerful tool capable of delivering performance that is among the most accurate methods to date. We also present three additional features of Random Forest: built-in performance assessment, a measure of relative importance of descriptors, and a measure of compound similarity that is weighted by the relative importance of descriptors. It is the combination of relatively high prediction accuracy and its collection of desired features that makes Random Forest uniquely suited for modeling in cheminformatics.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules

              We report a method to convert discrete representations of molecules to and from a multidimensional continuous representation. This model allows us to generate new molecules for efficient exploration and optimization through open-ended spaces of chemical compounds. A deep neural network was trained on hundreds of thousands of existing chemical structures to construct three coupled functions: an encoder, a decoder, and a predictor. The encoder converts the discrete representation of a molecule into a real-valued continuous vector, and the decoder converts these continuous vectors back to discrete molecular representations. The predictor estimates chemical properties from the latent continuous vector representation of the molecule. Continuous representations of molecules allow us to automatically generate novel chemical structures by performing simple operations in the latent space, such as decoding random vectors, perturbing known chemical structures, or interpolating between molecules. Continuous representations also allow the use of powerful gradient-based optimization to efficiently guide the search for optimized functional compounds. We demonstrate our method in the domain of drug-like molecules and also in a set of molecules with fewer that nine heavy atoms.
                Bookmark

                Author and article information

                Contributors
                Journal
                Journal of the American Chemical Society
                J. Am. Chem. Soc.
                American Chemical Society (ACS)
                0002-7863
                1520-5126
                August 16 2023
                August 07 2023
                August 16 2023
                : 145
                : 32
                : 18048-18062
                Affiliations
                [1 ]Department of Chemistry, University of California, Berkeley, California 94720, United States
                [2 ]Kavli Energy Nanoscience Institute, University of California, Berkeley, California 94720, United States
                [3 ]Bakar Institute of Digital Materials for the Planet, College of Computing, Data Science, and Society, University of California, Berkeley, California 94720, United States
                [4 ]Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, California 94720, United States
                [5 ]Department of Mathematics, University of California, Berkeley, California 94720, United States
                [6 ]Department of Statistics, University of California, Berkeley, California 94720, United States
                [7 ]School of Information, University of California, Berkeley, California 94720, United States
                [8 ]KACST−UC Berkeley Center of Excellence for Nanomaterials for Clean Energy Applications, King Abdulaziz City for Science and Technology, Riyadh 11442, Saudi Arabia
                Article
                10.1021/jacs.3c05819
                37548379
                c324b684-0b2a-4e4e-9fc6-24f9bc2a7385
                © 2023

                https://doi.org/10.15223/policy-029

                https://doi.org/10.15223/policy-037

                https://doi.org/10.15223/policy-045

                History

                Comments

                Comment on this article