Direct biomolecule discrimination in mixed samples using nanogap
Scientific Reports volume 13, Article number: 9103 (2023) Cite this article
Metrics details
In single-molecule measurements, metal nanogap electrodes directly measure the current of a single molecule. This technique has been actively investigated as a new detection method for a variety of samples. Machine learning has been applied to analyze signals derived from single molecules to improve the identification accuracy. However, conventional identification methods have drawbacks, such as the requirement of data to be measured for each target molecule and the electronic structure variation of the nanogap electrode. In this study, we report a technique for identifying molecules based on single-molecule measurement data measured only in mixed sample solutions. Compared with conventional methods that require training classifiers on measurement data from individual samples, our proposed method successfully predicts the mixing ratio from the measurement data in mixed solutions. This demonstrates the possibility of identifying single molecules using only data from mixed solutions, without prior training. This method is anticipated to be particularly useful for the analysis of biological samples in which chemical separation methods are not applicable, thereby increasing the potential for single-molecule measurements to be widely adopted as an analytical technique.
The direct measurement of complex samples offers advantages such as time and cost savings by minimizing the sample preparation steps and sample loss, while also enabling the detection of a wide range of molecules. Single-molecule measurement is attracting attention as a novel molecular detection/quantification measurement method because in this method, a molecule between nanoelectrodes is directly measured1,2,3. In the break junction method4,5,6,7, a single-molecule electrical measurement method, a metal nanogap is formed by repeatedly breaking and forming junctions. A single molecule is detected by measuring the tunneling current that occurs when a molecule passes through the nanogap. Single-molecule measurements are being actively researched for the development of molecular devices2,8,9,10,11,12,13. Since Di Ventra's group theoretically proposed the potential for DNA and RNA sequencing, single-molecule measurements have received significant attention as an analytical method owing to their high throughput, low detection limit, and the ability to conduct measurements with no preprocessing steps3,14,15. To date, our group has reported conductance measurements of DNA and RNA nucleobases and demonstrated the applicability of single-molecule measurements as an analytical method16,17,18. The target molecules are not limited to DNA and RNA, and can be extended to various molecules such as amino acids19,20, peptides21,22, proteins23,24,25, neurotransmitters26, glucose27, and NADH28. Furthermore, the measurement targets are not limited to biomolecules. Single-molecule measurements are expected to have a wide range of applications; for example, the potential of detecting explosives29. Although the conductance of different molecules can be measured with single-molecule measurements, single-molecule conductance is highly variable30,31,32,33. Therefore, the statistical evaluation of single-molecule signals is essential for reliable molecular identification. Most typical conductance histograms-based analysis provides only statistical conductance information on single-molecule conductance. The overlapping of conductance histograms results in a low accuracy in single-molecule discrimination. The application of machine learning to single-molecule measurements is a promising method to address these issues. Machine learning-based analysis has improved the discrimination accuracy of single-molecule measurements26,34,35,36,37,38. However, conventional machine-learning approaches require training data obtained from solutions containing only one chemical species for every target molecule. Considering the application of single-molecule measurements for detecting biomolecules or specific targets, preparing a reference containing only one sample from a solution containing impurities for all molecules is occasionally difficult. However, preparing samples with varying concentrations of the target molecules in impure solutions can be comparatively easier. For example, by promoting or inhibiting the emission of the target in biological samples or adding a reference molecule in a sample solution. Even if a solution containing only a specific target molecule can be measured, the machine-learning classifier built with the training data may not be applicable to the samples because the measurement environment of the training data may be different from that of the sample. From these reasons, the development of a method for direct discrimination from mixed samples without single-species target samples, represents a significant advancement in the field of single-molecule measurements. The approach has significant potential in providing insights into the detection of biological molecules and other targets in complex samples. Herein, the aim of this study was the development of an analytical method for identifying molecules based only with mixed solutions. As shown in Fig 1, targeting dGMP and dTMP, which are already known to be identifiable by pure solution single-molecule measurements and conventional machine learning-based analysis, we developed a method to determine the concentration ratio of mixed solutions from their mixtures only.
Flow chart of single-molecule classification. For single-molecule current measurements, the sample solutions were injected into a PDMS well, and the chips were bent with a finely controlled push bar with a piezoelectric device to form a nanogap, after which the current was measured. The green box represents the conventional method, while the orange box represents the new concepts. The solid lines show the process for each individual sample, and the dashed lines show the process for the mixture.
The target molecules in this study are two DNA nucleotides, deoxyguanosine monophosphate (dGMP) and thymidine monophosphate (dTMP). These targets were selected as model systems for single-molecule signal identification using machine learning rather than for their applicability in identifying mixtures of two molecules. Nucleotides can be identified by single-molecule measurements and have been previously reported as target molecules in various studies15,17,36. Figure 2a,b show the molecular structures of dGMP and dTMP, respectively. As shown in Fig. 2c,d, a current pulse signal is generated when an individual molecule passes through the nanogap. Figure 2c,d show histograms of the maximum current (Ip) values. The average currents for dGMP and dTMP are 32 pA and 25 pA under a 100 mV bias voltage for dGMP and dTMP, respectively. dGMP exhibits a higher conductance than dTMP does because its HOMO level is closer to the Au Fermi level39, which is the conduction orbital for dGMP rather than for dTMP. Although the average conductance of the two molecules shows a clear difference, their Ip histograms exhibit an overlap. Both histograms exhibit low-current signals at 20 pA. The low-current signal was caused the single-molecule bridging structure between the nanogap. Electron transport via lower molecular orbital of ribose sugar cause lower current40. The large overlapping indicates that relying solely on histogram-based analysis methods that depend on Ip is insufficient for accurate discrimination and that the use of machine learning is necessary.
Results of dGMP and dTMP single-molecule measurements. (a), (b) Molecular structure of dGMP and dTMP, respectively. (c), (d) Three individual current pulses of dGMP and dTMP. (e), (f) Histograms of the maximum current (Ip) for dGMP and dTMP, respectively. Each current is measured under a bias of 100 mV.
As a comparison to the proposed method, the mixing ratio of the mixture was predicted using a conventional machine-learning-based classification method. In the conventional method, the machine-learning classifier is first trained from the single-molecule current signals obtained from measurements of each single-target solution with the label of molecular names. The machine-learning classifier then identifies the current signals obtained from the mixture based on the learned characteristics of each molecular signal. Finally, each predicted molecular label of the mixed solution data is counted, and the concentration ratio is determined as the ratio of the number of signals for each molecule. Fig. 3a shows the validation process of the machine-learning classifier training. The machine-learning validation process consists of mechanically controllable break junction (MCBJ) measurement, signal extraction, feature extraction, training, and identification. In this study, 13-dimensional vectors consisting of Ip, duration time (td), and the 10-dimensional normalized current factor, which were used in previously reported methods, are used as features20,26,35,36. The 10-dimensional normalized current factors are defined as the average current value normalized by the maximum current value of each of the 10-time sections, as shown in Fig. 3b. A 10-fold cross-validation (CV) method was used for verification, training, and prediction as shown in Fig. S1 in Supplementary information. In 10-fold CV, all data are divided into ten subsets, and one subset is used as the testing data, whereas the identification is trained by the other subsets in a 10-time loop to ensure that all data are tested once. The validation results for the two molecules measured in pure solutions are presented in the confusion matrix shown in Fig. 3c. The F-measure, a performance index of classification, is 0.78. This approach demonstrates the identifiability of a machine-learning classifier trained on data measured from solutions containing only a single chemical species. To confirm the discriminative ability of the classifier, the mixing ratio of the target was predicted using a machine-learning classifier that learned the current signal of each molecule in the previous step. Figure 4a,b show the histograms of Ip measured in the two mixtures dGMP:dTMP = 3:1 and dGMP:dTMP = 1:3, respectively. The dGMP:dTMP = 3:1 solution, which contains more of the more-conductive dGMP, shows higher conductance than the dGMP:dTMP = 1:3 solution, which contains more of the less-conductive dTMP. Figure 4c shows the process of identifying the current signals obtained in the mixture using the machine-learning classifier trained from the current signals of each target in the previous step to predict the mixture ratio. Using this process, the machine-learning classifier predicted mixing ratios of 1.7:1 and 1:1.6 for the signals obtained from the dGMP:dTMP = 3:1 and dGMP:dTMP = 1:3 solutions, respectively, as shown in Fig. 4d. As shown in Fig. 3c, the identification accuracy of each nucleotide varies individually, which can result in an underestimation of the prediction ratio of abundant nucleotides.
Conventional machine-learning training methods and identification results. (a) Training process of machine learning for pure solutions using the conventional method. Features includes factors such as peak current (Ip), duration (td), average current (Iavg.), and 10-dimensional normalized current for each pulse signal. (b) Single-molecule individual current pulse (blue solid line) and definitions of the features. The black dashed lines show the area of the current pulse divided into ten parts along the time axis. The average current values (red dashed lines) of each portion divided from I1 to I10 are 13.2, 38.3, 38.0, 43.5, 35.4, 44.1, 42.0, 34.3, 39.0, and 30.8 pA, respectively. Si means Ii normalized with respect to Ip. The green, red, and pink solid lines represent Ip, td, and Iavg., respectively. (c) Confusion matrix of dGMP and dTMP predictions in pure solutions.
Process and results of predicting the mixing ratio of the target using the classifier trained on the current signal of the molecule in the previous step. (a), (b) Ip histograms measured in two mixtures, dGMP:dTMP = 3:1 and dGMP:dTMP = 1:3, respectively. (c) The process of identifying the current signal of mixtures using the machine-learning classifier trained on the current signal of each target to predict the mixing ratio. (d) The results of predicting the mixing ratio of mixtures based on trained data.
The main goal of this research is to develop a method to distinguish between the two molecules from the data measured using only mixed solutions. The relationship between the concentrations of the two mixtures, that is, solutions containing more dGMP or dTMP, is known. The measurement and identification processes for this new concept are illustrated in Fig. 5a. The discriminative boundaries of the two molecules were estimated directly from the data obtained from the two mixtures with unlabeled data and unlabeled data classification (UUC) based on kernel density estimation (KDE)41. Fig. 5b shows a conceptual diagram of UUC, a method for determining discriminant boundaries from data in which the two classes are mixed in different concentrations. In Fig. 5b, the blue and red colors represent two types of mixtures. Both solutions contain different concentrations of the two classes. The classes are unknown in advance. The purpose of UUC is to distinguish between these two classes based on which class is more abundant in the solution. KDE is a non-parametric statistical technique used to estimate the probability density function in a feature space directly from observed data, as shown in Fig. 5c. Intuitively, KDE calculates the probability density by adding the Gaussian kernels obtained from each observed data point, similar to the manner in which a histogram is created by adding data points. This method can obtain a smooth probability density distribution with fewer data than that of a histogram. In this study, the Gaussian kernel was centered on the observed data points. In the UUC method used in this study, the probability density distributions of the two classes were determined by KDE through correction. This method is proposed for a situation in which one of the data points contains only positive classes. However, because the proposed method is based on the principle that regions of higher concentration exhibit higher probability densities, it can also be applied to two unlabeled data mixtures with known concentration relationships. For comparison with the conventional method, identification was performed with the same features extracted from the same dataset as that described in the previous section. The UUC machine learning classifier was trained using only the signals from the mixtures and predicted the molecules, and the results are presented in Fig. 5d. The ratios of signals corresponding to 3:1 and 1:3 ratios of dGMP:dTMP were predicted to be 3.2:1 and 1:3.5, respectively. The performance of the new identification method proposed in this study is compared with that of conventional methods, as shown in Fig. 5e. The electronic structures of the electrodes affect single-molecule conductance. Electronic structure variation due to molecular adsorption on the electrode surface or different geometries of the electrodes may affect single-molecule signals42,43,44,45. A wide variety of machine learning methods have been developed in recent years. Unsupervised learning is applicable to the identification of data without explicit labels, as is supervised learning. This method has been applied to the discrimination of I–z traces of single-molecule measurements34. However, conventional unsupervised learning methods cannot adequately identify the experimental data from the two solutions, as shown in SI.5. The new UUC method can discriminate between two molecules by measuring only the mixtures. The method is assumed to prevent the propagation of errors owing to environmental changes and cause higher accuracy discrimination than conventional methods. Figure 5f shows the current profile of the dGMP:dTMP = 3:1 solution with the molecular prediction results obtained by the UUC method. The red and blue signals denote dGMP- and dTMP-derived signals, respectively. The signals obtained from the mixtures can be discriminated individually.
(a) Process of training and identifying with data from mixtures only. (b) Schematic image of UUC. The red and blue colors represent two types of mixtures with different concentrations of the two classes. The circles and triangles represent each class. The UUC method determines the orange curve, which represents the boundary between two classes. (c) Schematic image of the KDE for estimating the probability density function in the feature space. The red and blue dots and dashed lines indicate the data points and their Gaussian kernel, respectively. The solid curves represent the sum of the dashed lines, which represents the kernel density estimate. (d) The result of predicting the mixing ratio of two mixtures with data trained on the mixture only. (e) Comparison of the performance of the new and old methods with respect to the prediction ratio. (f) The current profile resulting from identifying the signal of each single molecule individually (in dGMP:dTMP = 3:1 solution). (g), (h) Ip histograms based on the identification results of the dGMP:dTMP = 3:1 and dGMP:dTMP = 1:3 solutions, respectively. The red and blue bars represent the histograms predicted as dGMP and dTMP, respectively. The solid lines represent the sum of the two histograms.
In the previous section, the conductance histograms of individual nucleotides (Fig. 2) showed that dGMP has a higher conductance. Focusing on the individual signals identified, the dGMP signal does not always show a higher conductance than the dTMP signal. Machine-learning algorithms can differentiate between signals based on both the conductance and signal shape. This is because the current histograms of the identified results are statistically analyzed. The Ip histograms of the identified results of the signals obtained from dGMP:dTMP = 3:1 and dGMP:dTMP = 1:3 solutions are shown in Fig. 5g,h, respectively. The red and blue bars represent histograms predicted as dGMP and dTMP, respectively. The histograms confirm that the UUC method can predict mixing ratios and that dGMP has a higher conductance than dTMP. This agrees with the results of the pure-solution measurement. Notably, this new method enables the determination of concentration ratios using only two mixture solutions of unknown concentrations. This technique is assumed to be applicable to molecular detection methods. For example, this technique can be applied to determine the concentration ratio of a molecule in a biological sample containing a foreign material by comparing it to a normal sample and a positive/negative sample with a control that promotes or inhibits the molecule of interest or by measuring the concentration of the molecule of interest in a sample of unknown concentration and a sample to which a reference sample is added. The concentration ratio of the molecule of interest can also be determined from positive/negative samples of the molecule of interest with a control that promotes or inhibits the molecule of interest.
In this study, we developed a new method to identify molecules using single-molecule measurement of only mixed solutions and a discrimination method for two types of unlabeled data using kernel density estimation. Compared to the traditional method, our approach showed improved accuracy in predicting the composition of mixed solutions. The technique developed in this study for identifying target molecules in mixed solutions without individual sample training is expected to have broad applications for various molecules in the field of single-molecule measurement.
Deoxyguanosine monophosphate (dGMP, Sigma-Aldrich) and deoxythymidine monophosphate (dGTP, Sigma-Aldrich) were diluted in Milli-Q water without any further purification process. The concentration of each solution of dGMP and dTMP used in the measurement was 10 μM. Measurements of dGMP:dTMP = 3:1 used the mixture of 750 μM dGMP and 250 μM dTMP, and measurements of dGMP:dTMP = 1:3 used the solution of 250 μM dGMP and 750 μM dTMP. Polydimethylsiloxane (PDMS) wells were fabricated and treated with an oxygen plasma for 10 s, attached to the MCBJ nanogap electrode device, and treated in the vacuum oven at 90 °C for 60 min.
The MCBJ technique was applied to form gold nanogaps. The gold wires were deposited on the flexible silicon substrate. First, polyimide thin-film was formed as an insulating layer on the silicon substrate. Tens of nanometer-wide patterns were fabricated using electron beam lithography, and the gold wires were deposited on the patterns using plasma-enhanced chemical vapor deposition. Finally, the polyimide layer was dry etched to form the gold wire bridge. The gold wire substrate was installed in the MCBJ system and the current change was monitored until the wires were mechanically broken due to repeated bending by three-point bending and a sharp current drop appeared. During this process, the current was measured using the piezoelectric device to precisely control the gap width in real time and fine-tune the piezo-adjusted pushing rod.
The solutions were injected into PDMS well attached to the MCBJ device. A voltage of 100 mV was applied to the solution electrode for 5 min. Before every individual measurement, a control experiment was performed by injecting only Milli-Q water. The interelectrode distance d of the nanogap was fixed at 0.58, 0.56, and 0.54 nm by the MCBJ technique.
Each of the 830 pulse signals was trained and classified with supervised machine learning of the Random Forest (RF) classifier in scikit-learn version 0.24.246. In validation process, the 10-fold CV was performed and its average and standard deviation values provided the classification ratios and errors. The errors are standard deviation of 10-time classification. In mixed solution analysis, the RF supervised machine learning classifier was trained with 1000 dGMP and dTMP signals each. Signals with Ip > 20 pA and td > 1 ms were analyzed. The signals from the mixtures were classified one by one with the trained classifier. The analysis was performed using Python 3.10.4. UUC and weighted KDE source codes were prepared by ourselves using Python 3.10.4. The 1000 signals and features from mixtures are same to conventional methods. Gaussian kernel was adopted. The bandwidth is determined by Silverman's rule41.
The data that support the findings of this study are available from the corresponding author upon reasonable request. Correspondence and requests for materials should be addressed to M.T.
Li, Y., Yang, C. & Guo, X. Single-molecule electrical detection: A promising route toward the fundamental limits of chemistry and life science. Acc. Chem. Res. 53, 159–169 (2020).
Article CAS PubMed Google Scholar
Xie, X. et al. Single-molecule junction: A reliable platform for monitoring molecular physical and chemical processes. ACS Nano 16, 3476–3505 (2022).
Article CAS PubMed Google Scholar
Di Ventra, M. & Taniguchi, M. Decoding DNA, RNA and peptides with quantum tunnelling. Nat. Nanotechnol. 11, 117–126 (2016).
Article ADS PubMed Google Scholar
Martin, C. A., Ding, D., Van Der Zant, H. S. J. & Van Ruitenbeek, J. M. Lithographic mechanical break junctions for single-molecule measurements in vacuum: Possibilities and limitations. New J. Phys. 10, 065008 (2008).
Article ADS Google Scholar
Reed, M. A., Zhou, C., Muller, C. J., Burgin, T. P. & Tour, J. M. Conductance of a molecular junction. Science 278, 1–3 (1997).
Article Google Scholar
Krans, J. M. et al. One-atom point contacts. Phys. Rev. B 48, 14721–14724 (1993).
Article ADS CAS Google Scholar
Agraït, N., Yeyati, A. L. & van Ruitenbeek, J. M. Quantum properties of atomic-sized conductors. Phys. Rep. 377, 81–279 (2003).
Article ADS Google Scholar
Evers, F., Korytár, R., Tewari, S. & Van Ruitenbeek, J. M. Advances and challenges in single-molecule electron transport. Rev. Mod. Phys. 92, 35001 (2020).
Article CAS Google Scholar
Bai, J., Li, X., Zhu, Z., Zheng, Y. & Hong, W. Single-molecule electrochemical transistors. Adv. Mater. 33, 1–20 (2021).
Article Google Scholar
Aradhya, S. V. & Venkataraman, L. Single-molecule junctions beyond electronic transport. Nat. Nanotechnol. 8, 399–410 (2013).
Article ADS CAS PubMed Google Scholar
Huang, C., Rudnev, A. V., Hong, W. & Wandlowski, T. Break junction under electrochemical gating: Testbed for single-molecule electronics. Chem. Soc. Rev. 44, 889–901 (2015).
Article CAS PubMed Google Scholar
Su, T. A., Neupane, M., Steigerwald, M. L., Venkataraman, L. & Nuckolls, C. Chemical principles of single-molecule electronics. Nat. Rev. Mater. https://doi.org/10.1038/natrevmats.2016.2 (2016).
Article Google Scholar
Song, H., Reed, M. A. & Lee, T. Single molecule electronic devices. Adv. Mater. 23, 1583–1608 (2011).
Article CAS PubMed Google Scholar
Zwolak, M. & Di Ventra, M. Colloquium: Physical approaches to DNA sequencing and detection. Rev. Mod. Phys. 80, 141–165 (2008).
Article ADS Google Scholar
Zwolak, M. & Di Ventra, M. Electronic signature of DNA nucleotides via transverse transport. Nano Lett. 5, 421–424 (2005).
Article ADS CAS PubMed Google Scholar
Ohshiro, T. et al. Direct observation of DNA alterations induced by a DNA disruptor. Sci. Rep. 12, 1–9 (2022).
Article Google Scholar
Tsutsui, M., Taniguchi, M., Yokota, K. & Kawai, T. Identifying single nucleotides by tunnelling current. Nat. Nanotechnol. 5, 286–290 (2010).
Article ADS CAS PubMed Google Scholar
Ohshiro, T. et al. Single-molecule RNA sequencing for simultaneous detection of m6A and 5mC. Sci. Rep. 11, 1–10 (2021).
Article Google Scholar
Zhao, Y. et al. Single-molecule spectroscopy of amino acids and peptides by recognition tunnelling. Nat. Nanotechnol. 9, 466–473 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Ryu, J., Komoto, Y., Ohshiro, T. & Taniguchi, M. Single‐molecule classification of aspartic acid and leucine by molecular recognition through hydrogen bonding and time‐series analysis. Chem. Asian J. 17, e202200179 (2022).
Google Scholar
Ohshiro, T. et al. Detection of post-translational modifications in single peptides using electron tunnelling currents. Nat. Nanotechnol. 9, 835–840 (2014).
Article ADS CAS PubMed Google Scholar
Hihath, J. & Tao, N. Rapid measurement of single-molecule conductance. Nanotechnology 19, 265204 (2008).
Article ADS PubMed Google Scholar
Zhang, B. et al. Observation of giant conductance fluctuations in a protein. Nano Futur. 1, 035002 (2017).
Article ADS Google Scholar
Zhang, B. et al. Role of contacts in long-range protein conductance. Proc. Natl. Acad. Sci. U. S. A. 116, 5886–5891 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Ruiz, M. P. et al. Bioengineering a single-protein junction. J. Am. Chem. Soc. 139, 15337–15346 (2017).
Article CAS PubMed Google Scholar
Komoto, Y. et al. Time-resolved neurotransmitter detection in mouse brain tissue using an artificial intelligence-nanogap. Sci. Rep. 10, 1–7 (2020).
Article Google Scholar
Nishino, T., Shiigi, H., Kiguchi, M. & Nagaoka, T. Specific single-molecule detection of glucose in a supramolecularly designed tunnel junction. Chem. Commun. 53, 5212–5215 (2017).
Article CAS Google Scholar
Hu, Y. et al. Determination of Ag[I] and NADH using single-molecule conductance ratiometric probes. ACS Sensors 6, 461–469 (2021).
Article CAS PubMed Google Scholar
Yu, P. et al. Single-molecule tunneling sensors for nitrobenzene explosives. Anal. Chem. 94, 12042–12050 (2022).
Article CAS PubMed Google Scholar
Zhu, Z. et al. Single-molecule conductance variations of up to four orders of magnitude via contacting electrodes with different anchoring sites. J. Mater. Chem. C 9, 16192–16198 (2021).
Article CAS Google Scholar
Chen, F., Hihath, J., Huang, Z., Li, X. & Tao, N. J. Measurement of single-molecule conductance. Annu. Rev. Phys. Chem. 58, 535–564 (2007).
Article ADS CAS PubMed Google Scholar
Stefani, D. et al. Large conductance variations in a mechanosensitive single-molecule junction. Nano Lett. 18, 5981–5988 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, H. et al. Large variations in the single-molecule conductance of cyclic and bicyclic silanes. J. Am. Chem. Soc. 140, 15080–15088 (2018).
Article CAS PubMed Google Scholar
Huang, F. et al. Automatic classification of single-molecule charge transport data with an unsupervised machine-learning algorithm. Phys. Chem. Chem. Phys. 22, 1674–1681 (2020).
Article CAS PubMed Google Scholar
Komoto, Y., Ohshiro, T. & Taniguchi, M. Detection of an alcohol-associated cancer marker by single-molecule quantum sequencing. Chem. Commun. 56, 14299–14302 (2020).
Article CAS Google Scholar
Taniguchi, M. et al. High-precision single-molecule identification based on single-molecule information within a noisy matrix. J. Phys. Chem. C 123, 15867–15873 (2019).
Article CAS Google Scholar
Bro-Jørgensen, W., Hamill, J. M., Bro, R. & Solomon, G. C. Trusting our machines: Validating machine learning models for single-molecule transport experiments. Chem. Soc. Rev. 51, 6875–6892 (2022).
Article PubMed PubMed Central Google Scholar
Magyarkuti, A., Balogh, N., Balogh, Z., Venkataraman, L. & Halbritter, A. Unsupervised feature recognition in single-molecule break junction data. Nanoscale 12, 8355–8363 (2020).
Article CAS PubMed Google Scholar
Ohshiro, T. et al. Single-molecule electrical random resequencing of DNA and RNA. Sci. Rep. https://doi.org/10.1038/srep00501 (2012).
Article PubMed PubMed Central Google Scholar
Furuhata, T. et al. Highly conductive nucleotide analogue facilitates base-calling in quantum-tunneling-based DNA sequencing. ACS Nano 13, 5028–5035 (2019).
Article CAS PubMed Google Scholar
Yoshida, T., Washio, T., Ohshiro, T. & Taniguchi, M. Classification from positive and unlabeled data based on likelihood invariance for measurement. Intell. Data Anal. 25, 57–79 (2021).
Article Google Scholar
Kaneko, S. et al. Identifying the molecular adsorption site of a single molecule junction through combined Raman and conductance studies. Chem. Sci. 10, 6261–6269 (2019).
Article CAS PubMed PubMed Central Google Scholar
Bekyarova, E. et al. Electronic properties of single-walled carbon nanotube networks. J. Am. Chem. Soc. 127, 5990–5995 (2005).
Article CAS PubMed Google Scholar
Li, C. et al. Charge transport in single Au|alkanedithiol|Au junctions: Coordination geometries and conformational degrees of freedom. J. Am. Chem. Soc. 130, 318–326 (2008).
Article CAS PubMed Google Scholar
Bamberger, N. D. et al. Beyond simple structure-function relationships: The interplay of geometry, electronic structure, and molecule/electrode coupling in single-molecule junctions. J. Phys. Chem. C 126, 6653–6661 (2022).
Article CAS Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Download references
This work was supported by Japan Society for the Promotion of Science (JSPS) KAKENHI Grant Numbers 19H00852, 21H01741, 22K14566 and Japan Science and Technology Agency (JST) Core Research for Evolutional Science and Technology (CREST) Grant Number JPMJCR1666 and JST Support for Pioneering Research Initiated by the Next Generation (SPRING) Grant Number JPMJSP2138, Japan. We would like to thank Editage (www.editage.com) for English language editing.
SANKEN, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka, 567-0047, Japan
Jiho Ryu, Yuki Komoto, Takahito Ohshiro & Masateru Taniguchi
Artificial Intelligence Research Center, Osaka University, Ibaraki, Osaka, 567-0047, Japan
Yuki Komoto
Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiative (OTRI), Osaka University, Ibaraki, Osaka, 567-0047, Japan
Yuki Komoto
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
J.R., Y.K., T.O., and M.T. planned and designed the experiments. J.R., Y.K. and T.O. participated in the fabrication of MCBJs and single-molecule electrical measurements. J.R. and Y.K. performed data analysis. J.R., Y.K., T.O., and M.T. co-wrote the paper.
Correspondence to Masateru Taniguchi.
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Reprints and Permissions
Ryu, J., Komoto, Y., Ohshiro, T. et al. Direct biomolecule discrimination in mixed samples using nanogap-based single-molecule electrical measurement. Sci Rep 13, 9103 (2023). https://doi.org/10.1038/s41598-023-35724-1
Download citation
Received: 27 March 2023
Accepted: 23 May 2023
Published: 05 June 2023
DOI: https://doi.org/10.1038/s41598-023-35724-1
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.