
R&D Performance


Publications

Development of an embedded molecular structure-based model for prediction of micropollutant treatability in a drinking water treatment plant by machine learning from three years monitoring data
Journal: ELSEVIER | Author: 최재원 | Publication date: 2023-05-02
This study developed an embedded molecular structure-based model for predicting the treatability of micropollutants in a drinking water treatment plant (DWTP) by machine learning, using three years of monitoring data for 69 micropollutants at 18 DWTPs. The molecular structure, which carries the physicochemical characteristics of a compound, was embedded as a fixed-length vector, a form well suited to data-driven analysis and machine learning. First, the molecular structure of each micropollutant was converted to a sequence of tokens using the simplified molecular-input line-entry system (SMILES) pair encoding tokenizer, a frequency-based tokenization method. The token sequence was then compressed into a fixed-length vector using an autoencoder trained on a wide range of molecular structures from the Chemical Entities of Biological Interest (ChEBI) database. To validate the proposed model, a machine learning binary classification of micropollutant treatability was performed using the embedded molecular structure together with external features such as concentration, season, and the presence of specific drinking water treatment processes. The accuracy of the developed model for the 69 micropollutants in this study was 0.86, and the molecular structure was determined to be the most important feature. Furthermore, an accuracy of 0.71 was obtained in external validation on pharmaceuticals and personal care products that were not used for training. This shows that the proposed embedding vector generalizes to molecules unseen during training, which indicates that it reflects the characteristics of the molecular structures.
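
To illustrate what a frequency-based pair encoding of SMILES strings involves, the following is a minimal sketch and not the tokenizer used in the paper. It assumes a toy corpus and starts from single characters, whereas a real SMILES pair encoding tokenizer would start from atom-level tokens and learn its vocabulary from a large molecule collection. The function names `learn_merges`, `merge_pair`, and `tokenize` are hypothetical.

```python
from collections import Counter


def merge_pair(tokens, pair):
    """Replace each adjacent occurrence of `pair` with one merged token."""
    out, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == pair:
            out.append(tokens[i] + tokens[i + 1])
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out


def learn_merges(smiles_list, num_merges):
    """Learn up to `num_merges` merges from a toy SMILES corpus.

    Assumption: tokens start as single characters. This is a simplification
    for illustration only.
    """
    corpus = [list(s) for s in smiles_list]
    merges = []
    for _ in range(num_merges):
        # Count every adjacent token pair across the corpus.
        pair_counts = Counter()
        for tokens in corpus:
            pair_counts.update(zip(tokens, tokens[1:]))
        if not pair_counts:
            break
        best, count = pair_counts.most_common(1)[0]
        if count < 2:
            break  # no pair repeats, so nothing is worth merging
        merges.append(best)
        corpus = [merge_pair(tokens, best) for tokens in corpus]
    return merges


def tokenize(smiles, merges):
    """Apply the learned merges, in order, to a new SMILES string."""
    tokens = list(smiles)
    for pair in merges:
        tokens = merge_pair(tokens, pair)
    return tokens


# Toy corpus: ethanol, ethylamine, propane.
merges = learn_merges(["CCO", "CCN", "CCC"], num_merges=10)
print(tokenize("CCO", merges))  # ['CC', 'O']
```

In this sketch the most frequent adjacent pair is merged into a single token at each step, so common substructures end up as single vocabulary entries. The resulting token sequences vary in length, which is why the abstract describes a separate autoencoder step to compress them into fixed-length vectors.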
