OAK

도서관 로그인

검색

SUNGSHIN Repository 대학원 일반대학원 미래응용과학학과 학위논문

질량 분석 기반 생체 분자의 펩타이드 및 글라이칸 특성 연구

Metadata Downloads

Alternative Title: Mass Spectrometry Based Characterization of Peptides and Glycans in Biomolecules

Abstract: 단백질체학(Proteomics)은 생물학적 시스템 내에 존재하는 단백질과 펩타이드의 다양성과 복잡성을 이해하기 위한 중요 연구 분야이다. 이러한 단백질의 특성 규명을 위해 주로 사용되는 질량 분석법은 높은 정확도로 인해 강력한 분석 도구로 여겨진다. 이에 따라 본 연구에서는 질량 분석을 활용해 정밀 분석 기법을 확립하고, 이를 HLA class Ⅰ 복합체, 치료용 항체, 콜라겐 등 다양한 생체 단백질에 적용함으로써 구조 분석 및 특성 분석을 위한 전략을 제시하고자 하였다.
먼저, HLA는 Human Leukocyte Antigen의 약자로, 인간 백혈구 항원을 의미한다. HLA는 class I, II, III로 분류되며, 이 중 HLA class I 분자는 내인성 유래 펩타이드를 CD8⁺ T 세포에 제시함으로써 체내 면역을 수행하는 메커니즘에 관여한다. 이 매개 면역 요법은 단순히 펩타이드를 면역 반응에 사용하는 것을 넘어, 암과 같은 질환의 발병 및 진행 과정에서 다양한 생물학적 역할을 수행한다. 따라서 이러한 항종양 치료 방법을 확립하기 위해서는 종양 세포에서 발현되는 HLA class I 결합 펩타이드의 서열 데이터를 평가 및 수집하기 위한 적절한 분석법의 개발이 중요하다.
이를 위한 실험에는 인간 대장암 세포주 HCT-116이 사용되었다. 실험에 앞서, 펩타이드를 효율적으로 추출하기 위한 cell lysis 조건을 최적화하고자 다양한 lysis 시간 조건으로 실험을 진행했다. 그 결과, 4시간 처리군에서 가장 높은 추출 효율을 보였으며, lysis 시간이 경과할수록 HLA class I 펩타이드 추출 효율이 감소하는 경향을 확인할 수 있었다. 이후 Orbitrap 질량 분석기를 사용해 추출한 펩타이드의 분석을 수행했고, 데이터 분석은 PEAKS Studio 및 Proteome Discoverer 소프트웨어를 활용했다. 먼저 Peaks Studio에서 De novo 서열 분석을 통해 펩타이드 서열을 예측하고, 이를 FASTA format 파일로 변환해 Proteome Discoverer에서 데이터베이스로 활용해 동정되는 펩타이드 수를 확인하였다. 이 분석을 통해 총 1400개 이상의 HLA 펩타이드 서열을 규명할 수 있었다. 또한, HLA class I 분자의 면역학적 기능에 관여하는 N-글라이칸을 분리해 분석하고, UPLC-FLR 결과와 질량 분석 결과를 비교함으로써 전체적인 글라이칸 구조를 확인했다.
두번째로, 질량 분석을 기반으로 치료용 단일클론 항체의 글라이코실화 특성을 분석하였다. 치료용 단일클론 항체는 다양한 질환에 대한 치료 효과로 현대 의학에서 사용 추세가 꾸준히 증가하고 있다. 이에 따라 생산된 항체의 이질성 평가는 필수적이며, 그중에서도 항체 글라이코실화는 치료 항체의 효능에 직접적인 영향을 미치기 때문에 항체 품질 검증 과정에서 핵심적인 평가 요소이다. 따라서 글라이칸 특성 규명을 위한 분석법의 최적화는중요한 요소로 간주된다.
본 연구에서는 다양한 오리지널 항체 및 그 바이오시밀러 샘플의 글라이코실화 패턴을 파악하기 위해 항체 자체에 대한 LC-MS 분석과 분리한 N-글라이칸에 대한 LC-MS 분석을 병행했다.
우선, 항체 샘플들에 별도의 효소 처리를 하지 않은 채로 질량 분석을 수행해 분자량을 확인했다.
추가적으로 PNGase F의 처리를 통한 분석으로 mass shift를 확인하여 항체에 부착된 글라이칸 조성을 유추했다. 또한, 분리한 N-글라이칸은 초고성능 액체 크로마토그래피-형광 분석(UPLC-FLR)과 LC-MS 분석 기법을 활용해 분석했고, 이를 통해 다양한 글라이칸 구조를 식별할 수 있었다. 주요 글라이칸의 peak 면적 백분율 총합을 계산한 결과, 일부 항체를 제외한 대부분의 시료에서 80% 이상의 백분율 총합이 확인되어 항체 품질 측면에서 양호한 특성을 보임을 확인하였다. 또한, 항체 간에 G0F, G1F, G2F 구조의 분포 비율 차이가 관찰되었으며, 이를 통해 시료 간 미세한 글라이칸 패턴 차이를 확인할 수 있었다.
마지막으로, 생물의 주요 구성 성분 중 하나인 콜라겐에 대해 질량 분석 기반의 특성 분석을 수행했다. 일반적으로, 효소 처리로 얻은 저분자 어류 콜라겐은 다양한 산업 분야에서 활용도가 높다. 그러나 시중에 유통되는 콜라겐은 어종에 따라 기능과 안정성에 차이가 존재하며, 대부분 가공된 형태로 제공되기 때문에 원료의 품질과 유래를 명확히 파악하기 어렵다. 따라서 콜라겐 성분에 대한 정밀한 분석을 통해 어종별 특성을 평가할 수 있는 기술의 개발이 필요하다. 하지만, 저분자 어류 콜라겐은 다른 단백질에 비해 분자량이 작고 다양한 펩타이드로 구성되어 있어, 기존의 분석 방법만으로
는 정확한 서열을 파악하기 어렵다. 이를 극복하기 위해 본 연구에서는 액체 크로마토그래피와 결합한 질량 분석 방식을 활용해 복잡한 펩타이드 혼합물의 서열 내에서 어종별 공통 서열과 특이서열을 동정할 수 있는 분석 전략을 확립했다. 사용한 시료는 명태, 틸라피아. 대구 각 3종류, 총 9개의 어류 콜라겐이며, 이를 Orbitrap 질량 분석기로 분석했다. 데이터 분석에는 PEAKS Studio와 Proteome Discoverer 소프트웨어를 활용했다. 이 역시 동일한 절차에 따라 PEAKS Studio에서 De novo 서열 분석을 수행한 후, 자체 구축한 데이터베이스를 활용하여 Proteome Discoverer에서 어종 간 공통서열 및 특이서열을 확인하였다.| Proteomics is a vital field of research aimed at understanding the diversity and complexity of proteins and peptides within biological systems. Among the various analytical techniques, mass spectrometry (MS) is widely regarded as a powerful tool due to its high precision in characterizing proteins. In this study, we aimed to establish precise analytical methodologies using mass spectrometry and to apply these approaches to the structural and functional characterization of various
biologically relevant proteins, including HLA class I complexes, therapeutic antibodies, and collagen.
The Human Leukocyte Antigen (HLA) system plays a central role in immune regulation and is categorized into classes I, II, and III. Among them, HLA class I molecules are responsible for presenting endogenously derived peptides to CD8⁺ T cells, thereby playing a crucial role in the immune response. This antigen presentation not only contributes to immune surveillance but also has significant implications in the development and progression of diseases such as cancer. Therefore, identifying and characterizing the peptide sequences bound to HLA class I molecules expressed in tumor cells is essential for advancing
antigen-based cancer immunotherapies.
For this purpose, the human colorectal cancer cell line HCT-116 was used. Prior to peptide isolation, we optimized cell lysis conditions to improve peptide extraction efficiency by varying the lysis duration. We observed the highest extraction efficiency at 4 hours, while longer lysis times resulted in a decline in peptide yield. Peptides were then analyzed using an Orbitrap mass spectrometer, and the resulting data were processed using PEAKS Studio and Proteome Discoverer software. Initially, de novo sequencing was conducted using PEAKS Studio to predict peptide sequences, which were then converted into a FASTA
format file. This file was used as a custom database in Proteome Discoverer to identify and quantify the number of peptides matched.
Through this approach, we successfully identified over 1,400 HLA class I peptides.
In addition, N-glycans, which are functionally significant in the immunological roles of HLA class I molecules, were analyzed. We compared the UPLC-FLR and mass spectrometry results to confirm the overall glycan structures associated with HLA class I complexes.
Secondly, we investigated the glycosylation characteristics of therapeutic monoclonal antibodies using mass spectrometry-based approaches. The use
of monoclonal antibodies has been steadily increasing in modern medicine due to their therapeutic efficacy against a variety of diseases.
Accordingly, evaluating the heterogeneity of these antibodies is essential, especially since antibody glycosylation directly affects their therapeutic efficacy. Therefore, optimizing analytical methods for glycan profiling is considered a crucial step in the quality assessment of therapeutic antibodies.
In this study, we performed a comprehensive analysis of the glycosylation patterns of various originator antibodies and their biosimilar counterparts using a combination of LC-MS analysis of the intact antibodies and LC-MS analysis of the released N-glycans. First, the intact antibody samples were analyzed by mass spectrometry without enzymatic treatment to determine their molecular weights. Subsequently, PNGase F digestion was performed to identify mass shifts indicative of glycan removal, enabling inference of the glycan compositions attached to the antibodies. The released N-glycans were further analyzed using
Ultra-Performance Liquid Chromatography with Fluorescence Detection (UPLC-FLR) and LC-MS, which enabled the identification of various glycan structures. Based on the calculated percentage of peak areas for the major glycans, it was observed that most antibody samples exhibited over 80% of total peak area, indicating favorable glycosylation characteristics in terms of quality. In addition, differences in thedistribution ratios of G0F, G1F, and G2F were observed among the antibodies, revealing subtle variations in glycan patterns.
Lastly, we conducted a proteomic analysis of collagen, a key structural protein in biological tissues, using a mass spectrometry-based approach.
Low-molecular-weight fish collagen, typically obtained via enzymatic hydrolysis, is widely used in various industries. However, the functional and stability properties of collagen can differ depending on the species of origin, and commercial collagen products are often provided in processed forms, making it difficult to trace their source and quality. Therefore, there is a growing need for analytical techniques capable of accurately evaluating species-specific characteristics of collagen. Due to the low molecular weight and peptide complexity of hydrolyzed fish collagen, conventional methods often fall short in providing accurate sequence identification.
To overcome this limitation, we established a strategy that combines liquid chromatography with mass spectrometry (LC-MS) to analyze complex peptide mixtures and identify both common and species-specific peptide sequences. We analyzed nine collagen samples derived from three fish species—Alaska pollock, tilapia, and cod—using an Orbitrap mass spectrometer. Data analysis was performed using PEAKS Studio and Proteome Discoverer. As with previous experiments, de novo sequencing was conducted using PEAKS Studio, followed by the construction of a custom database that enabled the identification of shared and unique
peptide sequences among different fish species in Proteome Discoverer.