Introduction

Primary pulmonary sarcoma (PPS) is a very rare malignant mesenchymal tumor that accounts for 0.013–0.4% of lung cancers1,2. The cumulative number of PPS cases in most healthcare institutions has been less than 50 for more than 20 years1,3,4,5. Histologically, PPS is usually categorized as malignant fibrous histiocytoma, smooth muscle sarcoma, rhabdomyosarcoma, synovial sarcoma, etc. The 5-year OS rate of PPS is lower than that of limb soft-tissue sarcoma, and the median survival (mOS) is 21 months6. The prognosis of PPS is affected by a variety of factors, and the most frequently discussed therapy method is the surgery, which is more often seen in primary pulmonary artery sarcoma7. In addition, the prognosis of PPS is influenced by age, gender, smoking history, tumor size, and radiotherapy, but these are more commonly seen in case reports respectively8,9,10,11.

Due to the low incidence and mostly disseminated nature of PPS, a better treatment system has not yet been established. So in the last decades any significant progress wasn’t got in survival prognosis. There are fewer clinical reports that focus on the effect of individual factors on the prognosis of PPS patients. Meanwhile, studies have found that PPS patients are mostly middle-aged, with an average age of around 50 years1,3,4,12. What is known is that along with the popularity of high-resolution CT, the number of rare tumors diagnosed in the lungs is increasing every year. Therefore, it is necessary to analyze the multifactorial prognosis. The SEER database contains multicenter data of oncology patients, which is rich and reliable. As a result, it can provide a large sample for research. Nomograms, with the fetures of visualization and quantification, are widely used as tools for prognostic evaluation of malignant tumors13,14. Based on the above, we investigated the clinical features and prognostic factors of PPS using the SEER database, and constructed a novel nomogram prediction model to predict the OS rate. Meantime, we performed multidimensional validation to fully confirm the good predictive efficacy of the model, which will provide objective and scientific guidance for clinical decision-making.

Materials and methods

Source

The SEER database is an open-access cancer database covering 28% of the US population. We extracted data for PPS patients in “Incidence-SEER 17 Regs Custom Data (with additional treatment fields), Nov 2021 Sub(2000–2019)” from 2010 to 2019.

Patients

Patient screening criteria were as follows: (1) Inclusion criteria: first primary in lung and bronchus; ICD-O-3 Hist/Behav (8800-8805/3, 8810/3, 8811/3, 8840/3, 8850/3, 8851/3, 8854/3, 8890/3, 8891/3, 8894/3, 8896/3 8900-8902/3, 8910/3, 8912/3, 8920/3, 8921/3, 8933/3, 8936/3, 9040-9044/3, 9260/3); pathological diagnosis. (2) Exclusion criteria: patients diagnosed by autopsy or death certificate; patients with incomplete required clinical information.OS was the primary study endpoint. The following information was collected: demographic variables, including age, sex, race, and marital status; clinicopathological information, including the year of diagnosis, survival time, pulmonary metastases, bone metastases, brain metastases, liver metastases, surgery, radiotherapy, and chemotherapy. Finally, a total of 169 patients with PPS were included in this study and randomized into the development and validation groups in a 7:3 ratio (Fig. 1). We constructed a prediction model with patient data from the development training group, and then validated the model internally with patients in the development training group and externally with patients in the validation group.

Figure 1
figure 1

The patients screening process.

Full size image

Statistical analysis

Statistical analysis was performed using R software (version 4.2.1), and P < 0.05 (bilateral) was considered statistically significant. The chi-square test was used to compare the distribution of variables in the modeling and validation groups. Then, univariate and multivariate COX regression analyses were used to stepwise identify independent prognostic factors. We built a novel predictive nomogram based on the independent prognostic factors to predict the OS rate of PPS patients at 1, 2 and 3 years, and constructed survival curves by Kaplan–Meier analysis to specify the median survival (mOS) of the relevant factors. In addition, time-dependent receiver operating characteristic (ROC) curves for 1, 2, and 3 years of nomogram were generated and the corresponding time-dependent area under the curve (AUC) was applied to discriminate. AUC  0.75 as good. In addition, calibration curves and decision curve analysis (DCA) were plotted to assess the nomogram.

Ethics approval

We signed the SEER Study Data Agreement to access SEER information using reference number 20225-Nov2021. We performed study methods following approved guidelines to access data provided by the SEER database. The Office of Human Research Protections considers data analysis to be non-human subjects studied by the U.S. Department of Health and Human Services because they are publicly available and de-identified. Therefore, it does not require any approval from the Institutional Review Board.

Results

Baseline characteristics of PPS patients

The SEER database was extracted from 570 patients with PPS, which included 98 cases of synovial sarcoma. It was the most common pathological type of sarcoma, accounting for approximately 17.19% of all PPS. After screening by inclusion and exclusion criteria, a total of 169 patients with PPS were finally included in this study, with an mOS of 7 months. PPS was mostly seen in older men with higher pathological grading. Distant metastases occurred in about 28.4% of them, with pulmonary metastases being the most common, accounting for 10.7% of all and 37.5% of distant metastases. Nearly half of the patients underwent surgical intervention and about one-third received chemotherapy and radiotherapy. All patients were randomly divided in a 7:3 ratio into a development training group of 118 (70%) and a validation group of 51 (30%), and the results of chi-square test showed that all variables were not statistically different between the two groups (Table 1).

Table 1 Basic characteristics of PPS patients.
Full size table

Prognostic factors and survival curves

We included age, gender, race, marriage, pathological grade, distant metastasis (lung, liver, brain, bone), surgery, radiotherapy, and chemotherapy variables in the development training group for univariate COX regression analysis, and the results showed that age, pathological grade, liver metastasis, surgery, and chemotherapy were statistically significant (P < 0.05). We subjected the above statistically significant factors to multivariate COX regression analysis, and the results showed that all the above factors were statistically significant (Table 2). This indicated that age, pathological grade, liver metastasis, surgery and chemotherapy are independent factors affecting the prognosis of PPS. The forest plot visualizes the magnitude of each independent factor and ranks them (in order of liver metastasis, pathologic grade, chemotherapy, and age). Patients with PPS who were ≥ 65 years old, had liver metastases, and had higher pathological grading had a poorer prognosis, and those who received surgical and chemotherapeutic interventions improved their prognosis (Fig. 2). Then, we plotted survival curves for the above factors. The results showed that those who developed liver metastases had a poorer prognosis than those who did not, and none had an OS of 1 year (Fig. 3a). Those with high pathological grading had a worse prognosis than those with low grading (mOS: 6 months vs. 53 months, 5 year OS rate: 12.5% vs. 28%) (Fig. 3b). Those with surgical intervention had a better prognosis than those without surgery (mOS: 21 months vs. 3 months), with a 5 year OS rate of approximately 33% (Fig. 3c). Chemotherapy had a better prognosis than those without chemotherapy (mOS: 16 months vs. 4 months, 5-year OS rate: 25% vs. 12.5%) (Fig. 3d), and those ≥ 65 years had a worse prognosis than those < 65 (mOS: 4 months vs. 23 months, 5 year OS rate: 6% vs. 25%) (Fig. 3e).

Table 2 Prognostic factors for patients with PPS.
Full size table
Figure 2
figure 2

A forest plot of PPS independent prognostic factors.

Full size image
Figure 3
figure 3

(a) OS by liver metastasis. (b) OS by pathological grading. (c) OS by Surgery. (d) OS by Chemotherapy. (e) OS by Age of diagnosis.

Full size image

Development training and validation of the nomogram

Based on the independent prognostic factors obtained above, we constructed a novel prognostic nomogram to visually calculate the OS rates of PPS patients at 1, 2, and 3 years (Fig. 4). Then, the differentiation ability of the Nomogram was assessed by plotting ROC curves and calculating the corresponding AUC values. The AUC values for the development training group were 0.856, 0.864, and 0.853 at 1, 2, and 3 years, respectively (Fig. 5a), and the AUC values for the validation group were 0.741, 0.793, and 0.741, respectively (Fig. 5b), indicating that the model has good discriminatory power. The internal (Fig. 6a–c) and external (Fig. 6d–f) calibration plots showed that the 1-, 2-, and 3-year OS rates predicted by the column line plots were similar to the actual 1-, 2-, and 3-year OS rates with a high degree of consistency. In addition, the internal (Fig. 7a–c) and external (Fig. 7d–f) DCA plots also indicated that the model has clinical practice efficacy.

Figure 4
figure 4

A prognostic nomogram to predict the 1, 2, and 3 years OS in PPS patients.

Full size image
Figure 5
figure 5

Time-dependent ROC curve analysis of the nomogram for the 1, 2, and 3 years in the training set (a) and the validation set (b).

Full size image
Figure 6
figure 6

The calibration curves for predictions of overall survival in the training set (ac) and in the validation set (df) at 1, 2, and 3 years.

Full size image
Figure 7
figure 7

The decision curve analysis of the nomogram in the training set (ac) and in the validation set (df) at 1, 2, and 3 years.

Full size image

Discussion

Sarcoma is a malignant tumor originating from mesenchymal tissues such as fibrous tissue, fat, muscle, blood vessels and lymphatic vessels, bone, and cartilage tissue. We extracted more than a dozen pathological types of sarcomas commonly originating from the lung in the last decade from the SEER database, such as fibrosarcoma, synovial sarcoma, liposarcoma, smooth muscle sarcoma, rhabdomyosarcoma, and angiosarcoma, etc. To our statistical analysis, synovial sarcoma was the most common pathological type, accounting for 17.19%, which was inconsistent with previous studies. Some studies reported that smooth muscle sarcoma was the most commonly reported type of PPS15,16. The sudy of Petrov et al. in 48 cases of PPS was fibrosarcoma, which accounted for 37.5%4. We considered more abundant and larger samples in the SEER database.PPS is a very rare malignancy with an inferior poor prognosis. And most of the studies on PPS were case reports or were single-center case series, and large samples are scarce. No effective treatment has been found so far. Evaluation of prognostic factors will help clinicians to make scientific and effective clinical decisions. Therefore, we explored the independent factors affecting the prognosis and found that liver metastasis, pathological grade, surgery, age, and chemotherapy have a significant effect on its prognosis. And we also developed a novel predictive nomogram, which can be used to visually personalize the OS rates at 1, 2, and 3 years. In Fig. 4, a patient with PPS < 65 years old, with pathological grade III-IV, without liver metastases and treated only with surgery, had a total of 198 points calculated by the model, and the corresponding 1-year, 2-year, and 3-year OS rates were 51.7%, 38.7%, and 37%, respectively. If the patient also received chemotherapy, the total number of points calculated by the model would be approximately 168, with corresponding 1-year, 2-year, and 3-year OS rates of 72%, 62.5%, and 62%, respectively. Thus, postoperative chemotherapy for PPS significantly improved OS. In addition, we validated the model, the results showed that it has good discrimination, predictive accuracy, and also favorable clinical practice utility, which can be used to clinically guide clinical decision-making.

According to the model we constructed, the main treatments for PPS are surgery, chemotherapy, and radiotherapy, consistent with other common solid malignancies. Complete tumor resection is the widely accepted treatment for PPS. Some endobronchial sarcomas can be treated with conservative pneumonectomy, including carina excision18,19. Previous studies have shown that the resection rate for PPS is 50–100%1,3,4,14,19, while our results was slightly lower, with approximately 47.3% undergoing surgical treatment. PPS is more aggressive and has a lower survival rate compared to other malignancies originating from the lung, with a 3-year postoperative survival rate of 17–50%6,12,15,16. Bacha et al.17 showed a 5-year postoperative OS rate of 50% for PPS, while the 5-year postoperative OS rate for PPS in the present study was 33%, lower than it. It may be because most of the PPS patients included in this study had a high pathological grade. Sarcomas with high-grade malignancies have the worse prognosis than lower grades15. Similar findings were found in our study, where patients with pathological grade III–IV had a significantly lower mOS than grade I–II (6 months vs. 53 months). Moreover, Régnard et al.1 and Petrov et al.4 studies confirmed that those with the earlier TNM stages in PPS patients favored survival. Several earlier studies also showed1,15,19 that patients who underwent complete resection had significantly higher OS rates compared to inoperability or positive surgical margins. Therefore, early complete tumor resection is particularly important to improve OS rates in PPS3,4. However, radiotherapy does not benefit patients in some cases. Spraker et al.6 showed that was no statistical difference in OS between patients who received surgery and radiotherapy and those who received surgery alone. In our study, we also did not find that radiotherapy improved the OS rate of PPS.

More interestingly, pulmonary sarcomatoid carcinoma (PSC) is a group of poorly differentiated non-small cell lung cancer (NSCLC) containing a sarcomatoid component, including giant cell carcinoma, spindle cell carcinoma, pleomorphic carcinoma, and carcinosarcoma. In some cases, PSC is difficult to distinguish from pulmonary sarcoma (PS), and PS has a better prognosis than PSC20,21 and worse than NSCLC21. The OS rate of PPS we showed was also better compared to that of PSC shown in previous studies22. Moreover, chemotherapy does not improve the OS rate of PSC22,23,24,25, whereas our study showed that chemotherapy improves the OS rate of PPS. Therefore, we must correctly identify PSC with PPS to make appropriate treatment. Age and liver metastasis were independent factors affecting the prognosis of PPS in our study. Elderly people (≥ 65 years) and those who developed liver metastases had a poor prognosis, none of whom survived more than one year. In addition, pulmonary, bone, and brain metastases had no significant effect on the prognosis of PPS. Although some the prognostic factors are to be expected. in our study, We further quantified them and clarified the magnitude of the weighting of each factor. To our knowledge, this is the largest sample size study with real-world data on PPS prognosis. Consequently, our research is great significance and clinical guiding value.

Some limitations are still in our study. On the one hand, the SEER database does not include information on specific regimens of chemotherapy, radiotherapy, and surgery, so we cannot discuss the pros and cons of specific treatment regimens on the prognosis of PPS. On the other hand, due to the limitation of retrospective study and the long time span, our conclusions may be biased. However, the SEER database is still in the process of being updated, and we believe that these issues will be resolved soon.