Professional Documents
Culture Documents
Title: Machine learning integration of rheumatoid arthritis synovial histology and RNAseq
Authors: Dana E. Orange, MD, MSc*1,2,3 Phaedra Agius, PhD*3, Edward F. DiCarlo, MD4,
Nicolas Robine, PhD3, Heather Geiger, PhD3, Jackie Szymonifka, PhD1, Michael
McNamara, BS1, Ryan Cummings, AB1, Kathleen M. Andersen, BSc1, Serene Mirza, BS1,
Mark Figgie, MD5, Lionel Ivashkiv, MD, PhD6, Alessandra B. Pernis, PhD6, Caroline Jiang,
PhD8, Mayu Frank, NP2,3, Robert Darnell, MD, PhD2,3, Nithya Lingampali, BS9, William
Robinson, MD, PhD9, Ellen Gravallese, MD10, Vivian P. Bykerk, MD1*, Susan M. Goodman,
MD1* and Laura T. Donlin, PhD*7, The Accelerating Medicine Partnership: RA/SLE Network
7 Arthritis and Tissue Degeneration Program and the David Z. Rosensweig Genomics
This article has been accepted for publication and undergone full peer review but has not
been through the copyediting, typesetting, pagination and proofreading process, which may
lead to differences between this version and the Version of Record. Please cite this article as
doi: 10.1002/art.40428
Worcester, MA
Corresponding Authors:
Dana E Orange
Telephone: 917-439-9625
Fax: 212-794-1999
Email: dorange@rockefeller.edu
Laura T Donlin
Telephone: 212-774-2743
Fax: 212-774-2560
Email: DonlinL@HSS.EDU
Funding was provided by the New York Genome Center, Rockefeller University grant # UL1
TR000043 from the National Center for Advancing Translational Sciences (NCATS),
Clinical Translational Science Center (CTSC), and The Block Family Foundation. LI was
Accepted Article
supported by grant # AR046713.
This work was supported by the Accelerating Medicines Partnership (AMP) in Rheumatoid
Arthritis and Lupus Network. AMP is a public-private partnership (AbbVie Inc., Arthritis
Research Alliance, Merck Sharp & Dohme Corp., National Institutes of Health, Pfizer Inc,
Inc.), created to develop new ways of identifying and validating promising biological targets
for diagnostics and drug development Funding was provided through grants from the
Abstract
Methods: Twenty histologic features were assessed on 129 synovial tissue samples.
Consensus clustering was performed on gene expression data from a subset of 45 synovial
samples. Support vector machine learning was used to predict gene expression subtypes
using histology data as input. Corresponding clinical data were compared across subtypes.
Results: Consensus clustering of gene expression data revealed three distinct synovial
TGF β, glycoproteins and neuronal genes, and a mixed subtype. Machine learning applied
scoring histology features. Patients with highly inflammatory synovial subtypes exhibited
Accepted Article
higher levels of markers of systemic inflammation and autoantibodies. CRP was
significantly correlated with pain in the high inflammatory group but not the others.
Conclusion: Gene expression analysis of synovial tissue revealed three distinct synovial
subtypes. We used these labels to generate a histology scoring algorithm that associates
with levels of ESR, CRP and autoantibodies. Comparison of gene expression patterns to
Introduction
Rheumatoid arthritis (RA) is the most prevalent autoimmune arthritis, in which extensive
inflammation in synovial tissue can lead to joint destruction. Assessment of synovium has
classification of RA synovium is not yet factored into current RA diagnosis, nor treatment
guidelines [6]. A hematoxylin and eosin (H&E) stain based assessment of RA synovium is
routine offering by clinical pathology laboratories. The Krenn scoring system of H&E stained
62% sensitive and 96% specific for the diagnosis of rheumatic diseases, it does not
We reasoned a more granular histology scoring system could be useful to subtype and
guide treatment of RA. Others have explored the significance of lymphocyte aggregates
sedimentation rate (ESR) and c-reactive protein (CRP) but not with factors with higher
specificity for such as anti-citrullinated peptide antibodies (ACPA) and rheumatoid factor
subtypes.
Accepted Article
Assessments of H&E stained slides can detect an array of inflammatory features including
glycoproteins that retain water and therefore form gels in RA and osteoarthritis (OA)
synovium, but not normal synovial tissue[16] as well as detritus, small fragments of
cartilage or bone[17]. Here we aimed to evaluate the relative utility of 20 such features in
with the goal of developing an algorithm to score histology features in a manner that
We performed an integrative analysis of clinical, histologic and gene expression data from a
cohort of 123 RA patients and six OA patients to provide insights into tissue inflammation
and sub-classification of RA. Gene expression cluster analysis identified three synovial
subtypes, which were used as labels to train a support vector machine (SVM) learning
algorithm with histology scores as inputs (features). This analysis produced a histology
scoring algorithm that predicts the three gene expression subtypes using only histology
features and corresponds with acute phase reactants and autoantibody levels.
Methods
Patient data
Against Rheumatism 2010[18] and/or 1987 criteria. Synovial samples from an additional six
serostatus were coded as negative, low or medium positive (1-3 times the upper limit of
Accepted Article
normal) and high positive (>3 times the upper limit of normal). Pain was assessed by asking
patients the question “How much pain have you had because of your condition over the
past week? Please indicate how severe your pain has been on a scale of 0-10.” This study
was approved by the HSS institutional review board (#2014-233), the Rockefeller University
IRB (#DOR0822) and Biomedical Research Alliance of New York (15-08-114-385) and
Sample processing
Adjacent areas of synovial tissue were placed into histology (OCT frozen blocks) and
RNA sequencing
RNA was extracted, Qiagen RNAeasy Mini kit (Qiagen # 74104) and libraries were
prepared using Truseq mRNA stranded Library kits at New York Genome Center and 50
base pair, paired-end reads were sequenced on HiSeq2500. Reads were aligned to hg19
using STAR [19]. Samples with greater than 0.1% globin mRNA were excluded from further
samples of sufficient quality were processed in three separate batches. To account for any
batch effects, we used ComBaT in the Bioconductor SVA package[20] and DESeq2[21] to
RNA-seq clustering
RNA-seq data was clustered using consensus clustering, an iterative clustering method
where small data perturbations (using the rnorm function in the basic stats package in R)
are introduced at each clustering iteration. The clustering iterations offer the opportunity to
computation. We used the k-means clustering method, which typically does not converge to
Accepted Article
the same solution when clustering the same data multiple times (contrary to agglomerative
times into 2, 3, 4 and 5 groups. The clustering iterations were then used to generate
(number of times the pair of samples cluster together divided by the number of clustering
iterations). This score ranges from 0 to 1, with scores of 1 indicating co-clustering of the two
samples.
across synovial subtypes identified via clustering, with the adjusted p-value of 0.01. We
eliminated chrX/Y genes to remove sex biases, as well as IgG variable genes (V,D and J)
since the individual V,D and J gene segments are counted as individual immunoglobulin
Pathway analysis
The Database for Annotation, Visualization and Integrated Discovery (DAVID v6.8) [22] was
used with default parameters to identify functional annotation clusters of genes. Genes
differentially expressed in High vs other groups with padj<0.01 and fold change <-0.5 were
used to characterize pathways relatively enriched in both Low and Mixed subtypes. Genes
differentially expressed in High vs other groups with padj<0.05 and increasing in order from
Low to Mixed to High were used to identify pathways associated with gradual increase in
expression from Low to Mixed to High. Enrichment scores are a modified Fisher Exact P
In order to deconvolute the cellular composition of the 3 subtypes in our data, we used an
Accepted Article
algorithm called CIBERSORT: Cell-Type Identification by estimating relative Subsets Of
known RNA Transcripts [23,24], a machine learning system trained on the profiles of 22
distinct leukocyte datasets over 547 genes with defined “barcodes” of gene expression
e1071) to predict the three RNA-seq subtypes, using leave-one-out cross-validation to train
and predict transcriptomic subtypes. Weights attributed to each feature were extracted from
a SVM. All histology features were presented as binary vectors – histology features having
more than 2 categories were converted to a binary representation, so that all features were
Histology Scoring
129 synovial samples were preferentially taken from grossly inflamed (dull and opaque)
synovium. If no inflammation was apparent, samples were taken from standard locations:
the femoral aspects of the medial and lateral gutters, and the central supratrochlear region
in the suprapatellar pouch. Tissue samples were snap frozen and stained with Harris
features (synovial lining hyperplasia, lymphocytes, plasma cells, Russell bodies, binucleate
plasma cells, fibrin, synovial lining giant cells, sublining giant cells, fibrosis, detritus,
approach that is outlined in detail in Supplemental Methods. 129 samples were scored by
one pathologist (ED) and 40 samples were scored again by a second pathologist (EG) to
were diluted 1:30 ratio in a proprietary dilution buffer (Bio-Rad Laboratories), mixed with the
and then incubated with an anti-human IgG antibody conjugated to phycoerythrin (Jackson
Statistics
Kappa statistics were used to assess inter-rater reliability of the histology scores sourced
from the two pathologists. For variables with three or more response levels, weighted kappa
statistics were calculated to give credit for differences that are close as opposed to treating
any difference the same. Inter-rater reliability was considered as none to slight if 0.01-0.2,
fair as 0.21-0.40, moderate as 0.41-0.60, and substantial as 0.61-0.80 and almost perfect if
0.81-1.00[24]. The Jonckheere-Terpstra two-sided test for trend was used to compare
pathway enrichment scores of GSVA data across the three synovial subtypes. Kruskal-
Wallis test with Dunn’s multiple comparison tests was used to compare CIBERSORT RNA
scores and clinical features among the three synovial subtypes. Chi-square test was used
to detect differences above those expected by chance in binary data among the three
synovial subtypes. ANOVA with Tukey’s multiple comparisons test was used to compare
log2 transformed mean fluorescent intensities of ACPA among the three synovial subtypes.
Spearman’s correlation coefficients were calculated between pain and gene expression
Clinical characteristics of 123 patients with RA are presented in Supplemental Table 1. The
majority of patients were female and 47% and 50% were seropositive for RF and CCP,
respectively. Despite average disease duration of 14 years, the average disease activity
score in 28 joints (DAS28) was 3.8 (moderate) and 53% of patients were treated with a
biologic agent just prior to surgery. Joints included hips (40%), knees (57%) and shoulders
(3%).
We first sought to determine the prevalence and feasibility of scoring the candidate
histology features. Fourteen of the 20 features were observed in more than 5% of samples
(Figure 2A). We next evaluated the inter-rater reliability of these histology features on a
subset of samples (Figure 2B). Representative images of the features with frequency > 5%
and at least fair inter-rater reliability are presented in Figure 2C and 2D. Features that were
seen in at least 5% of samples, with at least fair inter-pathologist reliability included: plasma
cells, fibrin, binucleate plasma cells, Russell bodies, neutrophils, lymphocytes, synovial
Independent of the histologic analysis, consensus clustering of the top 500 most variable
cluster together, with the 45 samples constrained to cluster into k=2, 3, 4 and 5 clusters.
The red boxes represent co-clustering samples (likelihood scores of 1), blue represents
samples that never co-cluster (scores at 0), and white represents inconsistent co-clustering
patterns (scores at 0.5). When the samples were partitioned into 2 or 3 clusters, the
the majority of the likelihood scores approached 1 or 0. However when the samples were
Accepted Article
clustered into 4 or more groups, significantly less consistency was observed. This
visualization is a statistically robust confirmation that the 45 RNA-seq samples form at most
three distinct subgroups. We also explored this analysis using differing numbers of variable
genes (Supplemental Figure 1) and identified very minor shifts across the three consensus
clustering subtypes, lending confidence to the k=3 sample partitioning. Principal component
analysis of the samples using the top 500 most variable genes demonstrated agreement
with the three sample subtypes (Figure 3B), validating the clustering.
the 3 synovial clusters. Comparisons of each cluster to the others are presented in
Supplemental Tables 2, 3, and 4. More than half of these separated Low from High, while
the majority of the remaining genes separated Mixed from High. The Mixed subtype shares
features with both High and Low subtypes. Since it is a blend of the two subtypes, there are
very few genes that are unique to the Mixed group. Heatmaps of the top 1000 and top 50
most variable genes (as ranked by the p-adjusted value output by DESeq2) depict the main
gene expression patterns segregating the subtypes (Figure 3C). The largest gene block
displays an increasing expression pattern as one progresses from the Low to High
clustering on genes that increase in this order using DAVID GO analysis and found that the
highest enrichment scores corresponded with immunity, immune cell signaling pathways
such as SH2, SH3 and kinases, immunoglobulins, as well as chemokines and cytokines
A smaller block of genes showed higher expression in Low and Mixed than in High
glycoproteins (Figure 4B). These genes included markers of lining layer fibroblasts such as
Accepted Article
CD55 [25, 26], TGF-β superfamily genes (TGFBR3, TGFB2, BMP4 and BMP6) and genes
that mediates core O-glycan branching, a critical step in mucin synthesis[27]. Another
enriched group of genes in the low inflammatory subtype belonged to the protocadherins
synapses that are critical for axon targeting and neuron survival[28-31]. Other significantly
enriched pathways in those involved in neuronal pain processing. For example, GLRA3
(glycine receptor alpha 3) is required for central pain sensitization[32], and ADRA1B (alpha-
lymphoid, myeloid, fibroid and low-inflammatory synovial subtypes [5]. Samples from our
High inflammatory subtype demonstrated high expression of both myeloid genes and
lymphoid genes signature genes (Supplemental Figure 2). The previously described
“fibroid” synovial subtype was enriched in TGF-β signaling pathway genes with a relative
paucity of inflammatory gene expression [5]. We confirmed that these genes were also
Identifying the cellular source of gene expression variation can be challenging in samples
containing various cell types, due to differences in both proportions and total cell quantities.
To better characterize the cell types responsible for gene expression differences amongst
the synovial tissues, we applied the machine-learning framework for Cell-Type Identification
analysis of gene expression profiles across the three synovial subtypes. The clusters were
(23%), Mixed (32%) and High inflammatory group (68%) (Figure 4C). The High
Accepted Article
inflammatory group harbored significantly increased inferred fractions of neutrophils, M2,
One of our main goals was to determine the synchrony between synovial histology features
and their genomic subtypes, the existence of which enables a cheaper histology-based
approach to characterizing synovial tissue. To this end, we implemented a leave one out
cross validation, SVM classification system to predict synovial genomic subtypes for our 45
samples, using a binary representation for the histology scores as training features, and
genomic subtypes as training labels for the model. The predictive power of this system was
evaluated by measuring the area under the curve (AUC) of receiver-operating curves
(ROC) (Figure 5A). Models separating High from others and Low from others performed
best (AUC=0.88 and 0.71 respectively) while models separating Mixed subtypes were
harder to predict (AUC= 0.59). We compared the predicted synovial subtypes to those
assigned by RNA-seq clustering, and found that 39 out of 45 of the samples agreed.
Feature weights were extracted from the modeling system and are shown in Figure 5B
where we observed that the most discriminatory features were, in general, also had
After observing satisfactory training and testing results across the 45 RNA-seq samples, we
computed two SVMs using all 45 samples: one to discriminate Low from others, and one to
discriminate High from others. We used these models to predict genomic subtypes on the
remaining 82 samples for which we had complete histology and clinical laboratory data
available, but not gene expression data. We compared the distributions of histology
histology scoring algorithm (Figure 5C, right panels). The distributions of histology features
Accepted Article
were similar whether the samples were classified by gene expression clusters (Figure 5C,
left panels) or by the histology feature algorithm (Figure 5C, right panels). For example,
lymphocyte scores are always highest in the high inflammatory subtype, regardless of how
the subtypes are classified. This is an expected result and serves to validate our approach.
gene expression clusters versus the 82 RA patients classified by the histology scoring
algorithm (Figure 5D), and again found consistent patterns. Given that clinical features were
not part of development of the algorithm, this consistent distribution of clinical features
demonstrates that both classification methods associate in meaningful ways with clinical
laboratory features. For example, low ESR values were most commonly present in Low
inflammatory subtypes and rarely present in High inflammatory subtypes, whether classified
independent set of clinical features provides additional confirmation for the histology scoring
algorithm.
We next compared the clinical features of all 123 RA patients who were classified using the
histology scoring algorithm. Firstly, we compared the histology features across the three
subtypes and found that lymphocytes, plasma cells, lining hyperplasia, binucleate plasma
cells, Russell bodies, fibrin, and neutrophils were significantly increased in the High
inflammatory samples (Figure 6A). This histology data was consistent with the fractions of
cells inferred from the gene expression data (Figure 4C). Similarly, ESR, CRP, RF and
CCP were increased in patients with High inflammatory scores (Figure 6B). We also
compared the fine specificity of ACPA (Supplemental Figure 3), and identified significantly
patients in the High inflammatory group (Figure 6C). Unexpectedly, these subtype
counts, swollen joint counts, or disease duration (Figure 6B). The Low inflammatory
Accepted Article
subtype had high pain scores (median pain score =6/10) but little inflammation in the tissue
(according to gene expression and histology assessments or blood (ESR and CRP). We
therefore hypothesized that pain might be driven by distinct mechanisms in the various
coefficients between the level of acute phase reactants and pain scores across all samples
grouped together and when parsed according to patient synovial histology subtype. We
found a weak, non-significant correlation between pain and acute phase reactants when we
analyzed all patients together (Figure 6D), that became more clear when we divided the
synovium, but not the other synovial subtypes, had a stronger and significant correlation of
pain with CRP. This suggests that pain is associated with inflammation in patients with
High inflammatory subtypes and that pain may be driven by distinct mechanisms in the
other patients.
Discussion
machine learning model trained on these subtypes, predicting genomic subtypes from
histology data. Through our modeling system, we found that the histologic features that
most strongly defined the High inflammatory subtype included three plasma cell features:
binucleate plasma cells, plasma cell percentage and Russell bodies (Figure 5B).
Deconvolution of gene expression profiles indicated that H&E scores of these plasma cell
features were a harbinger of a leukocyte panoply including T follicular helper cells, memory
resting and activated CD4 T cells, CD8 T cells, monocytes, M0, M1 and M2 macrophages,
and neutrophils. Our gene expression analysis did not identify distinct myeloid and
differences in the patient populations or treatments account for the discrepant results.
Accepted Article
Analysis of the Low inflammatory subtype identified expression of genes involved in
glycoprotein production and TGF β pathways (fibroid genes) as well as neuronal genes.
This discovery is consistent with results of at least one other prior study in which
subtype of RA [36]. A common theme among the neuronal genes in this cluster is that they
play a role in a maladaptive response of the nervous system to damage. It is interesting that
this subtype is characterized by a paucity of inflammatory infiltrates, yet maintains high pain
scores and multiple tender/swollen joints - this too is consistent with other findings of
patients with established RA [37]. While it is possible that these patients had OA of the
arthroplasty joint and active rheumatoid arthritis in other joints, given minimal systemic and
synovial inflammation, it raises the question of whether the other joints might be perceived
(by the evaluating rheumatologist) as tender and swollen due to mechanisms other than
inflammation. For example, synovial proteoglycans such as mucin, which was common in
synovial samples, has a jelly-like consistency and could be perceived by the evaluating
clinician as synovial swelling. Similarly, tenderness and pain could be due to non-
inflammatory mechanisms and this is consistent with the dissociation of pain scores from
systemic inflammatory markers (Figure 6D) as well as enrichment for neuronal gene
expression (Figure 4B) in the Low and Mixed synovial subtypes. Our work suggests that RA
patients with longstanding disease and poor response to anti-inflammatory treatment may
the cause of pain in patients with little tissue inflammation is critical because non-
inflammatory pain and mucin related synovial thickening could result in high tender and
swollen joint counts in the absence of systemic inflammation (low ESR and CRP). We
inflammation.
Accepted Article
There are several important limitations of this work. The lack of normal synovium tissues
makes it challenging to draw conclusions about the genes overexpressed in the Low
these results is that the cell bodies of sensory neurons reside in the dorsal root ganglion
and are not captured when sampling synovial tissue. It is possible that synaptic mRNA [38]
from damaged nerve fibers were captured in our tissue samples, or that resident synovial
cells express a broad array of neural genes in response to inflammation. For example,
substance P [40]. Another limitation of this work is that our tissue samples were dissociated
into single cell suspensions prior to RNA fixation. Based on the significant numbers of
plasma cells counted by histology assessment, it is likely that the CIBERSORT inferred
plasma cell frequencies were lower than those observed by H&E due to plasma cell fragility
and death during processing. Additional processing artifacts include immune cell activation
due to various factors during dissociation. CIBERSORT was trained on microarray data and
can only detect 22 potential cell types, so it is quite possible that other cell types, such as
plasmablasts and peripheral helper T (TPH) [41] cells, are present but not annotated by
CIBERSORT. Yet one other limitation in this project is that the machine learning model was
trained and tested on just 45 synovial samples – a larger dataset would offer more
statistical power and we plan to run more samples in future work. Finally, the cohort
studied had longstanding disease and exposure to various treatments. Further assessment
of small joint tissue from early RA patients would be useful to characterize features
datasets identified three distinct molecular subtypes of RA that correlate with specific
Accepted Article
clinical phenotypes. The High inflammatory subtype is associated with high synovial and
characterized by high neuronal and glycoprotein gene expression, and pain scores that are
Figure Legends:
of H&E stained synovium: 10 synovial histology features retained for modeling. C) Images
obtained at 100x, scale bars indicate 200μm. D) Images obtained at 20x, scale bars
indicate 20μm.
Figure 3: Gene expression analysis of 45 synovial tissues identifies three distinct synovial
subtypes. A) Consensus clustering heatmaps using the top 500 most variable genes,
samples. Red denotes samples clustered together consistently, blue denotes samples do
not cluster together consistently and white denotes inconsistent co-clustering. Samples
were labeled green, orange and yellow according to partitioning obtained when constraining
the clustering algorithm to k=3 clusters. B) PCA plot of RNA-seq data using the top 500
most variable genes. Samples are colored according to cluster. C) Heatmap of normalized
gene expression of the top 1000 and 50 DEG across three clusters. Gene names are listed
expression.
Accepted Article
Figure 4: Gene expression characteristics of three synovial subtypes. A) Enrichment
levels from Low to Mixed to High. B) Enrichment scores of functional annotation clusters of
genes enriched in Low and Mixed relative to High inflammatory subtypes. Functional
groups with enrichment scores >3 are presented. C) CIBERSORT inferred fraction of
leukocyte cell types according to three synovial subtypes. *p<0.05 according to Kruskal-
Wallis test.
weights for 10 histology features separating High-inflammatory samples from others in the
histology scoring algorithm. C) Frequency distribution of raw histology scores among three
synovial subtypes classified by either RNA-seq clustering (left panel, N=39 RA patients) or
clinical laboratory results (Rank score 0=minimum, 1=25th percentile, 2=50th percentile,
3=75th percentile, 4=maximum) among three synovial subtypes classified by either gene
expression clustering (left panel, N=39 RA patients) or histology scoring algorithm (right
comparison test comparing the mean rank of each group to the Low inflammatory group.
Lower panel: Average incidence of various binary histology features. *p<0.05, **p<0.01,
Wallis test with Dunn’s multiple comparison test comparing the mean rank of each group to
Accepted Article
the Low inflammatory group. C) Log2 (plasma mean fluorescent intensity (MFI) of
antibodies to peptides divided by the median value per peptide) of the 3 putative RA-
associated autoantigens with significantly different levels among the three synovial
subtypes according to ANOVA with Tukey’s multiple comparison testing. Mean values for
samples from patients assigned to each synovial subtype are presented. Filaggrin=
Spearman’s correlation coefficients (ρ) of pain with either acute phase reactants across
Acknowledgements:
We would like to thank Nathalie Blachere for editorial assistance. Funding was provided by
the New York Genome Center, Rockefeller University grant # UL1 TR000043 from the
Clinical Translational Science Center (CTSC), and The Block Family Foundation. LI was
This work was supported by the Accelerating Medicines Partnership (AMP) in Rheumatoid
Arthritis and Lupus Network. AMP is a public-private partnership (AbbVie Inc., Arthritis
Research Alliance, Merck Sharp & Dohme Corp., National Institutes of Health, Pfizer Inc,
Inc.), created to develop new ways of identifying and validating promising biological targets
for diagnostics and drug development Funding was provided through grants from the
4. Klaasen, R., et al., The relationship between synovial lymphocyte aggregates and
the clinical response to infliximab in rheumatoid arthritis: a prospective study.
Arthritis Rheum, 2009. 60(11): p. 3217-24.
5. Dennis, G., Jr., et al., Synovial phenotypes in rheumatoid arthritis correlate with
response to biologic therapeutics. Arthritis Res Ther, 2014. 16(2): p. R90.
7. Krenn, V., et al., [Histopathological degeneration score of fibrous cartilage. Low- and
high-grade meniscal degeneration]. Z Rheumatol, 2010. 69(7): p. 644-52.
10. Pearle, A.D., et al., Elevated high-sensitivity C-reactive protein levels are associated
with local inflammatory findings in patients with osteoarthritis. Osteoarthritis
Cartilage, 2007. 15(5): p. 516-23.
11. Thurlings, R.M., et al., Synovial lymphoid neogenesis does not define a specific
clinical rheumatoid arthritis phenotype. Arthritis Rheum, 2008. 58(6): p. 1582-9.
12. Prieto-Potin, I., et al., Characterization of multinucleated giant cells in synovium and
subchondral bone in knee osteoarthritis and rheumatoid arthritis. BMC
Musculoskelet Disord, 2015. 16: p. 226.
14. Orlovskaya, G.V., P.Y. Muldiyarov, and I.S. Kazakova, Synovial plasma cells in
Accepted Article
rheumatoid arthritis. Electron microscope and immunofluorescence studies. Ann
Rheum Dis, 1970. 29(5): p. 524-32.
15. Perry, M.E., et al., Binucleated and multinucleated forms of plasma cells in synovia
from patients with rheumatoid arthritis. Rheumatol Int, 1997. 17(4): p. 169-74.
16. Volin, M.V., et al., Expression of mucin 3 and mucin 5AC in arthritic synovial tissue.
Arthritis Rheum, 2008. 58(1): p. 46-52.
17. Revell, P.A., et al., The synovial membrane in osteoarthritis: a histological study
including the characterisation of the cellular infiltrate present in inflammatory
osteoarthritis using monoclonal antibodies. Ann Rheum Dis, 1988. 47(4): p. 300-7.
18. Villeneuve, E., J. Nam, and P. Emery, 2010 ACR-EULAR classification criteria for
rheumatoid arthritis. Rev Bras Reumatol, 2010. 50(5): p. 481-3.
19. Dobin, A., et al., STAR: ultrafast universal RNA-seq aligner. Bioinformatics, 2013.
29(1): p. 15-21.
20. Leek, J.T. and J.D. Storey, Capturing heterogeneity in gene expression studies by
surrogate variable analysis. PLoS Genet, 2007. 3(9): p. 1724-35.
21. Love, M.I., W. Huber, and S. Anders, Moderated estimation of fold change and
dispersion for RNA-seq data with DESeq2. Genome Biol, 2014. 15(12): p. 550.
22. Dennis, G., Jr., et al., DAVID: Database for Annotation, Visualization, and Integrated
Discovery. Genome Biol, 2003. 4(5): p. P3.
23. Sokolove, J., et al., Autoantibody epitope spreading in the pre-clinical phase
predicts progression to rheumatoid arthritis. PLoS One, 2012. 7(5): p. e35296.
24. McHugh, M.L., Interrater reliability: the kappa statistic. Biochem Med (Zagreb),
2012. 22(3): p. 276-82.
25. Palmer, D.G., et al., Features of synovial membrane identified with monoclonal
antibodies. Clin Exp Immunol, 1985. 59(3): p. 529-38.
26. Smith, M.D., et al., Microarchitecture and protective mechanisms in synovial tissue
from clinically and arthroscopically normal knee joints. Ann Rheum Dis, 2003. 62(4):
p. 303-7.
27. Yeh, J.C., E. Ong, and M. Fukuda, Molecular cloning and expression of a novel
beta-1, 6-N-acetylglucosaminyltransferase that forms core 2, core 4, and I branches.
J Biol Chem, 1999. 274(5): p. 3215-21.
28. Leung, L.C., et al., Coupling of NF-protocadherin signaling to axon guidance by cue-
induced translation. Nat Neurosci, 2013. 16(2): p. 166-73.
31. Asahina, H., et al., Distribution of protocadherin 9 protein in the developing mouse
nervous system. Neuroscience, 2012. 225: p. 88-104.
32. Harvey, R.J., et al., GlyR alpha3: an essential target for spinal PGE2-mediated
inflammatory pain sensitization. Science, 2004. 304(5672): p. 884-7.
36. van Baarsen, L.G., et al., Synovial tissue heterogeneity in rheumatoid arthritis in
relation to disease activity and biomarkers in peripheral blood. Arthritis Rheum,
2010. 62(6): p. 1602-7.
37. Horton, S.C., C.A. Walsh, and P. Emery, Established rheumatoid arthritis: rationale
for best practice: physicians' perspective of how to realise tight control in clinical
practice. Best Pract Res Clin Rheumatol, 2011. 25(4): p. 509-21.
38. Meer, E.J., et al., Identification of a cis-acting element that localizes mRNA to
synapses. Proc Natl Acad Sci U S A, 2012. 109(12): p. 4639-44.
41. Rao, D.A., et al., Pathologically expanded peripheral T helper cell subset drives B
cells in rheumatoid arthritis. Nature, 2017. 542(7639): p. 110-114.