Newfoundland First Nation to study genetic links with ancient Beothuk – Global News

By StaffThe Canadian Press

Posted December 17, 2019 3:09 pm

-A

A+

A Newfoundland First Nation has announced a study of genetic links between its members and ancient Indigenous inhabitants of the island, including the Beothuk people.

Miawpukek First Nation announced the study this month, to be done in partnership with Terra Nova Genomics, Inc. and funded by a National Geographic Explorers grant of US$30,000.

READ MORE: Rare DNA quirk could reveal mysteries of Newfoundlands first settlers

Chief MiSel Joe says the study offers an opportunity to compare oral stories that trace family histories back to the Beothuk widely thought to be extinct with scientific evidence.

Researchers plan to begin looking at DNA testing kits from a sample group of 20 people, eventually expanding to assess samples from as many volunteers as possible.

Genetics professor Steven Carr with Terra Nova Genomics says the study is the largest of its kind with an Indigenous group in Canada.

Story continues below advertisement

READ MORE: Remains of two Beothuk people to be returned to Canada

Testing is set to begin in January and Carr says it may be a year or more before findings are ready for publication.

2019 The Canadian Press

Read this article:
Newfoundland First Nation to study genetic links with ancient Beothuk - Global News

Researcher taking the fight to cancer – Purdue Agricultural Communications

Tuesday, December 17th, 2019

by Kristen Lansing

Christopher Roberts peers into the eyepieces of a microscope and rolls the knobs back and forth until the specimen on the slide becomes perfectly clear. He carefully examines the slide of cancerous cells, looking for any changes that could indicate a breakthrough.

I see the cancerous cells in the zebrafish and plant cells and how they have changed since I last looked at them, said Roberts, a junior biochemistry major from Sheridan, Indiana.

Roberts focus is pre-medicine. At the moment, he is absorbed in cancer research, and his hands-on work has reinforced what he learned in the classroom. He started his research as a freshman, when he jumped at the chance to study genes that suppress cancer cells.

He focuses on two genes: the PKL pathway gene and CHD5. He studies how these genes work in Arabidopsis (flowering plants) because they are similar to the human genome.

Its challenging because its not something thats been done before, Roberts said. Its uncharted territory, so knowing what the next step is can be difficult sometimes.

The CHD5 gene can help suppress cancer cells. Roberts explained that humans can have a defective CHD5 gene or lack it altogether. These people, researchers have found, have a higher risk of getting cancer. By studying these genes, researchers (including Roberts) hope to understand why cancer happens.

Roberts interest in what causes cancer goes beyond the research lab. He is the head of Be the Match at Purdue, a philanthropic organization that connects volunteers with cancer patients who need life-saving bone marrow transplants.

This bone marrow registry allows participants to directly help patients with life-threatening cancers like leukemia and lymphoma. Roberts said he focuses on getting as many people as possible to register.

You can save a life just by doing a five-minute registration, Roberts said.

Matches are rare; however, in the two years Roberts has been involved with the group, he has known many people who have matched, including two of his Farmhouse Fraternity brothers. Matches are rare because the donor must be genetically similar to the patient in need.

Finding out that they got matched, which probably saved the kids life, is pretty awesome, Roberts said.

Many people develop cancer every day, and Roberts said the registry is an easy and effective way to help others.

Purdue may seem like an unusual choice to be a pre-med major because it doesnt have a medical school. But Roberts said that coming to Purdue, being involved with Be the Match, and working in the research lab solidified his dream of being a doctor. He said Purdues emphasis on research is a terrific experience that students may not get in other programs.

Ive been a part of this research lab for three years, and Ive gotten to the point where I could possibly be added to a scientific research paper, he said.

Being added as a contributor to a research paper is a considerable accomplishment for an undergraduate.

Roberts credited his high school biology teacher and their friendship for encouraging him to pursue medicine. This obviously meant seeking a school with a top-tier science program. Roberts visited several other big universities including Notre Dame and Ohio State, but he found something special about Purdue.

When I came to Purdue Biochem, it just felt like home to me, he said.

Roberts said the department staff is what really sold him on Purdue. When he visited campus, Roberts said three people (including Joseph Ogas, associate head and professor of biochemistry), personally showed him around the labs and demonstrated what they do. Roberts was impressed that a research professor would take time out of his busy schedule for that.

It is a big part of why he works in Ogas lab today.

He wants you to know and understand what it is youre doing, as well as he does, Roberts said. His door is always open and hes always willing to answer questions.

Roberts said the research he is doing is valuable, but he doesnt necessarily see himself in cancer research after graduation. Instead, he said he wants to focus on orthopedics or radiology.

I think the technological aspects are what draws me to both fields, Roberts said. Ive shadowed surgeons who dont even have to physically see the patient to diagnose the problem.

Luckily, he doesnt have to choose right away! Medical schools allow students to learn about each specialty area and hone their skills. Most students match with a specialty toward the end of their junior year or half way through senior year of med-school.

Oncology is a very competitive specialty to get into, Roberts said. Im planning on moving away from it [oncology] for a little while, but I guess Ill see when Im there what strikes my interests.

While Roberts has big aspirations for his future, for now you can find him in the lab peering into microscopes, looking for the answer to why cancer happens.

Purdue Biochemistry

Be the Match

Follow this link:
Researcher taking the fight to cancer - Purdue Agricultural Communications

Mitochondria Are the Canary in the Coal Mine for Cellular Stress – Technology Networks

Mitochondria, tiny structures present in most cells, are known for their energy-generating machinery. Now, Salk researchers have discovered a new function of mitochondria: they set off molecular alarms when cells are exposed to stress or chemicals that can damage DNA, such as chemotherapy. The results, published online in Nature Metabolism,could lead to new cancer treatments that prevent tumors from becoming resistant to chemotherapy.

"Mitochondria are acting as a first line of defense in sensing DNA stress. The mitochondria tell the rest of the cell, 'Hey, I'm under attack, you better protect yourself,'" says Gerald Shadel, a professor in Salk's Molecular and Cell Biology Laboratory and the Audrey Geisel Chair in Biomedical Science.

Most of the DNA that a cell needs to function is found inside the cell's nucleus, packaged in chromosomes and inherited from both parents. But mitochondria each contain their own small circles of DNA (called mitochondrial DNA or mtDNA), passed only from a mother to her offspring. And most cells contain hundreds--or even thousands--of mitochondria.

Shadel's lab group previously showed that cells respond to improperly packaged mtDNA similarly to how they would react to an invading virus--by releasing it from mitochondria and launching an immune response that beefs up the cell's defenses.

In the new study, Shadel and his colleagues set out to look in more detail at what molecular pathways are activated by the release of damaged mtDNA into the cell's interior. They homed in on a subset of genes known as interferon-stimulated genes, or ISGs, that are typically activated by the presence of viruses. But in this case, the team realized, the genes were a particular subset of ISGs turned on by viruses. And this same subset of ISGs is often found to be activated in cancer cells that have developed resistance to chemotherapy with DNA-damaging agents like doxyrubicin.

To destroy cancer, doxyrubicin targets the nuclear DNA. But the new study found that the drug also causes the damage and release of mtDNA, which in turn activates ISGs. This subset of ISGs, the group discovered, helps protect nuclear DNA from damage--and, thus, causes increased resistance to the chemotherapy drug. When Shadel and his colleagues induced mitochondrial stress in melanoma cancer cells, the cells became more resistant to doxyrubicin when grown in culture dishes and even in mice, as higher levels of the ISGs were protecting the cell's DNA.

"Perhaps the fact that mitochondrial DNA is present in so many copies in each cell, and has fewer of its own DNA repair pathways, makes it a very effective sensor of DNA stress," says Shadel.

Most of the time, he points out, it's probably a good thing that the mtDNA is more prone to damage--it acts like a canary in a coal mine to protect healthy cells. But in cancer cells, it means that doxyrubicin--by damaging mtDNA first and setting off molecular alarm bells--can be less effective at damaging the nuclear DNA of cancer cells.

"It says to me that if you can prevent damage to mitochondrial DNA or its release during cancer treatment, you might prevent this form of chemotherapy resistance," Shadel says.

His group is planning future studies on exactly how mtDNA is damaged and released and which DNA repair pathways are activated by the ISGs in the cell's nucleus to ward off damage.

Reference:Wu, Z., Oeck, S., West, A. P., Mangalhara, K. C., Sainz, A. G., Newman, L. E., Shadel, G. S. (2019). Mitochondrial DNA stress signalling protects the nuclear genome. Nature Metabolism. https://doi.org/10.1038/s42255-019-0150-8

This article has been republished from the following materials. Note: material may have been edited for length and content. For further information, please contact the cited source.

See the rest here:
Mitochondria Are the Canary in the Coal Mine for Cellular Stress - Technology Networks

Salk researchers uncover new function of mitochondria – News-Medical.net

Mitochondria, tiny structures present in most cells, are known for their energy-generating machinery. Now, Salk researchers have discovered a new function of mitochondria: they set off molecular alarms when cells are exposed to stress or chemicals that can damage DNA, such as chemotherapy. The results, published online in Nature Metabolism on December 9, 2019, could lead to new cancer treatments that prevent tumors from becoming resistant to chemotherapy.

Mitochondria are acting as a first line of defense in sensing DNA stress. The mitochondria tell the rest of the cell, 'Hey, I'm under attack, you better protect yourself.'"

Gerald Shadel, professor in Salk's Molecular and Cell Biology Laboratory and the Audrey Geisel Chair in Biomedical Science

Most of the DNA that a cell needs to function is found inside the cell's nucleus, packaged in chromosomes and inherited from both parents. But mitochondria each contain their own small circles of DNA (called mitochondrial DNA or mtDNA), passed only from a mother to her offspring. And most cells contain hundreds--or even thousands--of mitochondria.

Shadel's lab group previously showed that cells respond to improperly packaged mtDNA similarly to how they would react to an invading virus--by releasing it from mitochondria and launching an immune response that beefs up the cell's defenses.

In the new study, Shadel and his colleagues set out to look in more detail at what molecular pathways are activated by the release of damaged mtDNA into the cell's interior. They homed in on a subset of genes known as interferon-stimulated genes, or ISGs, that are typically activated by the presence of viruses. But in this case, the team realized, the genes were a particular subset of ISGs turned on by viruses. And this same subset of ISGs is often found to be activated in cancer cells that have developed resistance to chemotherapy with DNA-damaging agents like doxyrubicin.

To destroy cancer, doxyrubicin targets the nuclear DNA. But the new study found that the drug also causes the damage and release of mtDNA, which in turn activates ISGs. This subset of ISGs, the group discovered, helps protect nuclear DNA from damage--and, thus, causes increased resistance to the chemotherapy drug. When Shadel and his colleagues induced mitochondrial stress in melanoma cancer cells, the cells became more resistant to doxyrubicin when grown in culture dishes and even in mice, as higher levels of the ISGs were protecting the cell's DNA.

"Perhaps the fact that mitochondrial DNA is present in so many copies in each cell, and has fewer of its own DNA repair pathways, makes it a very effective sensor of DNA stress," says Shadel.

Most of the time, he points out, it's probably a good thing that the mtDNA is more prone to damage--it acts like a canary in a coal mine to protect healthy cells. But in cancer cells, it means that doxyrubicin--by damaging mtDNA first and setting off molecular alarm bells--can be less effective at damaging the nuclear DNA of cancer cells.

"It says to me that if you can prevent damage to mitochondrial DNA or its release during cancer treatment, you might prevent this form of chemotherapy resistance," Shadel says.

His group is planning future studies on exactly how mtDNA is damaged and released and which DNA repair pathways are activated by the ISGs in the cell's nucleus to ward off damage.

Source:

Journal reference:

Wu, Z., et al. (2019) Mitochondrial DNA stress signalling protects the nuclear genome. Nature Metabolism. doi.org/10.1038/s42255-019-0150-8.

Here is the original post:
Salk researchers uncover new function of mitochondria - News-Medical.net

GenScript Biotech to Host Global Forum on Cell and Gene Therapy and the Booming China Market During JPM Week – BioSpace

NANJING, China, Dec. 16, 2019 /PRNewswire/ -- GenScript Biotech Corp., one of the leadingbiotechnology companies inChina, today announcedits inaugural GenScript Biotech Global Forum on Jan. 14 in San Francisco, coinciding with the JP Morgan Healthcare Conference week. The Forum, exploring the theme "Cell and Gene Therapy and the Booming China Market," will feature gene and cell therapy leaders in industry, academia and the investment community and is expected to draw several hundred attendees.

"Advancements in cell and gene therapy have attracted global attention in recent years, as the promise of bringing life-changing treatments to cancer patients and others comes closer to reality," said Frank Zhang, PhD., founder and CEO of GenScript. "GenScript's Global Forum aims to foster closer collaborations among scientists, regulators, and industry, not just in the booming China market but around the globe. We hope that by working together we can advance the industry and accelerate drug development."

GenScript's Global Forum, will take place from 1:30 p.m. to 5:30 p.m. at the Grand Hyatt San Francisco. Highlights of the agenda include:

For more information about the Forum and to register for the event please visit hereor https://www.genscript.com/biotech-global-forum-2020.html.

About GenScript Biotechnology

GenScript Biotech Corporation (Stock Code: 1548.HK) is a global biotechnology group. GenScript's businesses encompass four major categories based on its leading gene synthesis technology, including operation as a Life Science CRO, enzyme and synthetic biology products, biologics development and manufacturing, as well as cell therapy.

Founded in 2002 and listed on the Hong Kong Stock Exchange in 2015, GenScript has an established global presence across Greater China, North America, the EU, and Asia Pacific. Today, over 300,000 customers from over 160 countries and regions around the world have used GenScript's premier, convenient, and reliable products and services.

GenScript currently has more than 2900 employees globally, 34% of whom hold master's and/or Ph.D. degrees. In addition, GenScript has a number of leading commercial technologies, including more than 100 patents and over 270 patent applications. As of June 2019, GenScript's products and services have been cited by 40,300 scientific papers worldwide.

GenScript is committed to striving towards its vision of being the most reliable biotech company in the world to make humans and nature healthier through biotechnology.

For more information, please visit https://www.genscript.com/

Contact:

Corporate:Fiona CheCorporate Communication Manager, GenScript+86 -025-58897288-6321Fiona.che@genscript.com

MediaSusan ThomasPrincipal, Endpoint Communications(619) 540-9195susan@endpointcommunications.net

View original content to download multimedia:http://www.prnewswire.com/news-releases/genscript-biotech-to-host-global-forum-on-cell-and-gene-therapy-and-the-booming-china-market-during-jpm-week-300975054.html

SOURCE GenScript Biotech Corp.

Read the original post:
GenScript Biotech to Host Global Forum on Cell and Gene Therapy and the Booming China Market During JPM Week - BioSpace

Penn researchers track defective sperm epigenome linked to male infertility – News-Medical.net

One out of eight couples has trouble conceiving, with nearly a quarter of those cases caused by unexplained male infertility. For the past decade, research has linked that infertility to defective sperm that fail to "evict" proteins called histones from DNA during development. However, the mechanisms behind that eviction and where this is happening in the sperm DNA has remained both controversial and unclear.

Now, researchers at Penn Medicine show, using newer genome-wide DNA sequencing tools, the precise genetic locations of those retained histones, as well as a key gene regulating it. The findings were published in Developmental Cell.

Taking it a step further, the researchers created a new mouse model with a mutated version of the gene, Gcn5, which allows investigators to closely track the defects in sperm from the early stages of sperm development through fertilization and on. This is an important step forward as it could lead to a better understanding of not only infertility in men -- and ways to potentially reverse it -- but also the suspected epigenetic mutations being passed onto the embryo from males either naturally or through in vitro fertilization.

Epigenetics, the factors influencing an organism's genetics that are not encoded in the DNA, play a strong role in sperm and egg formation.

For men who have unexplained infertility, everything may look normal at the doctors: normal semen counts, normal motility. Yet they can still have problems conceiving. One explanation for persistent problems is histones being in the wrong location, which may affect sperm and then early development. Now, we have a really good model to study what happens when you don't get rid of the histones appropriately in the sperm and what that may look like in the embryo."

Lacey J. Luense, PhD, first author, research associate in the lab of study senior author, Shelley L. Berger, PhD, the Daniel S. Och University Professor in the departments of Cell and Developmental Biology and Biology, and director of the Penn Epigenetics Institute

Healthy sperm lose 90 to 95 percent of histones, the main proteins in chromatin that package DNA and turn genes on and off, and replace them with protamines, which are smaller proteins able to properly pack the DNA into tiny sperm. Given the role of retained histones in infertility and embryonic development, there is great interest in determining the genomic locations so they could potentially be utilized for further study and ultimately treatment.

Past studies have produced conflicting results on the whereabouts of histones. A technology known as MNase-sequencing that uses an enzymatic reaction to pinpoint location has placed the retained histones on important gene promotors. Other studies with the same approach found histones at DNA repeats and placed in so-called "gene deserts," where they play less of a role in regulation.

"There has been controversy in the field trying to understand these discrepant data," Luense said. "In this new study, we found that both of these previously described models are correct. We find histones on genes that appear to be important for embryo development, but we also find them at repetitive elements, places that do need to be turned off and to prevent expression of these regions in the embryo."

The researchers applied a technology known as ATAC-sequencing, a more precise and faster approach, to track waves of histones at unique sites across the genome during the early and late stages of sperm development in mice. ATAC-seq can identify parts of the genome open and closed -- in this case, regions that retain the sperm histones -- and then make a cut and tag the DNA, which can then be sequenced.

In the mouse models created with the mutated Gcn5 gene, the researchers found these mice to have very low fertility. The researchers also showed that retained histones in normal mice sperm correlated with histone positions in very early embryos, supporting the hypothesis that paternal histones transfer epigenetic information to the next generation.

Having this type of mutant model gives scientists a tool to closely study the mechanisms underlying the mutated sperm's trajectory and understand what effect it may have on the embryo and in development. It also opens an opportunity to study potential therapeutic targets.

"Right now, the burden of IVF and other assisted-reproductive technologies fall on women. Even it's the male factor, it's still women who have to go through hormone injections and procedures," Berger said. "Now imagine being able to apply epigenetic therapeutics to change the levels of histones and protamines in males before embryogenesis? That's one of the questions we want to explore and this model will allow us to move toward that direction."

There are numerous available epigenetic drugs used to treat cancer and other diseases. Given their mechanisms, treating sperm with drugs to increase histone eviction is one potential route to explore.

Limitations with human embryos in science have led to a lack of overall research on infertility and the role of the father's epigenome on embryo development, which underscores the importance of studies such as this, the researchers said.

"There are a lot different factors that can alter the sperm epigenome: diet, drugs, alcohol, for example," Luense said. "We are just now starting to understand how that can affect the child and affect development. These initial, basic studies that we are doing are critical, so we can better understand what's driving these epigenetic mutations."

Go here to see the original:
Penn researchers track defective sperm epigenome linked to male infertility - News-Medical.net

BioRestorative Therapies Featured in IEEE Pulse Magazine’s Cover Story About Stem Cell Therapies for Low Back Pain – GlobeNewswire

MELVILLE, N.Y., Dec. 16, 2019 (GLOBE NEWSWIRE) -- BioRestorative Therapies, Inc. (BioRestorative or the Company) (OTC: BRTX), a life sciences company focused on stem cell-based therapies, announced today feature coverage in the news outlet, IEEE Pulse, a magazine of the IEEE Engineering in Medicine and Biology Society. According to IEEE, it is the worlds largest technical professional organization for the advancement of technology.

To view the IEEE Pulse Magazines article featuring BioRestorative, click here.

The published cover-story article features commentary from Francisco Silva, Chief Scientist and Vice President of Research and Development for BioRestorative, regarding BRTX-100, the Companys lead therapeutic candidate for chronic lumbar disc disease. Once the U.S. Food and Drug Administration (FDA) authorizes the sale of BRTX-100, we would ship it to your doctor, and with a 30-minute procedure the material would be injected into your disc in a 1.5 ml solution, explains Silva. He elaborates on the product, discussing growing and expanding stem cells from the patients bone marrow under hypoxic conditions that mimic those in the normal intervertebral space. We are enriching the cells to be able to survive in this harsh environment, says Silva.

In addition to BRTX-100, the magazine article also highlights BioRestoratives other research pursuit, its ThermoStem program, utilizing brown adipose (fat) derived stem cells to target treatment of metabolic diseases and disorders, like diabetes, obesity and hypertension.

About BioRestorative Therapies, Inc.

BioRestorative Therapies, Inc. (www.biorestorative.com) develops therapeutic products using cell and tissue protocols, primarily involving adult stem cells. Our two core programs, as described below, relate to the treatment of disc/spine disease and metabolic disorders:

Disc/Spine Program (brtxDISC): Our lead cell therapy candidate, BRTX-100, is a product formulated from autologous (or a persons own) cultured mesenchymal stem cells collected from the patients bone marrow. We intend that the product will be used for the non-surgical treatment of painful lumbosacral disc disorders. The BRTX-100 production process utilizes proprietary technology and involves collecting a patients bone marrow, isolating and culturing stem cells from the bone marrow and cryopreserving the cells. In an outpatient procedure, BRTX-100 is to be injected by a physician into the patients damaged disc. The treatment is intended for patients whose pain has not been alleviated by non-invasive procedures and who potentially face the prospect of surgery. We have received authorization from the Food and Drug Administration to commence a Phase 2 clinical trial using BRTX-100 to treat persistent lower back pain due to painful degenerative discs.

Metabolic Program (ThermoStem): We are developing a cell-based therapy to target obesity and metabolic disorders using brown adipose (fat) derived stem cells to generate brown adipose tissue (BAT). BAT is intended to mimic naturally occurring brown adipose depots that regulate metabolic homeostasis in humans. Initial preclinical research indicates that increased amounts of brown fat in the body may be responsible for additional caloric burning as well as reduced glucose and lipid levels. Researchers have found that people with higher levels of brown fat may have a reduced risk for obesity and diabetes.

Forward-Looking Statements

This press release contains "forward-looking statements" within the meaning of Section 27A of the Securities Act of 1933, as amended, and Section 21E of the Securities Exchange Act of 1934, as amended, and such forward-looking statements are made pursuant to the safe harbor provisions of the Private Securities Litigation Reform Act of 1995. You are cautioned that such statements are subject to a multitude of risks and uncertainties that could cause future circumstances, events or results to differ materially from those projected in the forward-looking statements as a result of various factors and other risks, including, without limitation, whether the Company will be able to consummate the private placement and the satisfaction of closing conditions related to the private placement and those set forth in the Company's Form 10-K filed with the Securities and Exchange Commission. You should consider these factors in evaluating the forward-looking statements included herein, and not place undue reliance on such statements. The forward-looking statements in this release are made as of the date hereof and the Company undertakes no obligation to update such statements.

CONTACT:Email: ir@biorestorative.com

Follow this link:
BioRestorative Therapies Featured in IEEE Pulse Magazine's Cover Story About Stem Cell Therapies for Low Back Pain - GlobeNewswire

Characterizing smoking-induced transcriptional heterogeneity in the human bronchial epithelium at single-cell resolution – Science Advances

Abstract

The human bronchial epithelium is composed of multiple distinct cell types that cooperate to defend against environmental insults. While studies have shown that smoking alters bronchial epithelial function and morphology, its precise effects on specific cell types and overall tissue composition are unclear. We used single-cell RNA sequencing to profile bronchial epithelial cells from six never and six current smokers. Unsupervised analyses led to the characterization of a set of toxin metabolism genes that localized to smoker ciliated cells, tissue remodeling associated with a loss of club cells and extensive goblet cell hyperplasia, and a previously unidentified peri-goblet epithelial subpopulation in smokers who expressed a marker of bronchial premalignant lesions. Our data demonstrate that smoke exposure drives a complex landscape of cellular alterations that may prime the human bronchial epithelium for disease.

The human bronchus is lined with a pseudostratified epithelium that acts as a physical barrier against exposure to harmful environmental insults such as inhaled toxins, allergens, and pathogens (1, 2). The bronchial epithelium is a complex tissue, predominantly composed of ciliated, goblet, club, and basal epithelial cells. These cell types cooperate to perform mucociliary clearance, which is the process that mediates the capture and removal of inhaled substances (1, 2). Goblet cells secrete components of a mucosal lining that entraps inhaled particulate matter, which is propelled out of the airways by mechanical beating of ciliated cells (1, 2). Club cells have both secretory (3) and progenitor (4) functions, and basal cells are multipotent progenitors responsible for normal tissue homeostasis (57). Interplay among these cells is required for proper function and long-term maintenance of the bronchial epithelium, but exposure to substances, such as tobacco smoke, might alter or injure specific cell types and lead to tissue-wide dysfunction.

Inhalation of tobacco smoke exposes the bronchial epithelium to toxins, carcinogens, and free radicals (811), but cellular injuries and abnormalities associated with this exposure are complex and have not been fully characterized. Previous studies have described smoking-induced epithelial changes, such as increased goblet cell numbers (1214) and reduced ciliary length (15, 16). Robust transcriptomic alterations have also been observed in the bronchial epithelium of smokers, involving the up-regulation of genes linked to xenobiotic metabolism and the oxidative stress response (17, 18). Furthermore, it has been reported that a subset of gene expression alterations detected in smokers persists years after smoking cessation (18). However, the aforementioned transcriptomic studies profiled bronchial tissue in bulk, masking cell typespecific contributions to the smoking-associated gene expression signature.

To overcome the limitations of bulk tissue analyses, we used single-cell RNA sequencing (scRNA-Seq) to profile the transcriptomes of individual bronchial cells from healthy never and current smokers. We identified bronchial subpopulations using an unsupervised machine learning algorithm and immunostained bronchial tissue from independent cohorts of never and current smokers to validate robust smoking-associated findings. In the airways of smokers, we described a metabolic response specific to ciliated cells, a shift in the presence of club and goblet cells, and the emergence of a previously uncharacterized epithelial subpopulation.

Bronchial brushings were procured by bronchoscopy from the right mainstem bronchus of six healthy current smokers and six healthy never smokers (table S1), and single ALCAM+ epithelial cells (19) and CD45+ white blood cells (WBCs) were isolated from each donor (Fig. 1A and fig. S1). The CEL-Seq scRNA-Seq protocol (20) was used to profile the transcriptomes of 84 epithelial cells and 11 WBCs from each of the 12 donors (1140 total cells: 1008 epithelial cells and 132 WBCs). Low-quality cells were excluded from downstream analyses, leaving 796 cells (753 epithelial cells and 43 WBCs) (figs. S2 and S3) expressing an average of 1817 genes per cell. Expression of known marker genes for bronchial cell types was detected in largely nonoverlapping cells, including KRT5 for basal cells, FOXJ1 for ciliated cells, SCGB1A1 for club cells, MUC5AC for goblet cells, and CD45 for WBCs (Fig. 1B). Given the relatively small number of subjects, we sought to determine whether smoking-associated gene expression changes identified in these donors reflected those observed in a larger, independent cohort of never and current smokers. Data from all cells procured from each donor were combined to generate in silico bulk bronchial brushings. Analysis of differential expression between never and current smoker in silico bulk samples revealed associations that were highly correlated (Spearmans r = 0.45) with those observed in a previously published bulk bronchial brushing dataset generated by microarray (fig. S4) (18).

(A) Bronchial brushings were procured from the right mainstem bronchus of six never smokers and six current smokers. Bronchial tissue was dissociated, single cells were isolated by fluorescence-activated cell sorting (FACS), and single-cell RNA libraries were prepared and sequenced. (B) t-distributed stochastic neighbor embedding (t-SNE) was performed to illustrate transcriptomic relationships among cells. Donor smoking status (NS, never smoker; CS, current smoker) was visualized for each cell as well as expression of bronchial cell type marker genes [z-normalized transcripts per million (TPM) values] across all cells: KRT5 (basal), FOXJ1 (ciliated), SCGB1A1 (club), MUC5AC (goblet), and CD45 (WBC). (C) An unsupervised analytical approach (LDA) was used to identify distinct cell clusters and sets of coexpressed genes. Cell clusters were defined by unique gene set expression patterns, and never or current smoker cell enrichment was assessed.

To characterize cellular subpopulations beyond known cell type markers, we used latent Dirichlet allocation (LDA) as an unsupervised framework to assign cells to clusters and identify distinct sets of coexpressed genes across all cells (Fig. 1C). LDA divided the dataset into 13 distinct cell clusters and 19 sets of coexpressed genes (Fig. 2, A and B, and figs. S5 to S8). Each cell cluster was defined by the expression of a unique combination of gene sets, and each gene set was defined by a unique expression pattern among clusters (Fig. 2, A and B, and fig. S9). Cell types were defined for 8 of the 13 clusters based on medium to high marker gene expression: Cell clusters C-2 and C-4 expressed KRT5, C-5 and C-11 expressed FOXJ1, C-1 and C-8 expressed SCGB1A1, and C-3 expressed MUC5AC (Fig. 2C). Cluster C-7 expressed WBC marker CD45 (Fig. 2C), and Fishers exact test was used to show that C-7 was enriched with sorted CD45+ cells (P = 9.6 1047, Fishers exact test). C-7 cells also expressed several T cell receptor genes (e.g., TRBC2 and TRGC1), indicating a T cell lineage (fig. S10). Low levels of SCGB1A1 transcripts were detected in cluster C-10 (SCGB1A1low), and CFTR was expressed by cluster C-13, which suggests that these cells may be ionocytes (fig. S11) (21). Marker gene expression was not detected in clusters C-6, C-9, and C-12 (Fig. 2C). Enrichment [false discovery rate (FDR) q < 0.05] of current smoker cells was observed in goblet cell cluster C-3, as well as C-9 and C-12, whereas that of never smoker cells was observed in club cell cluster C-1 and basal cluster C-4 (Fig. 2D). Donor-specific contributions of cells to each cluster were variable; however, most of the never and current smokers contributed to each never and current smokerassociated cell cluster, respectively (fig. S12). Furthermore, a subset of gene sets expressed by specific clusters of ciliated, club, goblet, and basal cells, as well as those without a cell type designation, was differentially expressed between never and current smokers in transcriptomic data generated from bulk bronchial tissue (Fig. 2, A and B, and fig. S13) (18). Therefore, smoking-induced gene expression changes reported in bulk tissue are likely driven by alterations to multiple bronchial cell types.

(A) Global transcriptomic profiles of 13 bronchial cell clusters were defined by expression of unique combinations of 19 gene sets and visualized by heatmap (z-normalized TPM values). (B) A MetaGene was generated for each gene set (GS-1 to GS-19), and mean cluster-specific expression was designated: high (pink), medium (white), low (light gray), or not expressed (dark gray). (C) Mean expression of marker genes was summarized for each cluster designated: high (pink), medium (white), low (light gray), or not expressed (dark gray). (D) Per-cluster percentage of total cells and the ratio of never and current smoker cells were calculated, and per-cluster statistical enrichment (FDR q < 0.05, indicated in blue) of NS or CS cells was assessed.

We characterized transcriptomic similarities and differences among FOXJ1+ clusters C-5 and C-11 to define ciliated cell subpopulations detected in never and current smokers. Our data revealed that both clusters of ciliated cells expressed gene set GS-2 but could be differentiated based on expression of gene set GS-3 by cluster C-5 and gene set GS-7 by cluster C-11 (Fig. 3A). GS-2 contains FOXJ1, in addition to genes involved with ciliary assembly, maintenance, and function, such as motor protein genes (e.g., DYNLL1 and DNAH9) and intraflagellar transport genes (e.g., IFT57 and IFT172) (Fig. 3A and extended table S3). GS-2 also includes antioxidant genes (e.g., PRDX5, GPX4, and GSTA2), known transcriptional regulators of ciliogenesis [e.g., RFX2 (22, 23) and RFX3 (24, 25)], and surface proteins not previously attributed to ciliated cells (e.g., CDHR3 and CD59). GS-3 contains genes with known roles in airway ciliary biology, such as IFT88 (required for ciliary formation) (2628) and DNAH5 (required for ciliary motility) (2931). By contrast, gene set GS-7 is enriched with cell cycleassociated genes (extended table S3), such as CDK1 and CCNB1 (G1-S transition) and TOP2A (S-phase DNA replication), as well as the transcription factor HES6. Therefore, clusters C-5 and C-11 likely represent functionally distinct subpopulations of FOXJ1+ ciliated cells.

(A) Expression of gene sets GS-2, GS-3, and GS-7 in clusters C-5 and C-11 was visualized by heatmap (z-normalized TPM values). (B) Cluster C-5 was split into never and current smoker subsets, and expression of GS-8 genes was visualized by heatmap. (C) Bronchial tissue procured from an independent cohort of never and current smokers (UMCG cohort, table S2) was immunostained for AKR1B10, Ac--Tub, and KRT8. Representative images of never smoker (left) and current smoker (right) tissue were displayed. Arrows specify examples of AKR1B10+ ciliated cells (Ac--Tub+). (D) An increase in tissue length (m)normalized numbers of AKR1B10+ Ac--Tub+ cells was observed in current smokers relative to never smokers [P = 7.4 107, Wilcoxon rank-sum (WRS) test].

We found that ciliated cells from current smokers expressed a distinct transcriptional signature. Specifically, the current smoker subset of cluster C-5 FOXJ1+ cells expressed gene set GS-8, which was enriched with genes encoding enzymes implicated in aldehyde and ketone metabolism, such as ALDH3A1, AKR1C1, and AKR1B10 (Fig. 3B). This finding suggested that the gene expression response to toxic aldehydes and ketones present in tobacco smoke (8, 9) might be restricted to ciliated epithelial cells. To confirm that this set of enzymes localized to ciliated cells, we immunostained bronchial tissue procured from an independent cohort of never and current smokers [University Medical Center Groningen (UMCG) cohort, table S2] for the aldo-keto reductase AKR1B10, as well as cilia-specific acetylated -tubulin (Ac--Tub) and the luminal cytokeratin KRT8, which is expressed by all nonbasal cells (Fig. 3C). We found that AKR1B10 was robustly expressed in the airways of current smokers, and numbers of AKR1B10+ ciliated cells were significantly higher than those observed in never smokers (P = 7.4 107; Fig. 3, C and D). AKR1B10 was detected throughout the cytoplasm of smoker ciliated cells, as well as at the base of the cilia (Fig. 3C). AKR1B10+ ciliated cells were uncommon in never smokers, and overall low magnitude of AKR1B10 expression was observed in these cells (Fig. 3C). We detected rare instances of nonciliated AKR1B10+ KRT8+ cells (fig. S14A), but AKR1B10+ KRT8 cells were not observed. We also confirmed that AKR1B10 was not expressed by current smoker MUC5AC+ goblet cells (fig. S14B). Overall, these results demonstrate that ciliated cells express a specific set of detoxification genes in response to smoke exposure.

Our data revealed that the largest cluster of SCGB1A1+ cells, C-1, was enriched with never smoker cells (Fig. 2D), indicating that this subpopulation of club cells was depleted from the airways of smokers. C-1 cells distinctly expressed high levels of gene set GS-19, which contains MUC5B, in addition to SCGB3A1 and transcription factors TCF7, FOS, and JUN (Fig. 4A). However, SCGB1A1 (included in gene set GS-17) was also highly expressed by cluster C-8, which was not affected by smoking status (Fig. 2D). Therefore, these results indicate that smoking is associated with a decrease in MUC5B+ SCGB1A1+ (C-1) club cell content. Furthermore, gene set GS-13, which contains immunologically relevant genes BPIFB1 (32) and PIGR (33) (Fig. 4A), was expressed by SCGB1A1+ cells (C-1 and C-8) as well as MUC5AC+ cluster C-3, indicating that there may be functional overlap among club and goblet cells.

(A) Expression of gene sets GS-19, GS-17, GS-13, and GS-1 in clusters C-1, C-8, and C-3 was visualized by heatmap (z-normalized TPM values). Bronchial tissue procured from an independent cohort of never and current smokers (UMCG cohort, table S2) was immunostained for MUC5B and MUC5AC. (B) Representative images of never smoker tissue, MN current smoker tissue, and current smoker GCH were displayed. Arrows specify examples of MUC5B+, MUC5B+ MUC5AC+, and MUC5AC+ cells. Changes in tissue length (m)normalized numbers of (C) MUC5B+ cells (MN decrease, P = 0.02; GCH decrease, P = 1.8 105), (D) MUC5B+ MUC5AC+ cells (GCH decrease, P = 0.02), and (E) MUC5AC+ cells (MN increase, P = 1.5 106; GCH increase, P = 7.4 107) were observed (WRS test) in current smoker MN and GCH tissue relative to never smokers (WRS test). (F) Average proportions of MUC5B+, MUC5B+ MUC5AC+, and MUC5AC+ cells observed in never smokers, as well as MN and GCH current smoker tissue are displayed.

The MUC5AC+ goblet cell cluster C-3 was significantly enriched with current smoker cells (Fig. 2D), which is consistent with previous studies showing that smoking is associated with increased bronchial goblet cell abundance (1214). Cluster C-3 expressed gene set GS-1, which contains the goblet cell marker gene MUC5AC as well as several genes with known roles in goblet cell biology, such as SPDEF (34), AGR2 (35), and TFF3 (36) (Fig. 4A). Genes associated with the unfolded protein response are present in GS-1 (e.g., KDLER3 and DNAJC10) (extended table S3). We also identified several unique goblet cell surface markers (e.g., CLDN10, TSPAN8, and TSPAN13), as well as a transcription factor (NKX3-1) whose role in the goblet cell transcriptional program is unknown (Fig. 4A). Therefore, these data indicate that smoking is associated with increased numbers of MUC5AC+ goblet cells.

To confirm smoking-associated shifts in club and goblet cell numbers, we immunostained bronchial tissue procured from an independent cohort of never and current smokers (UMCG cohort, table S2) for markers of club (MUC5B) and goblet (MUC5AC) cells (Fig. 4B). Imaging data revealed cell subpopulations that exclusively express MUC5B or MUC5AC, as well as those that coexpress both MUC5B and MUC5AC (Fig. 4B). The airways of never smokers contained similar numbers of MUC5B+, MUC5B+, MUC5AC+, and MUC5AC+ cells (Fig. 4, B and F). The bronchial epithelium of current smokers, however, took on two distinct phenotypes: tissue regions described as morphologically normal (MN), which were similar to never smokers, and regions characterized by high MUC5AC+ cell density, referred to as goblet cell hyperplasia (GCH) (Fig. 4B and fig. S15). In the MN smoker tissue, we observed a significant decrease in MUC5B+ cells (P = 0.02) (Fig. 4C) and a significant increase in MUC5AC+ cells (P = 1.5 106) (Fig. 4E), relative to never smokers, but no change in MUC5B+ MUC5AC+ content was observed (Fig. 4D). Differences between smoker GCH and never smoker epithelium, however, were more pronounced. Near-complete loss of MUC5B+ cells was observed in smoker GCH (P = 1.8 105; Fig. 4C), along with a significant loss of MUC5B+ MUC5AC+ cells (P = 0.02; Fig. 4D), relative to never smokers. GCH-associated alterations were accompanied by a 13-fold increase in MUC5AC+ cells (P = 7.4 107; Fig. 4, E and F). Additional immunostaining for KRT5 expression in the same bronchial tissue revealed that basal cell content was not affected by smoking status and did not vary between MN and GCH regions (fig. S16). Overall, these findings indicate that smoking is associated with a loss of club cells, increased numbers of goblet cells, and substantial GCH airway remodeling.

We sought to establish the identity of cluster C-9, which was strongly enriched with current smoker cells and did not express established cell type marker genes (e.g., KRT5, FOXJ1, SCGB1A1, and MUC5AC) (Fig. 2C). C-9 cells expressed high levels of gene set GS-12, which contains the luminal cytokeratin KRT8 (Fig. 5A). Additional cytokeratin genes were also present in GS-12, such as KRT13 and KRT19, as well as antioxidant genes, such as TXN and GPX1 (Fig. 5A). Cluster C-9 also expressed gene set GS-16, which was detected at low levels in MUC5AC+ cells (C-3) and contained the xenobiotic metabolism gene CYP1B1 (Fig. 5A). Furthermore, high expression of gene set GS-15 was detected in both C-9 and MUC5AC+ cells (C-3) (Fig. 5, A to C), suggesting that this cluster may have a functional relationship with goblet cells. GS-15 contains several genes previously reported to be persistently up-regulated after smoking cessation (e.g., CEACAM5, CEACAM6, and UPK1B) (18), one of which has been explicitly linked to lung squamous cell carcinoma (SCC) and premalignancy (CEACAM5) (37).

(A) Expression of gene sets GS-12, GS-16, GS-15, and MUC5AC in clusters C-3 and C-9 was visualized by heatmap (z-normalized TPM values). (B) t-SNE was used to visualize cluster C-3 and C-9 cells as well as (C) CEACAM5 expression (z-normalized TPM values) across all cells. (D) Bronchial tissue procured from an independent cohort of never and current smokers (UCL cohort, table S3) was immunostained for CEACAM5, KRT8, and MUC5AC. Representative images of never smoker tissue and current smoker GCH were displayed. Arrows specify examples of CEACAM5+ KRT8+ MUC5AC PG cells. (E) A significant increase in tissue length (m)normalized numbers of CEACAM5+ KRT8+ MUC5AC cells in current smoker GCH tissue, relative to never smokers, was observed (P = 0.004, WRS test).

To validate the presence of cluster C-9 cells in the airways of current smokers, we immunostained bronchial tissue procured from a second independent cohort of never and current smokers [University College London (UCL) cohort, table S3] for KRT8, MUC5AC (goblet cells), and Ac--Tub (ciliated cells). KRT8+ MUC5AC Ac--Tub cells that were morphologically distinct from goblet and ciliated cells were detected in significantly higher numbers in GCH regions of current smokers relative to never smokers (fig. S17). To confirm that there was functional overlap between goblet cells and this subpopulation of KRT8+ MUC5AC Ac--Tub cells, we immunostained bronchial tissue (UCL cohort, table S3) for CEACAM5, in addition to KRT8 and MUC5AC. Increased numbers of CEACAM5+ KRT8+ MUC5AC cells were detected in GCH regions of current smokers relative to never smokers (P = 0.004) (Fig. 5, D and E), although variable content among donors was observed. Within current smoker GCH tissue regions, CEACAM5+ KRT8+ MUC5AC cells were typically found in close proximity to goblet cells (CEACAM5+ KRT8+ MUC5AC+) and were therefore named peri-goblet (PG) cells (UCL cohort, Fig. 5D; UMCG cohort, fig. S18). CEACAM5 expression in goblet cells was phenotypically punctate and colocalized with MUC5AC in both never and current smokers (Fig. 5D and fig. S18). In PG cells, however, CEACAM5 localized to the plasma membrane and cytoplasm (Fig. 5D and fig. S18). Overall, these data indicate that PG cells are a previously unidentified, bronchial epithelial subpopulation associated with smoking-induced GCH.

Previous transcriptomic studies have shown that smoking is associated with a robust bronchial gene expression signature (17, 18). Interrogation of bronchial tissue at single-cell resolution revealed that elements of this signature were derived from different cell subpopulations. Overall, we found smoking-associated phenotypes that included a metabolic response that localized to ciliated cells, a cell type shift that involved club cell loss and goblet cell expansion, and a previously uncharacterized subpopulation of PG epithelial cells present within regions of GCH (fig. S19).

We identified a gene set (GS-8) specifically expressed by smoker ciliated cells (C-5) that contains genes encoding families of enzymes, such as aldehyde dehydrogenases (e.g., ALDH3A1 and ALDH1A3) and aldo-keto reductases (e.g., AKR1B10 and AKR1C1), capable of breaking down tobacco smokederived chemical compounds, such as toxic aldehydes (e.g., formaldehyde and acrolein) and ketones (e.g., acetone and methyl vinyl ketone) (8, 9). This finding suggests that ciliated cells exhibit a cell typespecific coping mechanism that may convey resistance to certain forms of smoking-induced toxicity. Links between this mechanism and previously reported smoking phenotypes, such as reduced ciliary length (15), however, are unclear. This finding might also highlight a protective function with tissue-wide significance, in which the bronchial epitheliums capacity for detoxification may be compromised if ciliated cells are lost because of injury or disease.

Several studies have reported that smoking is associated with increased mucous production and GCH in the bronchus (1214, 3840). Loss of club cells (SCGB1A1+) has been reported in smoker bronchioles (11, 12), but this is the first instance in which a similar observation has been made in the mainstem bronchus. We confirmed that GCH is a regional phenomenon interspersed among MN tissue areas. The determinants of GCH prevalence are unclear, but it has been shown that cytokines [e.g., interleukin-13 (IL-13) and (IL-4)] (4143) and viral infection (e.g., Rhino virus and polyinosinic:polycytidylic acid) (44, 45) can increase MUC5AC expression and goblet cell abundance. The specific catalyst for GCH in response to smoke exposure is unknown, but reports of its co-occurrence with airway inflammation suggest that immunological interplay may be a factor (14). Furthermore, there is evidence that both basal and club cells are capable of goblet cell differentiation (32, 46). However, the origins of newly formed goblet cells in the airways of smokers have not been explicitly described. Functional implications for goblet cell expansion and club cell loss are unclear, but a similar phenotype has been described in the airways of asthmatics, in which diminished mucosal fluidity, the formation of mucosal plugs, and impaired mucociliary clearance were observed (47, 48). Murine models have also shown that MUC5B loss is associated with impaired mucociliary clearance, airflow obstruction, and respiratory infection (49).

Smoking-induced GCH was associated with the presence of a previously uncharacterized subpopulation of CEACAM5+ KRT8+ MUC5AC PG epithelial cells. The origins of PG cells are unclear, but a KRT8+ undifferentiated epithelial subpopulation derived from basal cells, referred to as suprabasal, has been described in murine models (46, 50). Suprabasal cells act as intermediate precursors to ciliated and secretory cells during basal cell differentiation under normal conditions (46) and, after injury, as a repair mechanism (50). However, the suprabasal phenomenon has not been characterized in the human bronchus, and little is known regarding human intermediate epithelial subpopulations. Furthermore, the involvement of a KRT8+ intermediate state in club cell transdifferentiation (4, 34) has not been explored. Goblet cell differentiation required for the onset and maintenance of smoking-associated GCH might involve a pro-goblet precursor subpopulation, but the explicit role of PG cells in this context requires further investigation.

It has been reported that CEACAM5 expression is persistently up-regulated in the airways of former smokers, whereas genes specifically expressed by goblet cells, such as MUC5AC, SPDEF, and AGR2, return to normal, never smoker levels post-smoking cessation (18). These findings suggest that goblet cell expansion in the airways of smokers is reversible, whereas the emergence of CEACAM5+ PG cells might have long-term implications. The functional consequences of the presence of PG cells are unclear, but irreversible alterations to bronchial epithelial composition might underlie chronic disease states. Although PG cells were identified in this study in the absence of established disease phenotypes, CEACAM5+ KRT5+ cells have been detected in bronchial premalignant lesions and lung SCC (37). CEACAM5 has also been detected in numerous additional cancer types (51, 52), and several genes that are coexpressed with CEACAM5 (i.e., detected in GS-15) have been implicated in carcinogenesis, such as UPK1B (53), MSLN (54, 55), and PSCA (56, 57). Therefore, investigation of mechanisms linking the presence and variable abundance of GCH-associated CEACAM5+ PG cells and premalignant lesion-associated CEACAM5+ KRT5+ cells might provide insight into smoking-induced conditions that promote lung carcinogenesis.

These data demonstrate that human bronchial epithelial exposure to tobacco smoke drives ciliated cellspecific toxin metabolism and leads to both club cell depletion and goblet cell expansion. A novel subpopulation of PG cells was also detected in the bronchial airways of smokers in association with GCH. These results will enable us to more precisely define the landscape of smoking-induced epithelial abnormalities. Future work will use experimental systems to define the consequences of specific, smoke-derived chemical compounds and investigate the recapitulation and reversal of cell and molecular phenotypes observed in this study. Furthermore, these findings may be leveraged to improve diagnostics and develop preventative strategies for smoking-associated lung diseases.

At Boston University Medical Center, healthy volunteer never smokers (n = 6) and current smokers (n = 6) underwent a bronchoscopy to obtain brushings from the right mainstem bronchus, as described previously (17, 18). Eligible volunteers included subjects who (i) were between the ages of 19 and 55; (ii) did not use inhaled or intranasal medications; (iii) did not have a history of chronic obstructive pulmonary disease, asthma, pulmonary fibrosis, sarcoid, or head and neck/lung cancer; (iv) did not use marijuana; (v) did not have a respiratory infection within the past 6 weeks; and (vi) did not use other tobacco products (i.e., pipe, cigar, and chewing). Spirometry was performed to assess lung function (e.g., FEV1/FVC). Exhaled carbon monoxide (Smokerlyzer Carbon Monoxide Monitor, model EC-50; Bedfont Scientific Ltd.) and urine cotinine (NicAlert; Confirm BioSciences) levels were measured to confirm smoking status. The Institutional Review Board approved the study, and all subjects provided written informed consent.

Bronchial brushings were treated with 0.25% trypsin/EDTA for 20 min and stained for sorting using a BD FACSAria II. Gating based on forward scatter height (FSC-H) versus forward scatter area (FSC-A) was applied to sort only singlet events (fig. S1A). Dead cells (LIVE/DEAD Fixable Aqua Dead Cell Stain, Thermo Fisher; L34957) and red blood cells expressing GYPA/B (fig. S1B) on their surface [allophycocyanin (APC) anti-CD235ab; BioLegend, 306607] were stained and excluded. ALCAM+ epithelial cells [phycoerythrin (PE) anti-CD166; BioLegend, 343903] and CD45+ WBCs (APC-Cy7 anti-CD45; BD, 561863) were stained (fig. S1C) and sorted into 96-well polymerase chain reaction (PCR) plates containing lysis buffer [0.2% Triton X-100, 2.5% RNaseOUT (Thermo Fisher; 10777019)] compatible with downstream RNA library preparation. In each 96-well PCR plate for each subject, we sorted 84 ALCAM+ cells and 11 CD45+ cells and maintained one empty well as a negative control. The plates were frozen on dry ice and stored at 80C until preparation for sequencing.

Massively parallel scRNA-Seq of human bronchial airway cells was performed using a modified version of the CEL-Seq RNA library preparation protocol (20). For each of the 12 recruited donors, one frozen 96-well PCR plate containing sorted cells was thawed on ice, and RNA was directly reverse-transcribed (Thermo Fisher, AM1751) from whole-cell lysate using primers composed of an anchored poly(dT), the 5 Illumina adaptor sequence, a six-nucleotide well-specific barcode, a five-nucleotide unique molecular identifier (UMI), and a T7 RNA polymerase promoter. All primer sequences were listed in extended table S1. Samples were additionally supplemented with ERCC RNA Spike-In Mix (1:1,000,000 dilution; Thermo Fisher, 4456740) for quality control. Complementary DNA generated from each of the 96 wells per plate was pooled, subjected to second-strand synthesis (Thermo Fisher, AM1751), and amplified by in vitro transcription (Thermo Fisher, AM1751). Amplified RNA was chemically fragmented (New England BioLabs, E6150) and ligated to an Illumina RNA 3 adapter (Illumina, RS-200-0012). Samples were again reverse-transcribed using a 3 adaptor-specific primer and amplified using indexed Illumina RNA PCR primers (Illumina, RS-200-0012). In total, 1152 samples (1008 epithelial cells, 132 WBCs, and 12 negative controls) were sequenced on an Illumina HiSeq 2500 in rapid mode, generating paired-end reads (15 nucleotides for read 1, 7 nucleotides for index, and 52 nucleotides for read 2).

Illuminas bcl2fastq2 software (v2.19.1) was used to demultiplex the sequencing output to 12 plate-level FASTQ files (1 per 96-well plate). A python-based pipeline (https://github.com/yanailab/CEL-Seq-pipeline) was used to (i) demultiplex each plate-level FASTQ file to 96 cell-level FASTQ files, trim 52 nucleotide reads to 35 nucleotides, and append UMI information from read 1 (R1) to the header of read 2 (R2); (ii) perform genomic alignment of R2 with Bowtie2 (v2.2.2) using a concatenated hg19/External RNA Controls Consortium (ERCC) reference assembly; and (iii) convert aligned reads to gene-level counts using a modified version of the HTSeq (v0.5.4p1) python library that identifies reads aligning to the same location with identical UMIs and reduces them to a single count. One UMI-corrected count was then referred to as a transcript. The pipeline was configured with the following settings: alignment quality (min_bc_quality) = 10, barcode length (bc_length) = 6, UMI length (umi_length) = 5, cut_length = 35.

The quality of each cell was assessed by examining the total number of reads, total reads aligned to hg19, total reads aligning to genes (pre-UMI correction), total transcript counts, and total genes with at least one detected transcript. Cells were excluded from downstream analyses if the total number of transcripts was not twofold greater than the total background-level transcripts detected in the empty well negative control on each plate (fig. S3). Cells were also excluded from downstream analyses if there was a weak Pearson correlation (r < 0.7) between detected ERCC RNA Spike-In transcript counts (log10) and ERCC input concentration (log10) (amol/ml) (fig. S3). All nonprotein-coding genes and genes with less than two transcript counts in five cells were removed from the dataset. The remaining 7680 genes measured across 796 cells were used for subsequent analyses.

LDA from the topicmodels R package (v0.2-6) was used to generate probabilistic representations of cell clusters and gene sets present in the dataset, referred to as Cell-States and Gene-States. The input for the Cell-State model required a counts data matrix where cells were columns and genes were rows, whereas for the Gene-State model, the same matrix was transposed (i.e., genes were columns and cells were rows). Models were fit using the variational expectationmaximization (VEM) algorithm with the following parameters: nstart = 5, seed = 12345, estimate.alpha = TRUE, estimate.beta = TRUE. The given parameter k determined the number of Cell-States and Gene-States to be estimated by the model. The optimal value of k was determined by fivefold cross-validation and evaluation of model perplexity. For the Gene-State model, cells were randomly partitioned into training (80%) and test (20%) sets, whereas for the Cell-State model, genes were randomly partitioned into training (80%) and test (20%) sets. Models were then fit to the training set, and perplexity was estimated to evaluate model fit for the held-out test set. Fifty iterations of this process were performed for k = 2 to 50, mean perplexity was calculated at each k, and the minimum mean perplexity was selected as the optimal value of k (i.e., k.opt), which was k = 13 for the Cell-State model and k = 19 for the Gene-State model (fig. S6).

Negative binomial generalized linear models were built using the MASS R package (v7.3-45) for each Gene-State (n = 19) and each Cell-State (n = 13), in which States were treated as inferred, independent variables and genes or cells, respectively, were treated as dependent variables. A cell was assigned to a Cell-State if a significant association (FDR q < 0.05) was observed with positive directionality (regression coefficient > 1). Similarly, a gene was assigned to a Gene-State if a significant, positive association was observed (FDR q < 1 105, regression coefficient > 1). If multiple State associations were observed for a given gene or cell, assignment was determined on the basis of the strongest State association (i.e., minimum FDR q). Additional metrics for gene set and cluster assignment include State Specificity and State Similarity. LDA (see the previous section) also assigned a probability to each gene (or cell) for each Gene-State (or Cell-State), and State Specificity was calculated by dividing that probability by the sum of probabilities across all Gene-States (or Cell-States). A minimum State Specificity of 0.1 was required for gene or cell assignment. State Similarity was calculated by assessing the cosine (q) similarity between each Gene-State and relative expression of each gene (gene counts divided by total counts for each cell). A minimum State Similarity of 0.4 was required for gene assignment. All downstream analyses used the 785 cells that fit the criteria for Cell-State assignment and 676 genes that fit the criteria for Gene-State assignment. Statistical modeling results, State Specificity, and State Similarity values for all genes, regardless of assignment status, were included in extended table S2.

Before heatmap visualization, transcript counts were transformed to z-normalized transcripts per million (TPM). Genes (top to bottom) and cells (left to right) were ordered according to the strength of statistical association (FDR q) with respective assigned Gene-States and Cell-States. The tsne R package v0.1-3 was used for dimensionality reduction by t-distributed stochastic neighbor embedding (t-SNE). Modified parameters include k = 2 and seed = 1234. Input for t-SNE was z-normalized TPM values across genes with at least three transcript counts in three cells (n = 4914 genes). Gene expression overlay onto t-SNE visualization was also performed using z-normalized TPM values.

The enrichR R package (v0.0.0.9000) was used as an interface for the web-based functional annotation tool, Enrichr, to identify Gene Ontology (GO) terms from the GO Biological Process 2015 library significantly associated with each gene set (58, 59). Functional annotation results were listed in extended table S3.

Raw CEL files obtained from the Gene Expression Omnibus (GEO) for series GSE7895 were normalized to produce gene-level expression values using the implementation of the Robust Multiarray Average (RMA) in the affy R package (v1.36.1) and an Entrez Gene-specific probeset mapping (17.0.0) from the Molecular and Behavioral Neuroscience Institute (Brainarray) at the University of Michigan (http://brainarray.mbni.med.umich.edu/).

Bronchial brushings were reconstructed in silico from the single-cell data by taking the sum of all transcript counts for each gene across all cells procured from each donor. Negative binomial generalized linear models were built using the MASS R package (v7.3-45), modeling transcript counts as a function of smoking status (FDR q < 0.05: n = 593 genes). In parallel, using never and current smoker bulk bronchial brushing microarray data (GEO series GSE7895), linear models were built using the stats R package (R v3.2.0), modeling gene-level expression values as a function of smoking status (FDR q < 0.05: n = 689 genes). The correlation between test statistics generated from both models was then measured to compare differential expression results (fig. S4A). Using the overlap among smoking-associated genes identified in both models (n = 155 genes), correlations (Spearman) among in silico bronchial brushings and bulk bronchial brushings were examined (fig. S4B).

Using published microarray data generated from bulk bronchial brushings procured from never and current smokers (GEO series GSE7895), RMA-transformed values for each gene were z-normalized. MetaGene values were then generated by computing the mean z score across all genes in each gene set (GS-1 to GS-19) for each sample. Linear models were built using the stats R package (R v3.2.0), modeling MetaGene expression as a function of donor smoking status and age. For metagenes that were associated with smoking status (FDR q < 0.05), but not age, if the mean current smoker value was greater than or less than the mean never smoker value, the gene set was considered to be up- or down-regulated in current smokers, respectively.

TPM values for cell type marker genes (KRT5, FOXJ1, SCGB1A1, MUC5AC, and CD45) were z-normalized across all cells. Cluster-specific mean expression was designated high (pink) if expression exceeded 1 SD above the mean value across all cells, medium (white) if expression exceeded one-half of an SD above the mean value across all cells, and low (light gray) if expression exceeded the mean value across all cells. If cluster-specific mean expression was designated high, medium, or low for KRT5, FOXJ1, SCGB1A1, MUC5AC, or CD45 (PTPRC), that cluster was assigned the cell type of basal, ciliated, club, goblet, or WBC, respectively. Cluster-specific mean expression below the mean value across all cells indicated that a given cluster did not express a given marker gene (dark gray).

To assess smoking statusspecific cell enrichment for each cluster, logistic regression was performed using the stats R package (R v3.2.0), modeling each cluster assignment as a function of donor smoking status and the number of cells contributed by each donor. For clusters that were associated with smoking status (FDR q < 0.05), but not the number of cells contributed by each donor, the directionality of the regression coefficient was leveraged to assign never or current smoker status.

Transcript counts were transformed to z-normalized TPM. MetaGene values were then generated by computing the mean z score across all genes in each gene set (GS-1 to GS-19) for each cell. Cluster-specific MetaGene expression was designated high (pink) if mean expression exceeded 1 SD above the mean value across all cells, medium (white) if mean expression exceeded one-half of an SD above the mean value across all cells, and low (light gray) if mean expression exceeded the mean value across all cells. Cluster-specific mean expression below the mean value across all cells indicated that a given cluster did not express a given gene set (dark gray).

Bronchial tissue was collected from patients undergoing lung resection. All specimens were procured at least 5 cm from bronchial sites affected by disease diagnoses, and analyses indicated that tissue was histologically normal. The UMCG cohort (table S2) included specimens analyzed in collaboration with the UMCG collected from four never smokers and four current smokers. Specimens were obtained from the tissue bank in the UMCG Department of Pathology. The study protocol was consistent with the Research Code of the UMCG and Dutch national ethical and professional guidelines (Code of conduct; Dutch federation of biomedical scientific societies; http://www.federa.org). The UCL cohort (table S3) included specimens analyzed in collaboration with the UCL collected from five never smokers and five current smokers. Ethical approval was sought and obtained from the UCL Hospital Research Ethics Committee (REC reference 06/Q0505/12). This study was carried out in accordance with the Declaration of Helsinki (2000) of the World Medical Association.

Formalin-fixed paraffin-embedded lung sections were cut at 4 mm, tissue was probed with primary antibodies (listed below) and secondary antibodies with fluorescent conjugates (Invitrogen Alexa Fluor 488, 594, 647), and nuclear staining was performed with 4,6-diamidino-2-phenylindole (DAPI) (Thermo Fisher, R37606). Immunostaining was performed using the following primary antibodies: mouse antiAc--Tub (Sigma, T6793), rabbit antiAc--Tub (Enzo Life Sciences, BML SA4592), rabbit anti-AKR1B10 (Sigma, HPA020280), rabbit anti-CEACAM5 (Abcam, ab131070), chicken anti-KRT5 (BioLegend, 905-901), rat anti-KRT8 (Developmental Studies Hybridoma Bank, University of Iowa; TROMA-I), mouse anti-MUC5AC (Abcam, ab3649), and rabbit anti-MUC5B (Sigma, HPA008246). Imaging of staining panels analyzed in collaboration with investigators at the UMCG (table S2) (e.g., AKR1B10/Ac--Tub/KRT8: Fig. 3C; AKR1B10/MUC5AC/KRT8: fig. S13; MUC5B/MUC5AC: Fig. 4B; MUC5B/MUC5AC/KRT5: fig. S14; CEACAM5/KRT8/MUC5AC: fig. S18) was performed using a Carl Zeiss LSM 710 NLO confocal microscope at 63 objective magnification at the Boston University School of Medicine Multiphoton Microscope Core Facility. Imaging of staining panels analyzed in collaboration with investigators at the UCL (table S3) (e.g., CEACAM5/KRT8/MUC5AC: Fig. 5D; KRT8/MUC5AC/Ac--Tub: fig. S15) was performed using a Leica TCS Tandem confocal microscope at 63 objective magnification.

All imaging data were analyzed using ImageJ Fiji software. For each image, cells were counted relative to the measured length of the epithelium in micrometers (cells per micrometer). Mean cell counts per micrometer (cells per millimeter) were then calculated for never smokers (treated as the control), and individual values for each image from never and current smokers were calculated relative to the never smoker mean (i.e., relative cells per millimeter). We analyzed three images for each donor and assessed smoking-associated changes using the Wilcoxon rank-sum test. For panels in which MUC5AC was stained, current smoker tissue was assigned the phenotypic status of either MN or GCH based on qualitative assessment of goblet cell density and stratification. For each current smoker, three images of each status were analyzed.

Supplementary material for this article is available at http://advances.sciencemag.org/cgi/content/full/5/12/eaaw3413/DC1

Table S1. Bronchial brushings were procured from six never smokers and six current smokers.

Table S2. Bronchial tissue was obtained by lung resection from four never smokers and four current smokers at the UMCG.

Table S3. Bronchial tissue was obtained by lung resection from five never smokers and five current smokers at the UCL Hospital.

Fig. S1. Single bronchial cells were isolated by FACS.

Fig. S2. scRNA-Seq data quality were evaluated for each donor.

Fig. S3. Low-quality cells were excluded from downstream analyses.

Fig. S4. Bronchial brushings reconstructed in silico from single-cell data resemble data generated from bulk bronchial brushings.

Fig. S5. LDA was used to identify Cell-States and Gene-States.

Fig. S6. Gene-State and Cell-State model optimization.

Fig. S7. LDA was used to identify 13 cell clusters.

Fig. S8. LDA was used to identify 19 gene sets.

Fig. S9. Gene set expression across cell clusters.

Fig. S10. T cell receptor genes were detected in CD45+ cell cluster.

Fig. S11. Cluster 13 cells expressed CFTR.

Fig. S12. Distributions of cell clusters within each subject.

Fig. S13. Smoking-associated differential expression of each gene set was analyzed in published bulk bronchial brushing data.

Fig. S14. Nonciliated cell AKR1B10 expression was uncommon.

Fig. S15. MN and GCH tissue regions were distributed throughout the bronchial airways of current smokers.

Fig. S16. Basal cell numbers were not altered in smokers.

Fig. S17. Increased numbers of indeterminate KRT8+ cells were observed in GCH smoker tissue.

Fig. S18. PG cells were enriched in regions of GCH within the airways of smokers.

Fig. S19. Smoking-induced heterogeneity was observed in the human bronchial epithelium.

Extended table S1. Primer sequences for scRNA-Seq.

Extended table S2. Statistical modeling results, State Specificity, and State Similarity values for all genes.

Extended table S3. Functional annotation results for each gene set.

This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license, which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited.

Continue reading here:
Characterizing smoking-induced transcriptional heterogeneity in the human bronchial epithelium at single-cell resolution - Science Advances

Evolutionarily novel genes found to be expressed in tumors – News-Medical.net

A team of scientists from Peter the Great St.Petersburg Polytechnic University (SPbPU) studied the evolutionary ages of human genes and identified a new class of them expressed in tumors -- tumor specifically expressed, evolutionarily novel (TSEEN) genes. This confirms the team's earlier theory about the evolutionary role of neoplasms.

A report about the study was published in Scientific Reports.

A tumor is a pathological new growth of tissues. Due to genetic changes, it has impaired cellular regulation and therefore defective functionality. Tumors can be benign or malignant. Unlike the latter, the former grow slowly, don't metastasize, and are easy to remove. Malignant tumors (cancer) are one of the primary mortality factors in the world.

A team of scientists from Saint Petersburg discovered a new class of evolutionarily novel genes present in all tumors -- the so-called TSEEN (Tumor Specifically Expressed Evolutionarily Novel) genes.

The evolutionary role of these genes is to provide genetic material for the origin of new progressive characteristics. TSEEN genes are expressed in many neoplasms and therefore can be excellent tumor markers."

Prof. Andrei Kozlov, a PhD in Biology, the head of Laboratory "Molecular Virology and Oncology" at Peter the Great St. Petersburg Polytechnic University

The new research confirms a theory that has been proposed by the A. Kozlov earlier. According to it, the number of oncogenes in a human body should correspond to the number of differential cell types. The theory also suggested that the evolution of oncogenes, tumor suppressor genes, and the genes that determine cell differentiation goes on concurrently. The theory is based on the hypothesis of evolution through tumor neofunctionalization, according to which hereditary neoplasms might have played an important role during the early stages of metazoan evolution by providing additional cell masses for the origin of new cell types, tissues, and organs. Evolutionarily novel genes that originate in the DNA of germ cells are expressed in these extra cells.

Prof. Kozlov also made a reference to the article 'Evolutionarily Novel Genes Are Involved in Development of Progressive Traits in Humans' (2019) that has recently been published by his laboratory. In this article the team confirmed their hypothesis using transgenic fish tumors and fish evolutionarily novel genes. The orthologs of such genes are found in the human genome, but in humans they play a role in the development of progressive characteristics not encountered in fish (e.g. lungs, breasts, placenta, ventricular septum in the heart, etc). This confirms the hypothesis about the evolutionary role of tumors. The studies referred to in the article lasted for several years, and their participants used a wide range of methods from the fields of bioinformatics and molecular biology.

"Our work is of great social importance, as the cancer problem hasn't been solved yet. Our theory suggests new prevention and therapy strategies," said Prof. Kozlov. According to him, to fight cancer, a new paradigm should be developed in oncology. TSEEN genes may be used to create new cancer test systems and antitumor vaccines.

Source:

Journal reference:

Makashov, A.A., et al. (2019) Oncogenes, tumor suppressor and differentiation genes represent the oldest human gene classes and evolve concurrently. Scientific Reports. doi.org/10.1038/s41598-019-52835-w.

Read the original post:
Evolutionarily novel genes found to be expressed in tumors - News-Medical.net

Study examines the effects of fat tissue on skeletal muscle structure and function – News-Medical.net

From arthritis and heart failure to diabetes and menopause, many conditions are associated with muscle weakness and increased fat deposits.

Now a multidisciplinary team of researchers at the University of Massachusetts Amherst is applying a unique approach to examine the effects of fat tissue on skeletal muscle structure and function in young and older men and women.

Armed with a two-year, $374,188 grant from the National Institute on Aging, lead investigator Jane Kent, professor and chair of kinesiology in the School of Public Health and Health Sciences, and colleagues will combine state-of-the-art, noninvasive magnetic resonance imaging and spectroscopy techniques with whole-body, single-cell and molecular measures of muscle function.

As muscle typically contributes 30-40% of total body mass, this metabolically active tissue plays a direct role in maintaining good health. Currently, we do not know the mechanical consequences of fat infiltration on muscle. Our hypothesis is that fat physically limits muscle strength by interfering with the way the muscle was designed to work."

Jane Kent, lead investigator

The collaborative research is being performed in the Human Magnetic Resonance and Human Health and Performance centers at the Institute for Applied Life Sciences (IALS), where scientists strive to translate fundamental research into innovations that benefit humankind. Advanced data analysis will be carried out in the Muscle Physiology and Muscle Biology laboratories in the Totman building.

Kent is working with kinesiology assistant professor Mark Miller, endocrinologist and research professor of kinesiology Dr. Stuart Chipkin, math and statistics professor emeritus John Buonaccorsi and professor Bruce Damon from the Vanderbilt University Institute of Imaging Science. Graduate students Joseph Gordon III and Christopher Hayden, along with project coordinator Nicholas Remillard, round out the research team.

Kent says the innovative research may yield new knowledge about the effects of fat on muscle activity, information that has potential health benefits.

"Understanding the impact adipose tissue has on skeletal muscle has the potential to markedly alter our approach to mitigating and reversing muscle dysfunction in aging and the large number of conditions associated with increased fat content in muscle," Kent says.

The research team is recruiting volunteers to round out the study group of overweight and obese young adults, age 25-45, and healthy older adults, age 65-75. Participants would be required to visit the campus up to three times and would receive financial compensation for their time.

Read the original:
Study examines the effects of fat tissue on skeletal muscle structure and function - News-Medical.net