Contents:
Log in to get trip updates and message other travelers. Olimpia Spa 1 Review. Sorry, there are no tours or activities available to book online for the date s you selected. Please choose a different date. Quick View. More Info. Is this a place or activity you would suggest for families with kids? Yes No Unsure.
Is this a place or activity you would go to on a rainy day? Is this a romantic place or activity that you would suggest for couples?
Thanks for helping! Share another experience before you go. Full view. Cel Street.
Fradique Coutinho Sao Paulo Metro 38 min. Faria Lima Sao Paulo Metro 40 min. Best nearby. In addition, literature searches on cancer are hampered by the fact that cancer is a complex disease. Cancer and cancer subtypes more closely resemble a set of diseases, each disease with different features and unknowns. Head and neck HN cancer is the sixth most common type of cancer worldwide, with about new cases in 9 and a remarkable example of heterogeneous malignancy. Similar to what is observed in many types of neoplasms, the challenge in searching the literature on HN cancer is particularly difficult due to its diversity, which involves diversity in histological type, anatomical location and primary risk factors.
For instance, the anatomical sites affected by the disease and the primary risk factors can be used to divide head and neck squamous cell carcinomas HNSCC into at least three classes. Two of these classes involve human papillomavirus HPV -positive disease mostly oropharyngeal with a favorable prognosis and HPV-negative disease with less favorable prognosis and a different molecular profile HPV-positive tumors are primarily wild-type TP53 , whereas HPV-negative tumors present mutated TP53 and show high chromosome instability 10 , 11 , which may sustain advantageous metabolic pathways, aid in escaping the inhibitory effects of suppressor signals 12 or promote oncogenic effects However, survival is frequently low.
This is likely because tumors in early stages frequently present few symptoms leading to a delay in diagnosis.
Furthermore, therapy effectiveness is highly variable, even in early lesions or histologically similar cases. According to this model, the clinical progression to dysplasia, in situ carcinoma, and, finally, invasive carcinoma is supported by the increased accumulation of molecular alterations Such alterations promote the neoplastic phenotype defined by Hanahan and Weinberg 17 , including increased cell proliferation, insensitivity to growth suppression factors, apoptosis resistance, sustained angiogenesis, energy metabolism alterations, immune attack avoidance and the acquisition of invasion and metastasis capability.
Investigations into HNSCC emphasize the importance of identifying the mechanisms and the molecular changes triggered during the malignant transformation that culminates in the neoplastic phenotype. New data on potential markers may shed light on tumor biology and, consequently, lead to the development of novel drugs. Literature mining is a fundamental starting point for this discovery process, but the recent exponential growth in biological data is well beyond the limit of a complete manual search in most cases.
In turn, automated literature mining can help to find disease-related biomarkers and their interrelationships, and extract hidden information with tools able to efficiently target valuable research questions and generate testable hypothesis. During this process, the articles of interest are retrieved, the biological entities are identified in texts, and specific information, particularly relationships between biological entities, is extracted. One of the challenges in automated approaches is the exact identification of genes, proteins or diseases since they may be referred to by different names, share names and symbols, or even be described by nonstandard nomenclature in literature and databases Another challenge is to identify consistent descriptions of gene products and their associated features, and supporting evidence for inferring such associations.
To overcome these limitations, text-mining applications have incorporated tools to recognize specific keywords and to capture relevant sentences and ontologies. For example, relationships may be extracted investigating entities that co-occur in the same report, title, abstract or even a sentence, or by the so-called natural language processing NLP methods. NLP methods are based on the structure of sentences and on how the biological data is mentioned However, this approach has advantages and limitations, since it may give rise to erroneous relationships depending on used parameters The controlled vocabularies of the Gene Ontology GO 21 project enable coupling of gene products to their associated biological processes, cellular components and molecular functions However, the automatic identification of GO-literature association is less accurate than manual curation methods, such as the one using Medical Subject Headings MeSH 23 for indexing PubMed articles, a process performed by trained experts that potentially generates few false positive assignments.
In addition, MeSH-literature associations may be linked to genes or diseases, facilitating the identification of previously unrevealed relationships between entities, such as protein—protein, drug—effect and protein—disease 24 , In this work, we developed an in-house methodology to conduct literature mining aiming to identify genes and gene products related with various aspects of HNSCC.
A database HNdb was established for unifying the information on these genes and proteins, covering data on genomics, transcriptomics, proteomics, literature citations, and also cross-references of external databases. The information was wrapped up in a friendly web interface, which provides easy and rapid access to the HNSCC-related genes and to a vast number of biological data resources. The interfaces aims to facilitate the selection of candidates for validation assays and the identification of potential new markers, as exemplified in this study. The workflow of our literature mining consisted of two initial automated stages and a separate manual step.
Three literature searches based on different MeSH terms were run on 29 June In stage II, the articles selected in stage I were associated with genes using the gene2pubmed association file 26 , which contains the gene identifiers gene IDs and the respective PubMed article identifiers PMIDs. For this association, only human genes were accepted.
The details on the MeSH terms and on the literature search strategy are presented in Supplementary File 1 and an overview of the workflow is provided in Figure 1. Genes were then ranked according to their level of association with HNSCC—from the most relevant to the less relevant defined by the number of publications addressing the gene in HNSCC—by a hypergeometric test 29 performed using the Stirling's approximation to high-factorial values The method calculates the probability of k or query-relevant publications for a gene A by chance, being S the score for gene A, m the publications in the gene2pubmed association file, n the number of publications retrieved for the query and present in the gene2pubmed association file, j the number of publications that involve gene A, and k the number of query-relevant publications that involves the gene A.
The data collection workflow will be routinely updated twice per year to incorporate new PMIDs and genes. To integrate potential biomarkers involved in HNSCC with data from the available literature, we constructed a MySQL relational database system implemented in an Apache server using the Linux operating system.
The platform provides users with the ability to search for and download information on the genes and proteins involved in HN cancer. The home page presents the database objectives and provide tools for searching genes related to HNSCC, their expression pattern and chromosome location. External data were included in the database to facilitate access to the maximum amount of information on a particular gene or protein.
For example, the genes selected by users are linked to PMIDs, metabolic pathways 31 , 32 , 33 , 34 , associated ontologies 21 , somatic mutations in HN cancer 35 , genetic disorders 36 and microarray data. Data on proteins, including interactions and drugs that target them 4 , 45—53 are also available. The curated set of genes related to HNSCC was imported into DAVID 54 , 55 , a database for annotation, visualization and integrated discovery 54 , and the genes were annotated for GO and pathways using the whole human genome as background.
By typing the gene symbol, aliases, gene or protein name, accession number or ID into the search box, users can obtain information on whether a gene has already been related to HN cancer. Users can also browse chromosome regions associated with HN cancer. The data returned by the queries can be downloaded as a spreadsheet or a text file. The results of a particular gene are displayed in a new page that provides the official gene name, gene IDs, aliases, chromosome location and gene expression pattern generated via microarray studies on tumor tissues as well as articles that support its involvement in HNSCC or report prognostic markers.
As indicated above, the results also include gene ontologies, metabolic pathways and links to external databases on expression patterns in normal tissues, somatic mutations in cancer and gene-phenotype or disease associations. The protein page provides 3D structures and posttranslational modifications, metabolite and protein—protein interactions, expression patterns and drugs for targets of interest. Following a manual curation, genes not related to HNSCC were excluded and a list of genes was obtained.
Although these values are satisfactory, they still need to be improved since not all the genes retrieved by the approach were considered relevant after manual curation.
In addition, several relevant genes were missed, which indicates that the literature search in future versions of HNdb have to be expanded to include articles identified through digital libraries besides PubMed e. Google Scholar, Web of Science and Scopus 59—61 , and approaches for information extraction should be added, such as NLP based methods.
These results highlight the need of extensive basic and clinical research focused on unique characteristics of this group of carcinomas. These scores for TP53 and EGFR were confirmed by the hypergeometric test Supplementary Table 1 , and indicate that they represent the most extensively studied ones and certainly exhibit relevant results.
According to the Sao Paulo State Secretariat of Public Security, on- and off-duty military and civil police officers were responsible for deaths in the state in the first half of the year, which was the highest number in the last 14 years. The information was wrapped up in a friendly web interface, which provides easy and rapid access to the HNSCC-related genes and to a vast number of biological data resources. Protesters organized to challenge military parades that were celebrating Brazil's independence from Portugal. Federal prosecutors opened an investigation, the second such investigation into a reported killing of uncontacted indigenous persons during the year. Public authorities such as politicians and law enforcement officials perpetrated 77 percent of the violations, and police did not open investigations in 39 percent of the cases.
Regarding the genes mentioned by only one article, many of them probably have not yet been completely exploited as potential markers and deserve further investigations. However, several others were mapped to less frequently cited regions. Considering the same anatomical sites analyzed in the present work, HNOCDB extracted genes in oral, 14 in tongue, 7 in hypopharyngeal, 3 in oropharyngeal and 60 in laryngeal cancers through text-mining. A total of nonredundant genes was identified by these databases.
In contrast with these databases, the present study performed three searches using MeSH terms and was more stringent by excluding articles that also analyzed non-HNSCC tumors. Therefore, our gene list the largest of the three databases is more specific and, therefore, more focused on the tumors of interest.
Furthermore, HNdb is the only database that uses specific MeSH terms to link genes to literature data on prognosis and outcome Supplementary Table 2 , also available on the gene results page , facilitating the identification of markers that are relevant to tumor biology and therapy response. A total of DAVID identifiers were mapped from the list of genes and similar annotation terms were clustered into groups, removing redundancy.
The results showed an overrepresentation of clusters related to tissue development and differentiation, response to stimulus, signal transduction, cell proliferation, cell migration, apoptosis, transcription and cell adhesion, which are biological processes relevant to cancer. Furthermore, the diversity of results compiled in our dataset allowed identifying novel and mostly unexplored gene associations. Few studies have explored the metabolic pathways involved in the response to steroid stimulus in HNSCC.
The authors also reported evidence that estrogen receptor and epidermal growth factor receptor cross talk is present in HNSCC.

Thus, we demonstrate that a database integrating multiple types of data greatly expands the possibilities for gene networks investigation, providing potential associations to be tested. The HNdb is an effort toward this goal and is intended to be an integrated database with rapid and easy-to-use tools that facilitate literature and biological data mining to thereby promote research and generate new insight into the development of useful markers for HN cancer.
Supplementary data are available at Database Online.