10640010@unknown@formal@none@1@S@
OpenOffice.org
@@@@1@1@@danf@17-8-2009 10640020@unknown@formal@none@1@S@'''OpenOffice.org''' ('''OO.o''' or '''OOo''') is a [[cross-platform]] [[office suite|office application suite]] available for a number of different computer [[operating system]]s.@@@@1@20@@danf@17-8-2009 10640030@unknown@formal@none@1@S@It supports the ISO standard '''[[OpenDocument]] Format (ODF)''' for data interchange as its default [[file format]], as well as [[Microsoft Office]] '97–2003 formats, [[Microsoft Office]] '2007 format (in version 3), among many others.@@@@1@33@@danf@17-8-2009 10640040@unknown@formal@none@1@S@OpenOffice.org was originally derived from [[StarOffice]], an office suite developed by [[StarDivision]] and acquired by [[Sun Microsystems]] in August 1999.@@@@1@20@@danf@17-8-2009 10640050@unknown@formal@none@1@S@The [[source code]] of the suite was released in July 2000 with the aim of reducing the dominant [[market share]] of [[Microsoft Office]] by providing a free, open and high-quality alternative; later versions of StarOffice are based upon OpenOffice.org with additional proprietary components.@@@@1@43@@danf@17-8-2009 10640060@unknown@formal@none@1@S@OpenOffice.org is [[free software]], available under the [[GNU Lesser General Public License]] (LGPL).@@@@1@13@@danf@17-8-2009 10640070@unknown@formal@none@1@S@The project and software are informally referred to as ''OpenOffice'', but this term is a [[trademark]] held by another party, requiring the project to adopt ''OpenOffice.org'' as its formal name.@@@@1@30@@danf@17-8-2009 10640080@unknown@formal@none@1@S@== History==@@@@1@2@@danf@17-8-2009 10640090@unknown@formal@none@1@S@Originally developed as the [[proprietary software]] application suite StarOffice by the German company [[StarDivision]], the code was purchased in 1999 by Sun Microsystems.@@@@1@23@@danf@17-8-2009 10640100@unknown@formal@none@1@S@In August 1999 version 5.2 of StarOffice was made available free of charge.@@@@1@13@@danf@17-8-2009 10640110@unknown@formal@none@1@S@On [[July 19]], [[2000]], Sun Microsystems announced that it was making the source code of StarOffice available for download under both the LGPL and the [[Sun Industry Standards Source License]] (SISSL) with the intention of building an open source development community around the software.@@@@1@44@@danf@17-8-2009 10640120@unknown@formal@none@1@S@The new project was known as OpenOffice.org, and its website went live on [[October 13]], [[2000]].@@@@1@16@@danf@17-8-2009 10640130@unknown@formal@none@1@S@Work on version 2.0 began in early 2003 with the following goals: better interoperability with Microsoft Office; better performance, with improved speed and lower memory usage; greater [[Scripting language|scripting]] capabilities; better integration, particularly with [[GNOME]]; an easier-to-find and use database front-end for creating reports, forms and queries; a new built-in [[SQL]] database; and improved [[usability]].@@@@1@55@@danf@17-8-2009 10640140@unknown@formal@none@1@S@A [[beta version]] was released on [[March 4]], [[2005]].@@@@1@9@@danf@17-8-2009 10640150@unknown@formal@none@1@S@On [[September 2]], [[2005]] Sun announced that it was retiring the SISSL.@@@@1@12@@danf@17-8-2009 10640160@unknown@formal@none@1@S@As a consequence, the OpenOffice.org Community Council announced that it would no longer [[dual license]] the office suite, and future versions would use only the LGPL.@@@@1@26@@danf@17-8-2009 10640170@unknown@formal@none@1@S@On [[October 20]], [[2005]], OpenOffice.org 2.0 was formally released to the public.@@@@1@12@@danf@17-8-2009 10640180@unknown@formal@none@1@S@Eight weeks after the release of Version 2.0, an update, OpenOffice.org 2.0.1, was released.@@@@1@14@@danf@17-8-2009 10640190@unknown@formal@none@1@S@It fixed minor bugs and introduced new features.@@@@1@8@@danf@17-8-2009 10640200@unknown@formal@none@1@S@As of the 2.0.3 release, OpenOffice.org changed its release cycle from 18-months to releasing updates, feature enhancements and bug fixes every three months.@@@@1@23@@danf@17-8-2009 10640210@unknown@formal@none@1@S@Currently, new versions including new features are released every six months (so-called "feature releases") alternating with so-called "bug fix releases" which are being released between two feature releases (Every 3 months).@@@@1@31@@danf@17-8-2009 10640220@unknown@formal@none@1@S@=== StarOffice ===@@@@1@3@@danf@17-8-2009 10640230@unknown@formal@none@1@S@Sun subsidizes the development of OpenOffice.org in order to use it as a base for its commercial [[proprietary software|proprietary]] StarOffice application software.@@@@1@22@@danf@17-8-2009 10640240@unknown@formal@none@1@S@Releases of StarOffice since version 6.0 have been based on the OpenOffice.org source code, with some additional proprietary components, including:@@@@1@20@@danf@17-8-2009 10640250@unknown@formal@none@1@S@* Additional bundled fonts (especially [[CJK|East Asian language]] fonts).@@@@1@9@@danf@17-8-2009 10640260@unknown@formal@none@1@S@* [[Adabas D]] database.@@@@1@4@@danf@17-8-2009 10640270@unknown@formal@none@1@S@* Additional document [[Template (word processing)|templates]].@@@@1@6@@danf@17-8-2009 10640280@unknown@formal@none@1@S@* [[Clip art]].@@@@1@3@@danf@17-8-2009 10640290@unknown@formal@none@1@S@* Sorting functionality for Asian versions.@@@@1@6@@danf@17-8-2009 10640300@unknown@formal@none@1@S@* Additional file filters.@@@@1@4@@danf@17-8-2009 10640310@unknown@formal@none@1@S@* Migration assessment tool (Enterprise Edition).@@@@1@6@@danf@17-8-2009 10640320@unknown@formal@none@1@S@* Macro migration tool (Enterprise Edition).@@@@1@6@@danf@17-8-2009 10640330@unknown@formal@none@1@S@* Configuration management tool (Enterprise Edition).@@@@1@6@@danf@17-8-2009 10640340@unknown@formal@none@1@S@OpenOffice.org, therefore, inherited many features from the original StarOffice upon which it was based including the [[OpenOffice.org XML]] file format which it retained until version 2, when it was replaced by the ISO standard [[OpenDocument]] Format (ODF).@@@@1@37@@danf@17-8-2009 10640350@unknown@formal@none@1@S@== Features ==@@@@1@3@@danf@17-8-2009 10640360@unknown@formal@none@1@S@According to its [[mission statement]], the OpenOffice.org project aims "''To create, as a community, the leading international office suite that will run on all major platforms and provide access to all functionality and data through open-component based APIs and an XML-based file format.''"@@@@1@43@@danf@17-8-2009 10640370@unknown@formal@none@1@S@OpenOffice.org aims to compete with Microsoft Office and emulate its look and feel where suitable.@@@@1@15@@danf@17-8-2009 10640380@unknown@formal@none@1@S@It can read and write most of the [[file formats]] found in Microsoft Office, and many other applications; an essential feature of the suite for many users.@@@@1@27@@danf@17-8-2009 10640390@unknown@formal@none@1@S@OpenOffice.org has been found to be able to open files of older versions of Microsoft Office and damaged files that newer versions of Microsoft Office itself cannot open.@@@@1@28@@danf@17-8-2009 10640400@unknown@formal@none@1@S@However, it cannot open older Word for Macintosh (MCW) files.@@@@1@10@@danf@17-8-2009 10640410@unknown@formal@none@1@S@=== Platforms ===@@@@1@3@@danf@17-8-2009 10640420@unknown@formal@none@1@S@Platforms for which OO.o is available include [[Microsoft Windows]], [[Linux]], [[Solaris Operating System|Solaris]], [[BSD]], [[OpenVMS]], [[OS/2]] and [[IRIX]].@@@@1@18@@danf@17-8-2009 10640430@unknown@formal@none@1@S@The current primary development platforms are Microsoft Windows, Linux and Solaris.@@@@1@11@@danf@17-8-2009 10640440@unknown@formal@none@1@S@A port for [[Mac OS X]] exists for OS X machines which have the [[X Window System]] component installed.@@@@1@19@@danf@17-8-2009 10640450@unknown@formal@none@1@S@A port to OS X's native [[Aqua (user interface)|Aqua user interface]] is in progress, and is scheduled for completion for the 3.0 milestone.@@@@1@23@@danf@17-8-2009 10640460@unknown@formal@none@1@S@[[NeoOffice]] is an independent [[Fork (software development)|fork]] of OpenOffice, specially adapted for Mac OS X.@@@@1@15@@danf@17-8-2009 10640470@unknown@formal@none@1@S@=== Version compatibility ===@@@@1@4@@danf@17-8-2009 10640480@unknown@formal@none@1@S@*Windows 95: up to v1.1.5@@@@1@5@@danf@17-8-2009 10640490@unknown@formal@none@1@S@*Windows 98-Vista: up to v2.4, development releases of v3.0@@@@1@9@@danf@17-8-2009 10640500@unknown@formal@none@1@S@*Mac OS 10.2: up to v1.1.2@@@@1@6@@danf@17-8-2009 10640510@unknown@formal@none@1@S@*Mac OS 10.3: up to v2.1@@@@1@6@@danf@17-8-2009 10640520@unknown@formal@none@1@S@*Mac OS 10.4-10.5: up to v2.4, development releases of v3.0 ([[Apple-Intel architecture|intel]] only)@@@@1@13@@danf@17-8-2009 10640530@unknown@formal@none@1@S@*OS/2 and eComStation: up to v2.0.4@@@@1@6@@danf@17-8-2009 10640540@unknown@formal@none@1@S@=== Components ===@@@@1@3@@danf@17-8-2009 10640550@unknown@formal@none@1@S@OpenOffice.org is a collection of applications that work together closely to provide the features expected from a modern office suite.@@@@1@20@@danf@17-8-2009 10640560@unknown@formal@none@1@S@Many of the components are designed to mirror those available in Microsoft Office.@@@@1@13@@danf@17-8-2009 10640570@unknown@formal@none@1@S@The components available include:@@@@1@4@@danf@17-8-2009 10640580@unknown@formal@none@1@S@*[[QuickStart]]er@@@@1@1@@danf@17-8-2009 10640590@unknown@formal@none@1@S@:A small program for Windows and Linux that runs when the computer starts for the first time.@@@@1@17@@danf@17-8-2009 10640600@unknown@formal@none@1@S@It loads the core files and libraries for OpenOffice.org during computer startup and allows the suite applications to start more quickly when selected later.@@@@1@24@@danf@17-8-2009 10640610@unknown@formal@none@1@S@The amount of time it takes to open OpenOffice.org applications was a common complaint in version 1.0 of the suite.@@@@1@20@@danf@17-8-2009 10640620@unknown@formal@none@1@S@Substantial improvements were made in this area for version 2.2.@@@@1@10@@danf@17-8-2009 10640630@unknown@formal@none@1@S@*The [[Macro (computer science)|macro]] recorder@@@@1@5@@danf@17-8-2009 10640640@unknown@formal@none@1@S@:Is used to record user actions and replay them later to help with automating tasks, using [[OpenOffice.org Basic]] (see [[OpenOffice.org#OpenOffice.org Basic|below]]).@@@@1@21@@danf@17-8-2009 10640650@unknown@formal@none@1@S@It is not possible to download these components individually on Windows, though they can be installed separately.@@@@1@17@@danf@17-8-2009 10640660@unknown@formal@none@1@S@Most Linux distributions break the components into individual packages which may be downloaded and installed separately.@@@@1@16@@danf@17-8-2009 10640670@unknown@formal@none@1@S@=== OpenOffice.org Basic ===@@@@1@4@@danf@17-8-2009 10640680@unknown@formal@none@1@S@OpenOffice.org Basic is a programming language similar to Microsoft [[Visual Basic for Applications]] (VBA) based on [[StarOffice Basic]].@@@@1@18@@danf@17-8-2009 10640690@unknown@formal@none@1@S@In addition to the macros, the upcoming Novell edition of OpenOffice.org 2.0 supports running Microsoft VBA macros, a feature expected to be incorporated into the mainstream version soon.@@@@1@28@@danf@17-8-2009 10640700@unknown@formal@none@1@S@OpenOffice.org Basic is available in the Writer and Calc applications.@@@@1@10@@danf@17-8-2009 10640710@unknown@formal@none@1@S@It is written in functions called subroutines or macros, with each macro performing a different task, such as counting the words in a paragraph.@@@@1@24@@danf@17-8-2009 10640720@unknown@formal@none@1@S@OpenOffice.org Basic is especially useful in doing repetitive tasks that have not been integrated in the program.@@@@1@17@@danf@17-8-2009 10640730@unknown@formal@none@1@S@As the OpenOffice.org database, called "Base", uses documents created under the Writer application for reports and forms, one could say that Base can also be programmed with OpenOffice.org Basic.@@@@1@29@@danf@17-8-2009 10640740@unknown@formal@none@1@S@== File formats ==@@@@1@4@@danf@17-8-2009 10640750@unknown@formal@none@1@S@OpenOffice.org pioneered the ISO/IEC standard [[OpenDocument]] file formats (ODF), which it uses natively, by default.@@@@1@15@@danf@17-8-2009 10640760@unknown@formal@none@1@S@It also supports reading (and in some cases writing) a large number of legacy proprietary file formats (e.g.: [[WordPerfect]] through libwpd, [[StarOffice]], [[Lotus software]], [[Microsoft Works|MS Works]] through libwps, [[Rich Text Format]]), most notably including [[Microsoft Office]] formats after which the OpenDocument specification was "approved for release as an ISO and IEC International Standard" under the name ISO/IEC 26300:2006..@@@@1@59@@danf@17-8-2009 10640770@unknown@formal@none@1@S@=== Microsoft Office interoperability ===@@@@1@5@@danf@17-8-2009 10640780@unknown@formal@none@1@S@In response to Microsoft's recent movement towards using the [[Office Open XML]] format in [[Microsoft Office 2007]], [[Novell]] has released an [[Office Open XML]] converter for OOo under a liberal [[BSD license]] (along with [[GNU GPL]] and [[LGPL]] licensed libraries), that will be submitted for inclusion into the OpenOffice.org project.@@@@1@50@@danf@17-8-2009 10640790@unknown@formal@none@1@S@This allows OOo to read and write Microsoft OpenXML-formatted word processing documents (.docx) in OpenOffice.org.@@@@1@15@@danf@17-8-2009 10640800@unknown@formal@none@1@S@Currently it works only with the latest Novell edition of OpenOffice.org.@@@@1@11@@danf@17-8-2009 10640810@unknown@formal@none@1@S@[[Sun Microsystems]] has developed an ODF plugin for Microsoft Office which enables users of Microsoft Office Word, Excel and PowerPoint to read and write ODF documents.@@@@1@26@@danf@17-8-2009 10640820@unknown@formal@none@1@S@The plugin currently works with Microsoft Office 2003, Microsoft Office XP and Microsoft Office 2000.@@@@1@15@@danf@17-8-2009 10640830@unknown@formal@none@1@S@Support for Microsoft Office 2007 is only available in combination with Microsoft Office 2007 SP1.@@@@1@15@@danf@17-8-2009 10640840@unknown@formal@none@1@S@Several software companies (including Microsoft and Novell) are working on an add-in for Microsoft Office that allows reading and writing ODF files.@@@@1@22@@danf@17-8-2009 10640850@unknown@formal@none@1@S@Currently it works only for Microsoft Word 2007 / XP / 2003.@@@@1@12@@danf@17-8-2009 10640860@unknown@formal@none@1@S@Microsoft provides a compatibility pack to read and write Office Open XML files with Office 2000, XP and 2003.@@@@1@19@@danf@17-8-2009 10640870@unknown@formal@none@1@S@The compatibility pack can also be used as a stand-alone converter with Microsoft Office 97.@@@@1@15@@danf@17-8-2009 10640880@unknown@formal@none@1@S@This might be helpful to convert older Microsoft Office files via Office Open XML to ODF if a direct conversion doesn't work as expected.@@@@1@24@@danf@17-8-2009 10640890@unknown@formal@none@1@S@The Office compatibility pack however does not install for Office 2000 or Office XP on [[Windows 9x]].@@@@1@17@@danf@17-8-2009 10640900@unknown@formal@none@1@S@Note that some office applications built with Microsoft components may refuse to import OpenOffice data.@@@@1@15@@danf@17-8-2009 10640910@unknown@formal@none@1@S@[[The Sage Group]]'s Simply Accounting, for example, can import Excel's .xls files, but refuses to accept OpenOffice.org-generated .xls files for the reason that the OOo .xls files are not "genuine Microsoft" .xls files.@@@@1@33@@danf@17-8-2009 10640920@unknown@formal@none@1@S@== Development ==@@@@1@3@@danf@17-8-2009 10640930@unknown@formal@none@1@S@=== Overview ===@@@@1@3@@danf@17-8-2009 10640940@unknown@formal@none@1@S@The OpenOffice.org [[Application Programming Interface|API]] is based on a component technology known as [[Universal Network Objects]] (UNO).@@@@1@17@@danf@17-8-2009 10640950@unknown@formal@none@1@S@It consists of a wide range of interfaces defined in a [[CORBA]]-like [[interface description language]].@@@@1@15@@danf@17-8-2009 10640960@unknown@formal@none@1@S@The [[document file format]] used is based on [[XML]] and several export and import filters.@@@@1@15@@danf@17-8-2009 10640970@unknown@formal@none@1@S@All external formats read by OpenOffice.org are converted back and forth from an internal XML representation.@@@@1@16@@danf@17-8-2009 10640980@unknown@formal@none@1@S@By using [[data compression|compression]] when saving [[XML]] to disk, files are generally smaller than the equivalent binary Microsoft Office documents.@@@@1@20@@danf@17-8-2009 10640990@unknown@formal@none@1@S@The native file format for storing documents in version 1.0 was used as the basis of the [[OASIS (organization)|OASIS]] OpenDocument file format standard, which has become the default file format in version 2.0.@@@@1@33@@danf@17-8-2009 10641000@unknown@formal@none@1@S@Development versions of the suite are released every few weeks on the developer zone of the OpenOffice.org website.@@@@1@18@@danf@17-8-2009 10641010@unknown@formal@none@1@S@The releases are meant for those who wish to test new features or are simply curious about forthcoming changes; they are not suitable for production use.@@@@1@26@@danf@17-8-2009 10641020@unknown@formal@none@1@S@=== Native desktop integration ===@@@@1@5@@danf@17-8-2009 10641030@unknown@formal@none@1@S@OpenOffice.org 1.0 was criticized for not having the [[look and feel]] of applications developed natively for the platforms on which it runs.@@@@1@22@@danf@17-8-2009 10641040@unknown@formal@none@1@S@Starting with version 2.0, OpenOffice.org uses native [[widget toolkit]], icons, and font-rendering libraries across a variety of platforms, to better match native applications and provide a smoother experience for the user.@@@@1@31@@danf@17-8-2009 10641050@unknown@formal@none@1@S@There are projects underway to further improve this integration on both [[GNOME]] and [[KDE]].@@@@1@14@@danf@17-8-2009 10641060@unknown@formal@none@1@S@This issue has been particularly pronounced on Mac OS X, whose standard user interface looks noticeably different from either Windows or [[X11]]-based desktop environments and requires the use of programming toolkits unfamiliar to most OpenOffice.org developers.@@@@1@36@@danf@17-8-2009 10641070@unknown@formal@none@1@S@There are two implementations of OpenOffice.org available for OS X:@@@@1@10@@danf@17-8-2009 10641080@unknown@formal@none@1@S@;OpenOffice.org Mac OS X (X11):@@@@1@5@@danf@17-8-2009 10641090@unknown@formal@none@1@S@This official implementation requires the installation of [[X11.app]] or [[XDarwin]], and is a close port of the well-tested Unix version.@@@@1@20@@danf@17-8-2009 10641100@unknown@formal@none@1@S@It is functionally equivalent to the Unix version, and its user interface resembles the [[look and feel]] of that version; for example, the application uses its own [[menu bar]] instead of the OS X menu at the top of the screen.@@@@1@41@@danf@17-8-2009 10641110@unknown@formal@none@1@S@It also requires system fonts to be converted to X11 format for OpenOffice.org to use them (which can be done during application installation).@@@@1@23@@danf@17-8-2009 10641120@unknown@formal@none@1@S@;OpenOffice.org Aqua:@@@@1@2@@danf@17-8-2009 10641130@unknown@formal@none@1@S@After a first step (completed) using [[Carbon (API)|Carbon]], OpenOffice.org Aqua switched to [[Cocoa (API)|Cocoa]] technology, and an [[Aqua (GUI)|Aqua]] version (based on [[Cocoa (API)|Cocoa]]) is also being developed under the aegis of OpenOffice.org, with a Beta version currently available.@@@@1@39@@danf@17-8-2009 10641140@unknown@formal@none@1@S@Sun Microsystems is collaborating with OOo to further development of the Aqua version of OpenOffice.org for Mac.@@@@1@17@@danf@17-8-2009 10641150@unknown@formal@none@1@S@=== Future ===@@@@1@3@@danf@17-8-2009 10641160@unknown@formal@none@1@S@Currently, a developed preview of OpenOffice.org 3 (OOo-dev 3.0) is available for download.@@@@1@13@@danf@17-8-2009 10641170@unknown@formal@none@1@S@Among the planned features for OOo 3.0, set to be released by September 2008 , are:@@@@1@16@@danf@17-8-2009 10641180@unknown@formal@none@1@S@* Personal Information Manager ([[Personal Information Manager|PIM]]), probably based on [[Mozilla Thunderbird|Thunderbird]]/[[Lightning (software)|Lightning]]@@@@1@13@@danf@17-8-2009 10641190@unknown@formal@none@1@S@* PDF import into Draw (to maintain correct layout of the original PDF)@@@@1@13@@danf@17-8-2009 10641200@unknown@formal@none@1@S@* [[OOXML]] document support for opening documents created in [[Office 2007]]@@@@1@11@@danf@17-8-2009 10641210@unknown@formal@none@1@S@* Support for [[Mac OS X]] [[Aqua (user interface)|Aqua]] platform@@@@1@10@@danf@17-8-2009 10641220@unknown@formal@none@1@S@* Extensions, to add third party functionality.@@@@1@7@@danf@17-8-2009 10641230@unknown@formal@none@1@S@* Presenter screen in Impress with multi-screen support@@@@1@8@@danf@17-8-2009 10641240@unknown@formal@none@1@S@=== Other projects ===@@@@1@4@@danf@17-8-2009 10641250@unknown@formal@none@1@S@A number of products are [http://wiki.services.openoffice.org/wiki/DerivedWorks derived from OpenOffice.org].@@@@1@9@@danf@17-8-2009 10641260@unknown@formal@none@1@S@Among the more well-known ones are Sun StarOffice and NeoOffice.@@@@1@10@@danf@17-8-2009 10641270@unknown@formal@none@1@S@The OpenOffice.org site also lists a large variety of [http://wiki.services.openoffice.org/wiki/OpenOffice.org_Solutions complementary products] including groupware solutions.@@@@1@15@@danf@17-8-2009 10641280@unknown@formal@none@1@S@==== NeoOffice ====@@@@1@3@@danf@17-8-2009 10641290@unknown@formal@none@1@S@[[NeoOffice]] is an independent [[porting|port]] that integrates with [[Mac OS X|OS X]]’s [[Aqua (GUI)|Aqua]] user interface using [[Java platform|Java]], [[Carbon (API)|Carbon]] and (increasingly) [[Cocoa (API)|Cocoa]] toolkits.@@@@1@26@@danf@17-8-2009 10641300@unknown@formal@none@1@S@NeoOffice adheres fairly closely to OS X UI standards (for example, using native pull-down menus), and has direct access to OS X’s installed fonts and printers.@@@@1@26@@danf@17-8-2009 10641310@unknown@formal@none@1@S@Its releases lag behind the official OpenOffice.org X11 releases, due to its small development team and the concurrent development of the technology used to port the user interface.@@@@1@28@@danf@17-8-2009 10641320@unknown@formal@none@1@S@Other projects run alongside the main OpenOffice.org project and are easier to contribute to.@@@@1@14@@danf@17-8-2009 10641330@unknown@formal@none@1@S@These include documentation, [[internationalisation and localisation]] and the API.@@@@1@9@@danf@17-8-2009 10641340@unknown@formal@none@1@S@==== OpenGroupware.org ====@@@@1@3@@danf@17-8-2009 10641350@unknown@formal@none@1@S@[[OpenGroupware.org]] is a set of extension programs to allow the sharing of OpenOffice.org documents, calendars, address books, [[e-mail]]s, [[instant messenger|instant messaging]] and blackboards, and provide access to other [[collaborative software|groupware]] applications.@@@@1@31@@danf@17-8-2009 10641360@unknown@formal@none@1@S@There is also an effort to create and share assorted document templates and other useful additions at OOExtras.@@@@1@18@@danf@17-8-2009 10641370@unknown@formal@none@1@S@A set of [[Perl]] extensions is available through the [[CPAN]] in order to allow OpenOffice.org document processing by external programs.@@@@1@20@@danf@17-8-2009 10641380@unknown@formal@none@1@S@These libraries do not use the OpenOffice.org API.@@@@1@8@@danf@17-8-2009 10641390@unknown@formal@none@1@S@They directly read or write the OpenOffice.org files using Perl standard file [[codec|compression/decompression]], XML access and [[UTF-8]] encoding modules.@@@@1@19@@danf@17-8-2009 10641400@unknown@formal@none@1@S@==== Portable ====@@@@1@3@@danf@17-8-2009 10641410@unknown@formal@none@1@S@A distribution of OpenOffice.org called OpenOffice.org Portable is designed to run the suite from a [[USB flash drive]].@@@@1@18@@danf@17-8-2009 10641420@unknown@formal@none@1@S@==== OxygenOffice Professional ====@@@@1@4@@danf@17-8-2009 10641430@unknown@formal@none@1@S@An enhancement of OpenOffice.org, providing: Current Version: 2.4@@@@1@8@@danf@17-8-2009 10641440@unknown@formal@none@1@S@* Possibility to run Visual Basic for Application (VBA) macros in Calc (for testing)@@@@1@14@@danf@17-8-2009 10641450@unknown@formal@none@1@S@* Improved Calc HTML export@@@@1@5@@danf@17-8-2009 10641460@unknown@formal@none@1@S@* Enhanced Access support for Base@@@@1@6@@danf@17-8-2009 10641470@unknown@formal@none@1@S@* Security fixes@@@@1@3@@danf@17-8-2009 10641480@unknown@formal@none@1@S@* Enhanced performance@@@@1@3@@danf@17-8-2009 10641490@unknown@formal@none@1@S@* Enhanced color-palette@@@@1@3@@danf@17-8-2009 10641500@unknown@formal@none@1@S@* Enhanced help menu, additional User’s Manual, and extended tips for beginners@@@@1@12@@danf@17-8-2009 10641510@unknown@formal@none@1@S@Optionally it provides, free for personal and professional use:@@@@1@9@@danf@17-8-2009 10641520@unknown@formal@none@1@S@* More than 3,200 graphics, both clip art and photos.@@@@1@10@@danf@17-8-2009 10641530@unknown@formal@none@1@S@* Several templates and sample documents@@@@1@6@@danf@17-8-2009 10641540@unknown@formal@none@1@S@* Over 90 free fonts.@@@@1@5@@danf@17-8-2009 10641550@unknown@formal@none@1@S@* Additional tools like OOoWikipedia@@@@1@5@@danf@17-8-2009 10641560@unknown@formal@none@1@S@====Extensions====@@@@1@1@@danf@17-8-2009 10641570@unknown@formal@none@1@S@Since version 2.0.4, OpenOffice.org has supported extensions in a similar manner to [[Mozilla Firefox]].@@@@1@14@@danf@17-8-2009 10641580@unknown@formal@none@1@S@Extensions make it easy to add new functionality to an existing OpenOffice.org installation.@@@@1@13@@danf@17-8-2009 10641590@unknown@formal@none@1@S@The [http://extensions.services.openoffice.org/most_pop_ext OpenOffice.org Extension Repository] lists already more than 80 extensions.@@@@1@11@@danf@17-8-2009 10641600@unknown@formal@none@1@S@Developers can easily build new extensions for OpenOffice.org, for example by using the [http://wiki.services.openoffice.org/wiki/OpenOffice_NetBeans_Integration OpenOffice.org API Plugin for NetBeans].@@@@1@19@@danf@17-8-2009 10641610@unknown@formal@none@1@S@==== The OpenOffice.org Bibliographic Project ====@@@@1@6@@danf@17-8-2009 10641620@unknown@formal@none@1@S@This aims to incorporate a powerful [[reference management software]] into the suite.@@@@1@12@@danf@17-8-2009 10641630@unknown@formal@none@1@S@The new major addition is slated for inclusion with the standard OpenOffice.org release on late-2007 to mid-2008, or possibly later depending upon the availability of programmers.@@@@1@26@@danf@17-8-2009 10641640@unknown@formal@none@1@S@=== Security ===@@@@1@3@@danf@17-8-2009 10641650@unknown@formal@none@1@S@OpenOffice.org includes a security team, and as of June 2008 the security organization [[Secunia]] reports no known unpatched security flaws for the software.@@@@1@23@@danf@17-8-2009 10641660@unknown@formal@none@1@S@[[Kaspersky Lab]] has shown a [[proof of concept]] virus for OpenOffice.org.@@@@1@11@@danf@17-8-2009 10641670@unknown@formal@none@1@S@This shows OOo viruses are possible, but there is no known virus "in the wild".@@@@1@15@@danf@17-8-2009 10641680@unknown@formal@none@1@S@In a private meeting of the French Ministry of Defense, macro-related security issues were raised.@@@@1@15@@danf@17-8-2009 10641690@unknown@formal@none@1@S@OpenOffice.org developers have responded and noted that the supposed vulnerability had not been announced through "well defined procedures" for disclosure and that the ministry had revealed nothing specific.@@@@1@28@@danf@17-8-2009 10641700@unknown@formal@none@1@S@However, the developers have been in talks with the researcher concerning the supposed vulnerability.@@@@1@14@@danf@17-8-2009 10641710@unknown@formal@none@1@S@As with Microsoft Word, documents created in OpenOffice can contain [[metadata]] which may include a complete history of what was changed, when and by whom.@@@@1@25@@danf@17-8-2009 10641720@unknown@formal@none@1@S@== Ownership ==@@@@1@3@@danf@17-8-2009 10641730@unknown@formal@none@1@S@The project and software are informally referred to as ''OpenOffice'', but project organizers report that this term is a [[trademark]] held by another party, requiring them to adopt ''OpenOffice.org'' as its formal name.@@@@1@33@@danf@17-8-2009 10641740@unknown@formal@none@1@S@(Due to a similar trademark issue, the [[Brazilian Portuguese]] version of the suite is distributed under the name ''BrOffice.org''.)@@@@1@19@@danf@17-8-2009 10641750@unknown@formal@none@1@S@Development is managed by staff members of StarOffice.@@@@1@8@@danf@17-8-2009 10641760@unknown@formal@none@1@S@Some delay and difficulty in implementing external contributions to the core codebase (even those from the project's corporate sponsors) has been noted.@@@@1@22@@danf@17-8-2009 10641770@unknown@formal@none@1@S@Currently, there are [http://wiki.services.openoffice.org/wiki/DerivedWorks several derived and/or proprietary works based on OOo], with some of them being:@@@@1@17@@danf@17-8-2009 10641780@unknown@formal@none@1@S@* Sun Microsystem's [[StarOffice]], with various complementary add-ons.@@@@1@8@@danf@17-8-2009 10641790@unknown@formal@none@1@S@* IBM's [[Lotus Symphony]], with a new interface based on [[Eclipse (software)|Eclipse]] (based on OO.o 1.x).@@@@1@16@@danf@17-8-2009 10641800@unknown@formal@none@1@S@* OpenOffice.org Novell edition, integrated with [[Novell Evolution|Evolution]] and with a [[OOXML]] filter.@@@@1@13@@danf@17-8-2009 10641810@unknown@formal@none@1@S@* Beijing [[Redflag]] Chinese 2000's [[RedOffice]], fully localized in Chinese characters.@@@@1@11@@danf@17-8-2009 10641820@unknown@formal@none@1@S@* Planamesa's [[NeoOffice]] for [[Mac OS X]] with Aqua support via Java.@@@@1@12@@danf@17-8-2009 10641830@unknown@formal@none@1@S@In [[May 23]], [[2007]], the OpenOffice.org community and Redflag Chinese 2000 Software Co, Ltd. announced a joint development effort focused on integrating the new features that have been added in the RedOffice localization of OpenOffice.org, as well as quality assurance and work on the core applications.@@@@1@46@@danf@17-8-2009 10641840@unknown@formal@none@1@S@Additionally, Redflag Chinese 2000 made public its commitment to the global OO.o community stating it would "strengthen its support of the development of the world's leading free and open source productivity suite", adding around 50 engineers (that have been working on RedOffice since 2006) to the project.@@@@1@47@@danf@17-8-2009 10641850@unknown@formal@none@1@S@In [[September 10]], [[2007]], the OO.o community announced that [[IBM]] had joined to support the development of OpenOffice.org.@@@@1@18@@danf@17-8-2009 10641860@unknown@formal@none@1@S@"IBM will be making initial code contributions that it has been developing as part of its Lotus Notes product, including accessibility enhancements, and will be making ongoing contributions to the feature richness and code quality of OpenOffice.org.@@@@1@37@@danf@17-8-2009 10641870@unknown@formal@none@1@S@Besides working with the community on the free productivity suite's software, IBM will also leverage OpenOffice.org technology in its products" as has been seen with [[Lotus Symphony]].@@@@1@27@@danf@17-8-2009 10641880@unknown@formal@none@1@S@Sean Poulley, the vice president of business and strategy in IBM's [[Lotus Software]] division said that IBM plans to take a leadership role in the OpenOffice.org community together with other companies such as Sun Microsystems.@@@@1@35@@danf@17-8-2009 10641890@unknown@formal@none@1@S@IBM will work within the leadership structure that exists.@@@@1@9@@danf@17-8-2009 10641900@unknown@formal@none@1@S@As of [[October 02]], [[2007]], [[Michael Meeks]] announced (and generated an answer by Sun's [[Simon Phipps]] and Mathias Bauer) a derived OpenOffice.org work, under the wing of his employer [[Novell]], with the purpose of including new features and fixes that do not get easily integrated in the OOo-build up-stream core.@@@@1@50@@danf@17-8-2009 10641910@unknown@formal@none@1@S@The work is called Go-OO (http://go-oo.org/) a name under which alternative OO.o software has been available for five years.@@@@1@19@@danf@17-8-2009 10641920@unknown@formal@none@1@S@The new features are shared with Novell's edition of OOo and include:@@@@1@12@@danf@17-8-2009 10641930@unknown@formal@none@1@S@* [[Visual Basic for Applications|VBA]] macros support.@@@@1@7@@danf@17-8-2009 10641940@unknown@formal@none@1@S@* Faster start up time.@@@@1@5@@danf@17-8-2009 10641950@unknown@formal@none@1@S@* "A [[Linear programming|linear optimization]] solver to optimize a cell value based on arbitrary constraints built into Calc".@@@@1@18@@danf@17-8-2009 10641960@unknown@formal@none@1@S@* Multimedia content supports into documents, using the [[gstreamer]] multimedia framework.@@@@1@11@@danf@17-8-2009 10641970@unknown@formal@none@1@S@* Support for [[Microsoft Works]] formats, [[WordPerfect]] graphics (WPG format) and T602 files imports.@@@@1@14@@danf@17-8-2009 10641980@unknown@formal@none@1@S@[http://wiki.services.openoffice.org/wiki/Contributing_Patches Details about the patch handling including metrics] can be found on the OpenOffice.org site.@@@@1@15@@danf@17-8-2009 10641990@unknown@formal@none@1@S@== Reactions ==@@@@1@3@@danf@17-8-2009 10642000@unknown@formal@none@1@S@Federal Computer Week issue listed OpenOffice.org as one of the "5 stars of open-source products."@@@@1@15@@danf@17-8-2009 10642010@unknown@formal@none@1@S@In contrast, OpenOffice.org was used in [[2005]] by ''[[The Guardian]]'' newspaper to illustrate what it claims are the limitations of open-source software, although the article does finish by stating that the software may be better than MS Word for books.@@@@1@40@@danf@17-8-2009 10642020@unknown@formal@none@1@S@=== Market share ===@@@@1@4@@danf@17-8-2009 10642030@unknown@formal@none@1@S@It is extremely difficult to estimate the market share of OpenOffice.org due to the fact that OpenOffice.org can be freely distributed via download sites including mirrors, peer-to-peer networks, CDs, Linux distros, etc.@@@@1@32@@danf@17-8-2009 10642040@unknown@formal@none@1@S@Nevertheless, the OpenOffice.org tries to capture key adoption data in a market share analysis@@@@1@14@@danf@17-8-2009 10642050@unknown@formal@none@1@S@Although Microsoft Office retains 95% of the general market as measured by revenue, OpenOffice.org and StarOffice have secured 14% of the large enterprise market as of 2004 and 19% of the small to midsize business market in 2005.@@@@1@38@@danf@17-8-2009 10642060@unknown@formal@none@1@S@The OpenOffice.org web site reports more than 98 million downloads.@@@@1@10@@danf@17-8-2009 10642070@unknown@formal@none@1@S@Other large scale users of OpenOffice.org include [[Ministry of Defence (Singapore)|Singapore’s Ministry of Defence]], and [[Bristol]] City Council in the UK.@@@@1@21@@danf@17-8-2009 10642080@unknown@formal@none@1@S@In [[France]], OpenOffice.org has attracted the attention of both local and national government administrations who wish to rationalize their software procurement, as well as have stable, standard file formats for archival purposes.@@@@1@32@@danf@17-8-2009 10642090@unknown@formal@none@1@S@It is now the official office suite for the [[French Gendarmerie]].@@@@1@11@@danf@17-8-2009 10642100@unknown@formal@none@1@S@Several government organizations in India, such as [[IIT Bombay]] (a renowned technical institute), the [[Supreme Court of India]], the [[Allahabad High Court]], which use Linux, completely rely on OpenOffice.org for their administration.@@@@1@32@@danf@17-8-2009 10642110@unknown@formal@none@1@S@On [[October 4]], [[2005]], Sun and [[Google]] announced a strategic partnership.@@@@1@11@@danf@17-8-2009 10642120@unknown@formal@none@1@S@As part of this agreement, Sun will add a Google search bar to OpenOffice.org, Sun and Google will engage in joint marketing activities as well as joint research and development, and Google will help distribute OpenOffice.org.@@@@1@36@@danf@17-8-2009 10642130@unknown@formal@none@1@S@Google is currently distributing StarOffice as part of the [[Google Pack]].@@@@1@11@@danf@17-8-2009 10642140@unknown@formal@none@1@S@Besides StarOffice, there are still a number of OpenOffice.org derived commercial products.@@@@1@12@@danf@17-8-2009 10642150@unknown@formal@none@1@S@Most of them are developed under [[SISSL]] license (which is valid up to OpenOffice.org 2.0 Beta 2).@@@@1@17@@danf@17-8-2009 10642160@unknown@formal@none@1@S@In general they are targeted at local or niche market, with proprietary add-ons such as speech recognition module, automatic database connection, or better [[CJK]] support.@@@@1@25@@danf@17-8-2009 10642170@unknown@formal@none@1@S@In July 2007 Everex, a division of First International Computer and the 9th largest PC supplier in the U.S., began shipping systems preloaded with OpenOffice.org 2.2 into Wal-Mart and Sam's Club throughout North America.@@@@1@34@@danf@17-8-2009 10642180@unknown@formal@none@1@S@In September 2007 IBM announced that it would supply and support OpenOffice.org branded as [[Lotus Symphony]], and integrated into Lotus Notes.@@@@1@21@@danf@17-8-2009 10642190@unknown@formal@none@1@S@IBM also announced 35 developers would be assigned to work on OpenOffice.org, and that it would join the OpenOffice.org foundation.@@@@1@20@@danf@17-8-2009 10642200@unknown@formal@none@1@S@Commentators noted parallels between IBM's 2000 support of Linux and this announcement.@@@@1@12@@danf@17-8-2009 10642210@unknown@formal@none@1@S@=== Java controversy ===@@@@1@4@@danf@17-8-2009 10642220@unknown@formal@none@1@S@In the past OpenOffice.org was criticized for an increasing dependency on the [[Java Runtime Environment]] which was not [[free software]].@@@@1@20@@danf@17-8-2009 10642230@unknown@formal@none@1@S@That Sun Microsystems is both the creator of Java and the chief supporter of OpenOffice.org drew accusations of ulterior motives for this technology choice.@@@@1@24@@danf@17-8-2009 10642240@unknown@formal@none@1@S@Version 1 depended on the [[Java Runtime Environment]] (JRE) being present on the user’s computer for some auxiliary functions, but version 2 increased the suite’s use of Java requiring a JRE.@@@@1@31@@danf@17-8-2009 10642250@unknown@formal@none@1@S@In response, [[Red Hat]] increased their efforts to improve [[free Java implementations]].@@@@1@12@@danf@17-8-2009 10642260@unknown@formal@none@1@S@Red Hat’s [[Fedora (Linux distribution)|Fedora Core]] 4 (released on [[June 13]], [[2005]]) included a beta version of OpenOffice.org version 2, running on [[GNU Compiler for Java|GCJ]] and [[GNU Classpath]].@@@@1@29@@danf@17-8-2009 10642270@unknown@formal@none@1@S@The issue of OpenOffice.org’s use of Java came to the fore in May 2005, when [[Richard Stallman]] appeared to call for a [[fork (software)|fork]] of the application in a posting on the [[Free Software Foundation]] website.@@@@1@36@@danf@17-8-2009 10642280@unknown@formal@none@1@S@This led to discussions within the OpenOffice.org community and between Sun staff and developers involved in [[GNU Classpath]], a free replacement for Sun’s Java implementation.@@@@1@25@@danf@17-8-2009 10642290@unknown@formal@none@1@S@Later that year, the OpenOffice.org developers also placed into their development guidelines various requirements to ensure that future versions of OpenOffice.org could be run on free implementations of Java and fixed the issues which previously prevented OpenOffice.org 2.0 from using free software Java implementations.@@@@1@44@@danf@17-8-2009 10642300@unknown@formal@none@1@S@On [[November 13]], [[2006]], Sun committed to releasing Java under the [[GNU General Public License]] in the near future.@@@@1@19@@danf@17-8-2009 10642310@unknown@formal@none@1@S@This process would end OpenOffice.org's dependence on [[non-free]] software.@@@@1@9@@danf@17-8-2009 10642320@unknown@formal@none@1@S@Between November 2006 and May 2007, Sun Microsystems made available most of their Java technologies under the GNU General Public License, in compliance with the specifications of the Java Community Process, thus making almost all of Sun's Java also free software.@@@@1@41@@danf@17-8-2009 10642330@unknown@formal@none@1@S@The following areas of OpenOffice.org 2.0 depend on the JRE being present:@@@@1@12@@danf@17-8-2009 10642340@unknown@formal@none@1@S@* The [[media player (application software)|media player]] on Unix-like systems@@@@1@10@@danf@17-8-2009 10642350@unknown@formal@none@1@S@* All document wizards in Writer@@@@1@6@@danf@17-8-2009 10642360@unknown@formal@none@1@S@* Accessibility tools@@@@1@3@@danf@17-8-2009 10642370@unknown@formal@none@1@S@* Report Autopilot@@@@1@3@@danf@17-8-2009 10642380@unknown@formal@none@1@S@* [[JDBC]] driver support@@@@1@4@@danf@17-8-2009 10642390@unknown@formal@none@1@S@* [[Hsqldb|HSQL]] database engine, which is used in OpenOffice.org Base@@@@1@10@@danf@17-8-2009 10642400@unknown@formal@none@1@S@* [[XSLT]] filters@@@@1@3@@danf@17-8-2009 10642410@unknown@formal@none@1@S@* [[BeanShell]], the [[NetBeans]] scripting language and the Java UNO bridge@@@@1@11@@danf@17-8-2009 10642420@unknown@formal@none@1@S@* Export filters to the Aportis.doc (.pdb) format for the [[Palm OS]] or [[Pocket Word]] (.psw) format for the [[Pocket PC]]@@@@1@21@@danf@17-8-2009 10642430@unknown@formal@none@1@S@* Export filter to [[LaTeX]]@@@@1@5@@danf@17-8-2009 10642440@unknown@formal@none@1@S@* Export filter to [[MediaWiki]]'s [[wikitext]]@@@@1@6@@danf@17-8-2009 10642450@unknown@formal@none@1@S@A common point of confusion is that [[mail merge]] to generate emails requires the Java API JavaMail in [[StarOffice]]; however, as of version 2.0.1, OpenOffice.org uses a [[Python (programming language)|Python]]-component instead.@@@@1@31@@danf@17-8-2009 10642460@unknown@formal@none@1@S@=== Complementary software ===@@@@1@4@@danf@17-8-2009 10642470@unknown@formal@none@1@S@OpenOffice.org provides replacement for MS Office's [[Microsoft Word]], [[Microsoft Excel]], [[Microsoft PowerPoint]], [[Microsoft Access]], [[Equation Editor|Microsoft Equation Editor]] and [[Microsoft Visio]].@@@@1@21@@danf@17-8-2009 10642480@unknown@formal@none@1@S@But to level the equivalent functionality from the rest of MS Office, OOo can be complemented with other open source programs such as:@@@@1@23@@danf@17-8-2009 10642490@unknown@formal@none@1@S@* [[Novell Evolution|Evolution]] or [[Mozilla Thunderbird|Thunderbird]]/[[Lightning (software)|Lightning]] for a PIM like [[Microsoft Outlook]].@@@@1@13@@danf@17-8-2009 10642500@unknown@formal@none@1@S@* [[OpenProj]] (which seeks integration with OOo, but might be limited due to licensing issues) for [[Microsoft Project]].@@@@1@18@@danf@17-8-2009 10642510@unknown@formal@none@1@S@* [[Scribus]] for [[Microsoft Publisher]]@@@@1@5@@danf@17-8-2009 10642520@unknown@formal@none@1@S@* [[O3spaces]] for [[Sharepoint]]@@@@1@4@@danf@17-8-2009 10642530@unknown@formal@none@1@S@Microsoft also provides Administrative Template Files ("adm files") that allow MS Office to be configured using Windows Group Policy.@@@@1@19@@danf@17-8-2009 10642540@unknown@formal@none@1@S@Equivalent functionality for OpenOffice.org is provided by [http://openoffice-enterprise.com/ OpenOffice-Enterprise], a commercial product from Open Office Technology, Inc.@@@@1@17@@danf@17-8-2009 10642550@unknown@formal@none@1@S@=== Issues ===@@@@1@3@@danf@17-8-2009 10642560@unknown@formal@none@1@S@OpenOffice.org has been criticized for slow start times and extensive CPU and RAM usage in comparison to other competitive software such as Microsoft Office.@@@@1@24@@danf@17-8-2009 10642570@unknown@formal@none@1@S@In comparison, tests between OpenOffice.org 2.2 and Microsoft Office 2007 have found that OpenOffice.org takes approximately 2 times the processing time and memory to load itself along with a blank file; and took approximately 4.7 times the processing time and 3.9 times the memory to open an extremely large spreadsheet file.@@@@1@49@@danf@17-8-2009 10642580@unknown@formal@none@1@S@Critics have pointed to excessive code bloat and OpenOffice.org's loading of the [[Java Virtual Machine|Java Runtime Environment]] as possible reasons for the slow speeds and excessive memory usage.@@@@1@28@@danf@17-8-2009 10642590@unknown@formal@none@1@S@However, since OpenOffice.org 2.2 the performance of OpenOffice.org has been improved dramatically.@@@@1@12@@danf@17-8-2009 10642600@unknown@formal@none@1@S@One of the greatest challenges is its ability to be truly cross compatible with other applications.@@@@1@16@@danf@17-8-2009 10642610@unknown@formal@none@1@S@Since Openoffice.org is forced to reverse engineer proprietary binary formats due to unavailability of open specifications, slight formatting incompatibilities tend to exist when files are saved in non-native format.@@@@1@29@@danf@17-8-2009 10642620@unknown@formal@none@1@S@For example, a complex .doc document formatted under OpenOffice.org, is usually not displayed with the correct format when opened with Microsoft Office.@@@@1@22@@danf@17-8-2009 10642630@unknown@formal@none@1@S@== Retail ==@@@@1@3@@danf@17-8-2009 10642640@unknown@formal@none@1@S@The [[free software license]] under which OpenOffice.org is distributed allows unlimited use of the software for both home and business use, including unlimited redistribution of the software.@@@@1@27@@danf@17-8-2009 10642650@unknown@formal@none@1@S@Several businesses sell the OpenOffice.org suite on auction websites such as [[eBay]], offering value-added services such as 24/7 technical support, download mirrors, and CD mailing.@@@@1@25@@danf@17-8-2009 10642660@unknown@formal@none@1@S@However, often the 24/7 support offered is not provided by the company selling the software, but rather by the official OpenOffice.org mailing list.@@@@1@23@@danf@17-8-2009 10650010@unknown@formal@none@1@S@
Parsing
@@@@1@1@@danf@17-8-2009 10650020@unknown@formal@none@1@S@In [[computer science]] and [[linguistics]], '''parsing''', or, more formally, '''syntactic analysis''', is the process of analyzing a sequence of [[Token (parser)|tokens]] to determine grammatical structure with respect to a given (more or less) [[formal grammar]].@@@@1@35@@danf@17-8-2009 10650030@unknown@formal@none@1@S@A '''parser''' is thus one of the components in an [[interpreter]] or [[compiler]], where it captures the implied hierarchy of the input text and transforms it into a form suitable for further processing (often some kind of [[parse tree]], [[abstract syntax tree]] or other hierarchical structure) and normally checks for syntax errors at the same time.@@@@1@56@@danf@17-8-2009 10650040@unknown@formal@none@1@S@The parser often uses a separate [[lexical analyser]] to create tokens from the sequence of input characters.@@@@1@17@@danf@17-8-2009 10650050@unknown@formal@none@1@S@Parsers may be programmed by hand or may be semi-automatically generated (in some programming language) by a tool (such as [[Yet Another Compiler Compiler|Yacc]]) from a grammar written in [[Backus-Naur form]].@@@@1@31@@danf@17-8-2009 10650060@unknown@formal@none@1@S@Parsing is also an earlier term for the diagramming of sentences of natural languages, and is still used for the diagramming of [[Inflection|inflected]] languages, such as the [[Romance languages|Romance languages]] or [[Latin]].@@@@1@32@@danf@17-8-2009 10650070@unknown@formal@none@1@S@Parsers can also be constructed as executable specifications of grammars in functional programming languages.@@@@1@14@@danf@17-8-2009 10650080@unknown@formal@none@1@S@Frost, Hafiz and Callaghan have built on the work of others to construct a set of [[higher-order function]]s (called [[parser combinators]]) which allow polynomial time and space complexity top-down parser to be constructed as executable specifications of ambiguous grammars containing left-recursive productions.@@@@1@42@@danf@17-8-2009 10650090@unknown@formal@none@1@S@The [http://www.cs.uwindsor.ca/~hafiz/proHome.html X-SAIGA] site has more about the algorithms and implementation details.@@@@1@12@@danf@17-8-2009 10650100@unknown@formal@none@1@S@== Human languages ==@@@@1@4@@danf@17-8-2009 10650110@unknown@formal@none@1@S@:''Also see [[:Category:Natural language parsing]]''@@@@1@5@@danf@17-8-2009 10650120@unknown@formal@none@1@S@In some [[machine translation]] and [[natural language processing]] systems, human languages are parsed by computer programs.@@@@1@16@@danf@17-8-2009 10650130@unknown@formal@none@1@S@Human sentences are not easily parsed by programs, as there is substantial [[syntactic ambiguity|ambiguity]] in the structure of human language.@@@@1@20@@danf@17-8-2009 10650140@unknown@formal@none@1@S@In order to parse natural language data, researchers must first agree on the [[grammar]] to be used.@@@@1@17@@danf@17-8-2009 10650150@unknown@formal@none@1@S@The choice of syntax is affected by both [[linguistic]] and computational concerns; for instance some parsing systems use [[lexical functional grammar]], but in general, parsing for grammars of this type is known to be [[NP-complete]].@@@@1@35@@danf@17-8-2009 10650160@unknown@formal@none@1@S@[[Head-driven phrase structure grammar]] is another linguistic formalism which has been popular in the parsing community, but other research efforts have focused on less complex formalisms such as the one used in the Penn [[Treebank]].@@@@1@35@@danf@17-8-2009 10650170@unknown@formal@none@1@S@[[Shallow parsing]] aims to find only the boundaries of major constituents such as noun phrases.@@@@1@15@@danf@17-8-2009 10650180@unknown@formal@none@1@S@Another popular strategy for avoiding linguistic controversy is [[dependency grammar]] parsing.@@@@1@11@@danf@17-8-2009 10650190@unknown@formal@none@1@S@Most modern parsers are at least partly [[statistics|statistical]]; that is, they rely on a corpus of training data which has already been annotated (parsed by hand).@@@@1@26@@danf@17-8-2009 10650200@unknown@formal@none@1@S@This approach allows the system to gather information about the frequency with which various constructions occur in specific contexts.@@@@1@19@@danf@17-8-2009 10650210@unknown@formal@none@1@S@''(See [[machine learning]].)''@@@@1@3@@danf@17-8-2009 10650220@unknown@formal@none@1@S@Approaches which have been used include straightforward [[PCFG]]s (probabilistic context free grammars), [[maximum entropy]], and [[neural net]]s.@@@@1@17@@danf@17-8-2009 10650230@unknown@formal@none@1@S@Most of the more successful systems use ''lexical'' statistics (that is, they consider the identities of the words involved, as well as their [[part of speech]]).@@@@1@26@@danf@17-8-2009 10650240@unknown@formal@none@1@S@However such systems are vulnerable to [[overfitting]] and require some kind of smoothing to be effective.@@@@1@16@@danf@17-8-2009 10650250@unknown@formal@none@1@S@Parsing algorithms for natural language cannot rely on the grammar having 'nice' properties as with manually-designed grammars for programming languages.@@@@1@20@@danf@17-8-2009 10650260@unknown@formal@none@1@S@As mentioned earlier some grammar formalisms are very computationally difficult to parse; in general, even if the desired structure is not [[context-free]], some kind of context-free approximation to the grammar is used to perform a first pass.@@@@1@37@@danf@17-8-2009 10650265@unknown@formal@none@1@S@Algorithms which use context-free grammars often rely on some variant of the [[CKY algorithm]], usually with some [[heuristic (computer science)|heuristic]] to prune away unlikely analyses to save time.@@@@1@28@@danf@17-8-2009 10650270@unknown@formal@none@1@S@''(See [[chart parsing]].)''@@@@1@3@@danf@17-8-2009 10650280@unknown@formal@none@1@S@However some systems trade speed for accuracy using, eg, linear-time versions of the [[Shift-reduce parsing|shift-reduce]] algorithm.@@@@1@16@@danf@17-8-2009 10650290@unknown@formal@none@1@S@A somewhat recent development has been [[parse reranking]] in which the parser proposes some large number of analyses, and a more complex system selects the best option.@@@@1@27@@danf@17-8-2009 10650300@unknown@formal@none@1@S@It is normally branching of one part and its subparts@@@@1@10@@danf@17-8-2009 10650310@unknown@formal@none@1@S@== Programming languages ==@@@@1@4@@danf@17-8-2009 10650320@unknown@formal@none@1@S@The most common use of a parser is as a component of a [[compiler]] or [[interpreter]].@@@@1@16@@danf@17-8-2009 10650330@unknown@formal@none@1@S@This parses the [[source code]] of a [[computer programming language]] to create some form of internal representation.@@@@1@17@@danf@17-8-2009 10650340@unknown@formal@none@1@S@Programming languages tend to be specified in terms of a [[context-free grammar]] because fast and efficient parsers can be written for them.@@@@1@22@@danf@17-8-2009 10650350@unknown@formal@none@1@S@Parsers are written by hand or generated by [[parser generator]]s.@@@@1@10@@danf@17-8-2009 10650360@unknown@formal@none@1@S@Context-free grammars are limited in the extent to which they can express all of the requirements of a language.@@@@1@19@@danf@17-8-2009 10650370@unknown@formal@none@1@S@Informally, the reason is that the memory of such a language is limited.@@@@1@13@@danf@17-8-2009 10650380@unknown@formal@none@1@S@The grammar cannot remember the presence of a construct over an arbitrarily long input; this is necessary for a language in which, for example, a name must be declared before it may be referenced.@@@@1@34@@danf@17-8-2009 10650390@unknown@formal@none@1@S@More powerful grammars that can express this constraint, however, cannot be parsed efficiently.@@@@1@13@@danf@17-8-2009 10650400@unknown@formal@none@1@S@Thus, it is a common strategy to create a relaxed parser for a context-free grammar which accepts a superset of the desired language constructs (that is, it accepts some invalid constructs); later, the unwanted constructs can be filtered out.@@@@1@39@@danf@17-8-2009 10650410@unknown@formal@none@1@S@===Overview of process===@@@@1@3@@danf@17-8-2009 10650420@unknown@formal@none@1@S@[[image:Parser_Flow.gif|right|Flow of data in a typical parser]] The following example demonstrates the common case of parsing a computer language with two levels of grammar: lexical and syntactic.@@@@1@27@@danf@17-8-2009 10650430@unknown@formal@none@1@S@The first stage is the token generation, or [[lexical analysis]], by which the input character stream is split into meaningful symbols defined by a grammar of [[regular expression]]s.@@@@1@28@@danf@17-8-2009 10650440@unknown@formal@none@1@S@For example, a calculator program would look at an input such as "12*(3+4)^2" and split it into the tokens 12, *, (, 3, +, 4, ), ^, and 2, each of which is a meaningful symbol in the context of an arithmetic expression.@@@@1@43@@danf@17-8-2009 10650450@unknown@formal@none@1@S@The parser would contain rules to tell it that the characters *, +, ^, ( and ) mark the start of a new token, so meaningless tokens like "12*" or "(3" will not be generated.@@@@1@35@@danf@17-8-2009 10650460@unknown@formal@none@1@S@The next stage is parsing or syntactic analysis, which is checking that the tokens form an allowable expression.@@@@1@18@@danf@17-8-2009 10650470@unknown@formal@none@1@S@This is usually done with reference to a [[context-free grammar]] which recursively defines components that can make up an expression and the order in which they must appear.@@@@1@28@@danf@17-8-2009 10650480@unknown@formal@none@1@S@However, not all rules defining programming languages can be expressed by context-free grammars alone, for example type validity and proper declaration of identifiers.@@@@1@23@@danf@17-8-2009 10650490@unknown@formal@none@1@S@These rules can be formally expressed with [[attribute grammar]]s.@@@@1@9@@danf@17-8-2009 10650500@unknown@formal@none@1@S@The final phase is [[Semantic analysis (computer science)|semantic parsing]] or analysis, which is working out the implications of the expression just validated and taking the appropriate action.@@@@1@27@@danf@17-8-2009 10650510@unknown@formal@none@1@S@In the case of a calculator or interpreter, the action is to evaluate the expression or program; a compiler, on the other hand, would generate some kind of code.@@@@1@29@@danf@17-8-2009 10650520@unknown@formal@none@1@S@Attribute grammars can also be used to define these actions.@@@@1@10@@danf@17-8-2009 10650530@unknown@formal@none@1@S@==Types of parsers==@@@@1@3@@danf@17-8-2009 10650540@unknown@formal@none@1@S@The task of the parser is essentially to determine if and how the input can be derived from the start symbol of the grammar.@@@@1@24@@danf@17-8-2009 10650550@unknown@formal@none@1@S@This can be done in essentially two ways:@@@@1@8@@danf@17-8-2009 10650560@unknown@formal@none@1@S@*[[Top-down parsing]] - Top-down parsing can be viewed as an attempt to find left-most derivations of an input-stream by searching for [[parse tree|parse-trees]] using a top-down expansion of the given [[formal grammar]] rules.@@@@1@33@@danf@17-8-2009 10650570@unknown@formal@none@1@S@Tokens are consumed from left to right.@@@@1@7@@danf@17-8-2009 10650580@unknown@formal@none@1@S@Inclusive choice is used to accommodate [[ambiguity]] by expanding all alternative right-hand-sides of grammar rules .@@@@1@16@@danf@17-8-2009 10650590@unknown@formal@none@1@S@[[LL parser]]s and [[recursive-descent parser]] are examples of top-down parsers, which cannot accommodate [[left recursion | left recursive]] productions.@@@@1@19@@danf@17-8-2009 10650600@unknown@formal@none@1@S@Although it has been believed that simple implementations of top-down parsing cannot accommodate direct and indirect left-recursion and may require exponential time and space complexity while parsing ambiguous [[context-free grammar]]s, more sophisticated algorithm for top-down parsing have been created by Frost, Hafiz, and Callaghan which accommodates [[ambiguity]] and [[left recursion]] in polynomial time and which generates polynomial-size representations of the potentially-exponential number of parse trees.@@@@1@65@@danf@17-8-2009 10650610@unknown@formal@none@1@S@Their algorithm is able to produce both left-most and right-most derivations of an input w.r.t. a given CFG.@@@@1@18@@danf@17-8-2009 10650620@unknown@formal@none@1@S@*[[Bottom-up parsing]] - A parser can start with the input and attempt to rewrite it to the start symbol.@@@@1@19@@danf@17-8-2009 10650630@unknown@formal@none@1@S@Intuitively, the parser attempts to locate the most basic elements, then the elements containing these, and so on.@@@@1@18@@danf@17-8-2009 10650640@unknown@formal@none@1@S@[[LR parser]]s are examples of bottom-up parsers.@@@@1@7@@danf@17-8-2009 10650650@unknown@formal@none@1@S@Another term used for this type of parser is Shift-Reduce parsing.@@@@1@11@@danf@17-8-2009 10650660@unknown@formal@none@1@S@Another important distinction is whether the parser generates a ''leftmost derivation'' or a ''rightmost derivation'' (see [[context-free grammar]]).@@@@1@18@@danf@17-8-2009 10650670@unknown@formal@none@1@S@LL parsers will generate a leftmost [[derivation]] and LR parsers will generate a rightmost derivation (although usually in reverse) .@@@@1@20@@danf@17-8-2009 10650680@unknown@formal@none@1@S@== Examples of parsers ==@@@@1@5@@danf@17-8-2009 10650690@unknown@formal@none@1@S@=== Top-down parsers ===@@@@1@4@@danf@17-8-2009 10650700@unknown@formal@none@1@S@Some of the parsers that use [[top-down parsing]] include:@@@@1@9@@danf@17-8-2009 10650710@unknown@formal@none@1@S@* [[Recursive descent parser]]@@@@1@4@@danf@17-8-2009 10650720@unknown@formal@none@1@S@* [[LL parser]] ('''L'''eft-to-right, '''L'''eftmost derivation)@@@@1@6@@danf@17-8-2009 10650730@unknown@formal@none@1@S@* [http://www.cs.uwindsor.ca/~hafiz/proHome.html X-SAIGA] - eXecutable SpecificAtIons of GrAmmars.@@@@1@8@@danf@17-8-2009 10650740@unknown@formal@none@1@S@Contains publications related to top-down parsing algorithm that supports left-recursion and ambiguity in polynomial time and space.@@@@1@17@@danf@17-8-2009 10650750@unknown@formal@none@1@S@=== Bottom-up parsers ===@@@@1@4@@danf@17-8-2009 10650760@unknown@formal@none@1@S@Some of the parsers that use [[bottom-up parsing]] include:@@@@1@9@@danf@17-8-2009 10650770@unknown@formal@none@1@S@* Precedence parser@@@@1@3@@danf@17-8-2009 10650780@unknown@formal@none@1@S@** [[Operator-precedence parser]]@@@@1@3@@danf@17-8-2009 10650790@unknown@formal@none@1@S@** [[Simple precedence parser]]@@@@1@4@@danf@17-8-2009 10650800@unknown@formal@none@1@S@* BC (bounded context) parsing@@@@1@5@@danf@17-8-2009 10650810@unknown@formal@none@1@S@* [[LR parser]] ('''L'''eft-to-right, '''R'''ightmost derivation)@@@@1@6@@danf@17-8-2009 10650820@unknown@formal@none@1@S@** [[SLR parser|Simple LR (SLR) parser]]@@@@1@6@@danf@17-8-2009 10650830@unknown@formal@none@1@S@** [[LALR parser]]@@@@1@3@@danf@17-8-2009 10650840@unknown@formal@none@1@S@** [[Canonical LR parser|Canonical LR (LR(1)) parser]]@@@@1@7@@danf@17-8-2009 10650850@unknown@formal@none@1@S@** [[GLR parser]]@@@@1@3@@danf@17-8-2009 10650860@unknown@formal@none@1@S@* [[CYK algorithm|CYK parser]]@@@@1@4@@danf@17-8-2009 10660010@unknown@formal@none@1@S@
Lexical category
@@@@1@2@@danf@17-8-2009 10660020@unknown@formal@none@1@S@In [[grammar]], a '''lexical category''' (also '''word class''', '''lexical class''', or in traditional grammar '''part of speech''') is a linguistic category of words (or more precisely ''lexical items''), which is generally defined by the [[syntactic]] or [[morphology (linguistics)|morphological]] behaviour of the lexical item in question.@@@@1@45@@danf@17-8-2009 10660030@unknown@formal@none@1@S@Common linguistic categories include ''noun'' and ''verb'', among others.@@@@1@9@@danf@17-8-2009 10660040@unknown@formal@none@1@S@There are [[open class word|open word classes]], which constantly acquire new members, and [[closed class word|closed word classes]], which acquire new members infrequently if at all.@@@@1@26@@danf@17-8-2009 10660050@unknown@formal@none@1@S@Different languages may have different lexical categories, or they might associate different properties to the same one.@@@@1@17@@danf@17-8-2009 10660060@unknown@formal@none@1@S@For example, [[Japanese language|Japanese]] has at least three classes of adjectives where English has one; Chinese and Japanese have [[measure word]]s while European languages have nothing resembling them; many languages don't have a distinction between adjectives and adverbs, or adjectives and nouns, etc.@@@@1@43@@danf@17-8-2009 10660070@unknown@formal@none@1@S@Many linguists argue that the formal distinctions between parts of speech must be made within the framework of a specific language or language family, and should not be carried over to other languages or language families.@@@@1@36@@danf@17-8-2009 10660080@unknown@formal@none@1@S@==History==@@@@1@1@@danf@17-8-2009 10660090@unknown@formal@none@1@S@The classification of words into lexical categories is found from the earliest moments in the [[history of linguistics]].@@@@1@18@@danf@17-8-2009 10660100@unknown@formal@none@1@S@In the ''[[Nirukta]]'', written in the [[5th century BCE|5th]] or [[6th century BCE]], the [[Sanskrit grammarian]] [[Yāska]] defined four main categories of words :@@@@1@24@@danf@17-8-2009 10660110@unknown@formal@none@1@S@# nāma - [[noun]]s or substantives@@@@1@6@@danf@17-8-2009 10660120@unknown@formal@none@1@S@# ākhyāta - [[verb]]s@@@@1@4@@danf@17-8-2009 10660130@unknown@formal@none@1@S@# upasarga - pre-verbs or [[prefix]]es@@@@1@6@@danf@17-8-2009 10660140@unknown@formal@none@1@S@# nipāta - [[Grammatical particle|particle]]s, invariant words (perhaps [[prepositions]])@@@@1@9@@danf@17-8-2009 10660150@unknown@formal@none@1@S@These four were grouped into two large classes: [[inflection|inflected]] (nouns and verbs) and uninflected (pre-verbs and particles).@@@@1@17@@danf@17-8-2009 10660160@unknown@formal@none@1@S@A century or two later, the [[Classical Greece|Greek]] scholar [[Plato]] wrote in the [[Cratylus (dialogue)|''Cratylus'' dialog]] that "... sentences are, I conceive, a combination of verbs [''rhēma''] and nouns [''ónoma'']".@@@@1@30@@danf@17-8-2009 10660170@unknown@formal@none@1@S@Another class, "conjunctions" (covering [[Grammatical conjunction|conjunction]]s, [[pronoun]]s, and the [[article (grammar)|article]]), was later added by [[Aristotle]].@@@@1@16@@danf@17-8-2009 10660180@unknown@formal@none@1@S@By the end of the [[2nd century BCE]], the classification scheme had been expanded into eight categories, seen in the ''[[Art of Grammar|Tékhnē grammatiké]]'':@@@@1@24@@danf@17-8-2009 10660190@unknown@formal@none@1@S@# Noun: a part of speech inflected for case, signifying a concrete or abstract entity@@@@1@15@@danf@17-8-2009 10660200@unknown@formal@none@1@S@# Verb: a part of speech without case inflection, but inflected for tense, person and number, signifying an activity or process performed or undergone@@@@1@24@@danf@17-8-2009 10660210@unknown@formal@none@1@S@# Participle: a part of speech sharing the features of the verb and the noun@@@@1@15@@danf@17-8-2009 10660220@unknown@formal@none@1@S@# Article: a part of speech inflected for case and preposed or postposed to nouns (the relative pronoun is meant by the postposed article)@@@@1@24@@danf@17-8-2009 10660230@unknown@formal@none@1@S@# Pronoun: a part of speech substitutable for a noun and marked for person@@@@1@14@@danf@17-8-2009 10660240@unknown@formal@none@1@S@# Preposition: a part of speech placed before other words in composition and in syntax@@@@1@15@@danf@17-8-2009 10660250@unknown@formal@none@1@S@# Adverb: a part of speech without inflection, in modification of or in addition to a verb@@@@1@17@@danf@17-8-2009 10660260@unknown@formal@none@1@S@# Conjunction: a part of speech binding together the discourse and filling gaps in its interpretation@@@@1@16@@danf@17-8-2009 10660270@unknown@formal@none@1@S@The [[Latin grammar]]ian [[Priscian]] ([[floruit|fl.]] [[500 CE]]) modified the above eight-fold system, substituting "[[interjection]]" for "article".@@@@1@16@@danf@17-8-2009 10660280@unknown@formal@none@1@S@It wasn't until 1767 that the [[adjective]] was taken as a separate class.@@@@1@13@@danf@17-8-2009 10660290@unknown@formal@none@1@S@Traditional English grammar is patterned after the European tradition above, and is still taught in schools and used in [[dictionaries]].@@@@1@20@@danf@17-8-2009 10660300@unknown@formal@none@1@S@It names eight parts of speech: [[noun]], [[verb]], [[adjective]], [[adverb]], [[pronoun]], [[preposition]], [[Grammatical conjunction|conjunction]], and [[interjection]] (sometimes called an exclamation).@@@@1@20@@danf@17-8-2009 10660310@unknown@formal@none@1@S@==Controversies==@@@@1@1@@danf@17-8-2009 10660320@unknown@formal@none@1@S@Since the Greek grammarians of 2nd century BCE, parts of speech have been defined by [[morphology (linguistics)|morphological]], [[syntax|syntactic]] and [[semantics|semantic]] criteria.@@@@1@21@@danf@17-8-2009 10660330@unknown@formal@none@1@S@However, there is currently no generally agreed-upon classification scheme that can apply to all languages, or even a set of criteria upon which such a scheme should be based.@@@@1@29@@danf@17-8-2009 10660340@unknown@formal@none@1@S@Linguists recognize that the above list of eight word classes is simplified and artificial.@@@@1@14@@danf@17-8-2009 10660350@unknown@formal@none@1@S@For example, "adverb" is to some extent a catch-all class that includes words with many different functions.@@@@1@17@@danf@17-8-2009 10660360@unknown@formal@none@1@S@Some have even argued that the most basic of category distinctions, that of nouns and verbs, is unfounded, or not applicable to certain languages.@@@@1@24@@danf@17-8-2009 10660370@unknown@formal@none@1@S@==Functional classification==@@@@1@2@@danf@17-8-2009 10660380@unknown@formal@none@1@S@Common ways of delimiting words by function include:@@@@1@8@@danf@17-8-2009 10660390@unknown@formal@none@1@S@* '''[[Open word classes]]:'''@@@@1@4@@danf@17-8-2009 10660400@unknown@formal@none@1@S@**[[adjective]]s@@@@1@1@@danf@17-8-2009 10660410@unknown@formal@none@1@S@**[[adverb]]s@@@@1@1@@danf@17-8-2009 10660420@unknown@formal@none@1@S@**[[interjection]]s@@@@1@1@@danf@17-8-2009 10660430@unknown@formal@none@1@S@**[[noun]]s@@@@1@1@@danf@17-8-2009 10660440@unknown@formal@none@1@S@**[[verb]]s (except [[auxiliary verb]]s)@@@@1@4@@danf@17-8-2009 10660450@unknown@formal@none@1@S@* '''[[Closed word classes]]:'''@@@@1@4@@danf@17-8-2009 10660460@unknown@formal@none@1@S@**[[auxiliary verb]]s@@@@1@2@@danf@17-8-2009 10660470@unknown@formal@none@1@S@**[[clitic]]s@@@@1@1@@danf@17-8-2009 10660480@unknown@formal@none@1@S@**[[coverb]]s@@@@1@1@@danf@17-8-2009 10660490@unknown@formal@none@1@S@**[[Grammatical conjunction|conjunction]]s@@@@1@2@@danf@17-8-2009 10660500@unknown@formal@none@1@S@**[[determiner (class)|Determiner]]s ([[article (grammar)|article]]s, [[quantifier]]s, [[demonstrative adjective]]s, and [[possessive adjective]]s)@@@@1@10@@danf@17-8-2009 10660510@unknown@formal@none@1@S@**[[grammatical particle|particle]]s@@@@1@2@@danf@17-8-2009 10660520@unknown@formal@none@1@S@**[[measure word]]s@@@@1@2@@danf@17-8-2009 10660530@unknown@formal@none@1@S@**[[adposition]]s (prepositions, postpositions, and circumpositions)@@@@1@5@@danf@17-8-2009 10660540@unknown@formal@none@1@S@**[[preverb]]s@@@@1@1@@danf@17-8-2009 10660550@unknown@formal@none@1@S@**[[pronoun]]s@@@@1@1@@danf@17-8-2009 10660560@unknown@formal@none@1@S@**[[Contraction (grammar)|contraction]]s@@@@1@2@@danf@17-8-2009 10660570@unknown@formal@none@1@S@**[[Names of numbers in English#Cardinal numbers|cardinal numbers]]@@@@1@7@@danf@17-8-2009 10660580@unknown@formal@none@1@S@==English==@@@@1@1@@danf@17-8-2009 10660590@unknown@formal@none@1@S@[[English language|English]] frequently does not [[marker (linguistics)|mark]] words as belonging to one part of speech or another.@@@@1@17@@danf@17-8-2009 10660600@unknown@formal@none@1@S@Words like ''neigh'', ''break'', ''outlaw'', ''laser'', ''microwave'' and ''telephone'' might all be either verb forms or nouns.@@@@1@17@@danf@17-8-2009 10660610@unknown@formal@none@1@S@Although ''-ly'' is an adverb marker, not all adverbs end in ''-ly'' and not all words ending in ''-ly'' are adverbs.@@@@1@21@@danf@17-8-2009 10660620@unknown@formal@none@1@S@For instance, ''tomorrow'', ''slow'', ''fast'', ''crosswise'' can all be adverbs, while ''early'', ''friendly'', ''ugly'' are all adjectives (though ''early'' can also function as an adverb).@@@@1@25@@danf@17-8-2009 10660630@unknown@formal@none@1@S@In certain circumstances, even words with primarily grammatical functions can be used as verbs or nouns, as in "We must look to the ''hows'' and not just the ''whys''" or "Miranda was ''to-ing and fro-ing'' and not paying attention".@@@@1@39@@danf@17-8-2009 10670010@unknown@formal@none@1@S@
Part-of-speech tagging
@@@@1@2@@danf@17-8-2009 10670020@unknown@formal@none@1@S@'''Part-of-speech tagging''' ('''POS tagging''' or '''POST'''), also called '''grammatical tagging''', is the process of marking up the words in a text as corresponding to a particular [[parts of speech|part of speech]], based on both its definition, as well as its context—i.e., relationship with adjacent and related words in a [[phrase]], [[sentence]], or [[paragraph]].@@@@1@53@@danf@17-8-2009 10670030@unknown@formal@none@1@S@A simplified form of this is commonly taught school-age children, in the identification of words as [[noun]]s, [[verb]]s, [[adjective]]s, [[adverb]]s, etc.@@@@1@21@@danf@17-8-2009 10670040@unknown@formal@none@1@S@Once performed by hand, POS tagging is now done in the context of [[computational linguistics]], using [[algorithms]] which associate discrete terms, as well as hidden parts of speech, in accordance with a set of descriptive tags.@@@@1@36@@danf@17-8-2009 10670050@unknown@formal@none@1@S@==History==@@@@1@1@@danf@17-8-2009 10670060@unknown@formal@none@1@S@Research on part-of-speech tagging has been closely tied to [[corpus linguistics]].@@@@1@11@@danf@17-8-2009 10670070@unknown@formal@none@1@S@The first major corpus of English for computer analysis was the [[Brown Corpus]] developed at [[Brown University]] by [[Henry Kucera]] and [[Nelson Francis]], in the mid-1960s.@@@@1@26@@danf@17-8-2009 10670080@unknown@formal@none@1@S@It consists of about 1,000,000 words of running English prose text, made up of 500 samples from randomly chosen publications.@@@@1@20@@danf@17-8-2009 10670090@unknown@formal@none@1@S@Each sample is 2,000 or more words (ending at the first sentence-end after 2,000 words, so that the corpus contains only complete sentences).@@@@1@23@@danf@17-8-2009 10670100@unknown@formal@none@1@S@The [[Brown Corpus]] was painstakingly "tagged" with part-of-speech markers over many years.@@@@1@12@@danf@17-8-2009 10670110@unknown@formal@none@1@S@A first approximation was done with a program by Greene and Rubin, which consisted of a huge handmade list of what categories could co-occur at all.@@@@1@26@@danf@17-8-2009 10670120@unknown@formal@none@1@S@For example, article then noun can occur, but article verb (arguably) cannot.@@@@1@12@@danf@17-8-2009 10670130@unknown@formal@none@1@S@The program got about 70% correct.@@@@1@6@@danf@17-8-2009 10670140@unknown@formal@none@1@S@Its results were repeatedly reviewed and corrected by hand, and later users sent in errata, so that by the late 70s the tagging was nearly perfect (allowing for some cases even human speakers might not agree on).@@@@1@37@@danf@17-8-2009 10670150@unknown@formal@none@1@S@This corpus has been used for innumerable studies of word-frequency and of part-of-speech, and inspired the development of similar "tagged" corpora in many other languages.@@@@1@25@@danf@17-8-2009 10670160@unknown@formal@none@1@S@Statistics derived by analyzing it formed the basis for most later part-of-speech tagging systems, such as CLAWS and [[VOLSUNGA]].@@@@1@19@@danf@17-8-2009 10670170@unknown@formal@none@1@S@However, by this time (2005) it has been superseded by larger corpora such as the 100 million word [[British National Corpus]].@@@@1@21@@danf@17-8-2009 10670180@unknown@formal@none@1@S@For some time, part-of-speech tagging was considered an inseparable part of [[natural language processing]], because there are certain cases where the correct part of speech cannot be decided without understanding the [[semantics]] or even the [[pragmatics]] of the context.@@@@1@39@@danf@17-8-2009 10670190@unknown@formal@none@1@S@This is extremely expensive, especially because analyzing the higher levels is much harder when multiple part-of-speech possibilities must be considered for each word.@@@@1@23@@danf@17-8-2009 10670200@unknown@formal@none@1@S@In the mid 1980s, researchers in Europe began to use [[hidden Markov model]]s (HMMs) to disambiguate parts of speech, when working to tag the [[Lancaster-Oslo-Bergen Corpus]] of British English.@@@@1@29@@danf@17-8-2009 10670210@unknown@formal@none@1@S@HMMs involve counting cases (such as from the Brown Corpus), and making a table of the probabilities of certain sequences.@@@@1@20@@danf@17-8-2009 10670220@unknown@formal@none@1@S@For example, once you've seen an article such as 'the', perhaps the next word is a noun 40% of the time, an adjective 40%, and a number 20%.@@@@1@28@@danf@17-8-2009 10670230@unknown@formal@none@1@S@Knowing this, a program can decide that "can" in "the can" is far more likely to be a noun than a verb or a modal.@@@@1@25@@danf@17-8-2009 10670240@unknown@formal@none@1@S@The same method can of course be used to benefit from knowledge about following words.@@@@1@15@@danf@17-8-2009 10670250@unknown@formal@none@1@S@More advanced ("higher order") HMMs learn the probabilities not only of pairs, but triples or even larger sequences.@@@@1@18@@danf@17-8-2009 10670260@unknown@formal@none@1@S@So, for example, if you've just seen an article and a verb, the next item may be very likely a preposition, article, or noun, but even less likely another verb.@@@@1@30@@danf@17-8-2009 10670270@unknown@formal@none@1@S@When several ambiguous words occur together, the possibilities multiply.@@@@1@9@@danf@17-8-2009 10670280@unknown@formal@none@1@S@However, it is easy to enumerate every combination and to assign a relative probability to each one, by multiplying together the probabilities of each choice in turn.@@@@1@27@@danf@17-8-2009 10670290@unknown@formal@none@1@S@The combination with highest probability is then chosen.@@@@1@8@@danf@17-8-2009 10670300@unknown@formal@none@1@S@The European group developed CLAWS, a tagging program that did exactly this, and achieved accuracy in the 93-95% range.@@@@1@19@@danf@17-8-2009 10670310@unknown@formal@none@1@S@It is worth remembering, as [[Eugene Charniak]] points out in ''Statistical techniques for natural language parsing'' [http://www.cs.brown.edu/people/ec/home.html], that merely assigning the most common tag to each known word and the tag "proper noun" to all unknowns, will approach 90% accuracy because many words are unambiguous.@@@@1@45@@danf@17-8-2009 10670320@unknown@formal@none@1@S@CLAWS pioneered the field of HMM-based part of speech tagging, but was quite expensive since it enumerated all possibilities.@@@@1@19@@danf@17-8-2009 10670330@unknown@formal@none@1@S@It sometimes had to resort to backup methods when there were simply too many (the [[Brown Corpus]] contains a case with 17 ambiguous words in a row, and there are words such as "still" that can represent as many as 7 distinct parts of speech).@@@@1@45@@danf@17-8-2009 10670340@unknown@formal@none@1@S@In 1987, [[Steve DeRose]] and [[Ken Church]] independently developed [[dynamic programming]] algorithms to solve the same problem in vastly less time.@@@@1@21@@danf@17-8-2009 10670350@unknown@formal@none@1@S@Their methods were similar to the [[Viterbi algorithm]] known for some time in other fields.@@@@1@15@@danf@17-8-2009 10670360@unknown@formal@none@1@S@DeRose used a table of pairs, while Church used a table of triples and an ingenious method of estimating the values for triples that were rare or nonexistent in the Brown Corpus (actual measurement of triple probabilities would require a much larger corpus).@@@@1@43@@danf@17-8-2009 10670370@unknown@formal@none@1@S@Both methods achieved accuracy over 95%.@@@@1@6@@danf@17-8-2009 10670380@unknown@formal@none@1@S@DeRose's 1990 dissertation at [[Brown University]] included analyses of the specific error types, probabilities, and other related data, and replicated his work for Greek, where it proved similarly effective.@@@@1@29@@danf@17-8-2009 10670390@unknown@formal@none@1@S@These findings were surprisingly disruptive to the field of [[Natural Language Processing]].@@@@1@12@@danf@17-8-2009 10670400@unknown@formal@none@1@S@The accuracy reported was higher than the typical accuracy of very sophisticated algorithms that integrated part of speech choice with many higher levels of linguistic analysis: syntax, morphology, semantics, and so on.@@@@1@32@@danf@17-8-2009 10670410@unknown@formal@none@1@S@CLAWS, DeRose's and Church's methods did fail for some of the known cases where semantics is required, but those proved negligibly rare.@@@@1@22@@danf@17-8-2009 10670420@unknown@formal@none@1@S@This convinced many in the field that part-of-speech tagging could usefully be separated out from the other levels of processing; this in turn simplified the theory and practice of computerized language analysis, and encouraged researchers to find ways to separate out other pieces as well.@@@@1@45@@danf@17-8-2009 10670430@unknown@formal@none@1@S@Markov Models are now the standard method for part-of-speech assignment.@@@@1@10@@danf@17-8-2009 10670440@unknown@formal@none@1@S@The methods already discussed involve working from a pre-existing corpus to learn tag probabilities.@@@@1@14@@danf@17-8-2009 10670450@unknown@formal@none@1@S@It is, however, also possible to [[Bootstrapping (linguistics)|bootstrap]] using "unsupervised" tagging.@@@@1@11@@danf@17-8-2009 10670460@unknown@formal@none@1@S@Unsupervised tagging techniques use an untagged corpus for their training data and produce the tagset by induction.@@@@1@17@@danf@17-8-2009 10670470@unknown@formal@none@1@S@That is, they observe patterns in word use, and derive part-of-speech categories themselves.@@@@1@13@@danf@17-8-2009 10670480@unknown@formal@none@1@S@For example, statistics readily reveal that "the", "a", and "an" occur in similar contexts, while "eat" occurs in very different ones.@@@@1@21@@danf@17-8-2009 10670490@unknown@formal@none@1@S@With sufficient iteration, similarity classes of words emerge that are remarkably similar to those human linguists would expect; and the differences themselves sometimes suggest valuable new insights.@@@@1@27@@danf@17-8-2009 10670500@unknown@formal@none@1@S@These two categories can be further subdivided into rule-based, stochastic, and neural approaches.@@@@1@13@@danf@17-8-2009 10670510@unknown@formal@none@1@S@Some current major algorithms for '''part-of-speech tagging''' include the [[Viterbi algorithm]], [[Brill Tagger]], and the [[Baum-Welch algorithm]] (also known as the forward-backward algorithm).@@@@1@23@@danf@17-8-2009 10670520@unknown@formal@none@1@S@[[Hidden Markov model]] and [[visible Markov model]] taggers can both be implemented using the [[Viterbi algorithm]].@@@@1@16@@danf@17-8-2009 10680010@unknown@formal@none@1@S@
Pattern recognition
@@@@1@2@@danf@17-8-2009 10680020@unknown@formal@none@1@S@'''Pattern recognition''' is a sub-topic of [[machine learning]].@@@@1@8@@danf@17-8-2009 10680030@unknown@formal@none@1@S@It can be defined as@@@@1@5@@danf@17-8-2009 10680040@unknown@formal@none@1@S@:"the act of taking in raw data and taking an action based on the [[Category (taxonomy)|category]] of the data".@@@@1@19@@danf@17-8-2009 10680050@unknown@formal@none@1@S@Most research in pattern recognition is about methods for [[supervised learning]] and [[unsupervised learning]].@@@@1@14@@danf@17-8-2009 10680060@unknown@formal@none@1@S@Pattern recognition aims to classify [[data]] ([[pattern]]s) based on either ''[[A priori and a posteriori (philosophy)|a priori]]'' knowledge or on [[statistics|statistical]] information extracted from the patterns.@@@@1@26@@danf@17-8-2009 10680070@unknown@formal@none@1@S@The patterns to be classified are usually groups of measurements or observations, defining points in an appropriate [[space (mathematics)|multidimensional space]].@@@@1@20@@danf@17-8-2009 10680080@unknown@formal@none@1@S@This is in contrast to '''[[pattern matching]]''', where the pattern is rigidly specified.@@@@1@13@@danf@17-8-2009 10680090@unknown@formal@none@1@S@==Overview==@@@@1@1@@danf@17-8-2009 10680100@unknown@formal@none@1@S@A complete pattern recognition system consists of a [[sensor]] that gathers the observations to be classified or described; a [[feature extraction]] mechanism that computes numeric or symbolic information from the observations; and a [[statistical classification|classification]] or description scheme that does the actual job of classifying or describing observations, relying on the extracted features.@@@@1@53@@danf@17-8-2009 10680110@unknown@formal@none@1@S@The classification or description scheme is usually based on the availability of a set of patterns that have already been classified or described.@@@@1@23@@danf@17-8-2009 10680120@unknown@formal@none@1@S@This set of patterns is termed the [[training set]] and the resulting learning strategy is characterized as [[supervised learning]].@@@@1@19@@danf@17-8-2009 10680130@unknown@formal@none@1@S@Learning can also be [[unsupervised learning|unsupervised]], in the sense that the system is not given an ''a priori'' labeling of patterns, instead it establishes the classes itself based on the statistical regularities of the patterns.@@@@1@35@@danf@17-8-2009 10680140@unknown@formal@none@1@S@The classification or description scheme usually uses one of the following approaches: [[statistical classification|statistical]] (or decision theoretic), [[syntactic pattern recognition|syntactic]] (or structural).@@@@1@22@@danf@17-8-2009 10680150@unknown@formal@none@1@S@Statistical pattern recognition is based on statistical characterisations of patterns, assuming that the patterns are generated by a [[probabilistic]] system.@@@@1@20@@danf@17-8-2009 10680160@unknown@formal@none@1@S@Syntactical (or structural) pattern recognition is based on the structural interrelationships of features.@@@@1@13@@danf@17-8-2009 10680170@unknown@formal@none@1@S@A wide range of algorithms can be applied for pattern recognition, from very simple [[Naive Bayes classifier|Bayesian classifiers]] to much more powerful [[Artificial neural network|neural networks]].@@@@1@26@@danf@17-8-2009 10680180@unknown@formal@none@1@S@An intriguing problem in pattern recognition yet to be solved is the relationship between the problem to be solved (data to be classified) and the performance of various pattern recognition algorithms (classifiers).@@@@1@32@@danf@17-8-2009 10680190@unknown@formal@none@1@S@Pattern recognition is more complex when templates are used to generate variants.@@@@1@12@@danf@17-8-2009 10680200@unknown@formal@none@1@S@For example, in English, sentences often follow the "N-VP" (noun - verb phrase) pattern, but some knowledge of the English language is required to detect the pattern.@@@@1@27@@danf@17-8-2009 10680210@unknown@formal@none@1@S@Pattern recognition is studied in many fields, including [[psychology]], [[ethology]], and [[computer science]].@@@@1@13@@danf@17-8-2009 10680220@unknown@formal@none@1@S@[[Holographic associative memory]] is another type of pattern matching scheme where a target small patterns can be searched from a large set of learned patterns based on cognitive meta-weight.@@@@1@29@@danf@17-8-2009 10680230@unknown@formal@none@1@S@==Uses==@@@@1@1@@danf@17-8-2009 10680240@unknown@formal@none@1@S@Within medical science pattern recognition creates the basis for [[computer-aided diagnosis]] (CAD) systems.@@@@1@13@@danf@17-8-2009 10680250@unknown@formal@none@1@S@CAD describes a procedure that supports the doctor's interpretations and findings.@@@@1@11@@danf@17-8-2009 10680260@unknown@formal@none@1@S@Typical applications are automatic [[speech recognition]], [[document classification|classification of text into several categories]] (e.g. spam/non-spam email messages), the [[handwriting recognition|automatic recognition of handwritten postal codes]] on postal envelopes, or the [[facial recognition system|automatic recognition of images]] of human faces.@@@@1@39@@danf@17-8-2009 10680270@unknown@formal@none@1@S@The last two examples form the subtopic [[image analysis]] of pattern recognition that deals with digital images as input to pattern recognition systems.@@@@1@23@@danf@17-8-2009 10690010@unknown@formal@none@1@S@
Phrase
@@@@1@1@@danf@17-8-2009 10690020@unknown@formal@none@1@S@In [[grammar]], a '''phrase''' is a group of [[word]]s that functions as a single unit in the [[syntax]] of a [[Sentence (linguistics)|sentence]].@@@@1@22@@danf@17-8-2009 10690030@unknown@formal@none@1@S@For example ''the house at the end of the street'' (example 1) is a phrase.@@@@1@15@@danf@17-8-2009 10690040@unknown@formal@none@1@S@It acts like a noun.@@@@1@5@@danf@17-8-2009 10690050@unknown@formal@none@1@S@It contains the phrase ''at the end of the street'' (example 2), a prepositional phrase which acts like an adjective.@@@@1@20@@danf@17-8-2009 10690060@unknown@formal@none@1@S@Example 2 could be replaced by ''white'', to make the phrase ''the white house''.@@@@1@14@@danf@17-8-2009 10690070@unknown@formal@none@1@S@Examples 1 and 2 contain the phrase ''the end of the street'' (example 3) which acts like a noun.@@@@1@19@@danf@17-8-2009 10690080@unknown@formal@none@1@S@It could be replaced by ''the cross-roads'' to give ''the house at the cross-roads''.@@@@1@14@@danf@17-8-2009 10690090@unknown@formal@none@1@S@Most phrases have a or central word which defines the type of phrase.@@@@1@13@@danf@17-8-2009 10690100@unknown@formal@none@1@S@This word is called the [[head (linguistics)|head]] of the phrase.@@@@1@10@@danf@17-8-2009 10690110@unknown@formal@none@1@S@In English the head is often the first word of the phrase.@@@@1@12@@danf@17-8-2009 10690120@unknown@formal@none@1@S@Some phrases, however, can be headless.@@@@1@6@@danf@17-8-2009 10690130@unknown@formal@none@1@S@For example, ''the rich'' is a noun phrase composed of a determiner and an adjective, but no noun.@@@@1@18@@danf@17-8-2009 10690140@unknown@formal@none@1@S@Phrases may be classified by the type of head they take@@@@1@11@@danf@17-8-2009 10690150@unknown@formal@none@1@S@*[[Prepositional phrase]] (PP) with a [[preposition]] as head (e.g. ''in love'', ''over the rainbow'').@@@@1@14@@danf@17-8-2009 10690160@unknown@formal@none@1@S@Languages that use [[postposition]]s instead have [[postpositional phrase]]s.@@@@1@8@@danf@17-8-2009 10690170@unknown@formal@none@1@S@The two types are sometimes commonly referred to as [[adpositional phrase]]s.@@@@1@11@@danf@17-8-2009 10690180@unknown@formal@none@1@S@*[[Noun phrase]] (NP) with a [[noun]] as head (e.g. ''the black cat'', ''a cat on the mat'')@@@@1@17@@danf@17-8-2009 10690190@unknown@formal@none@1@S@*[[Verb phrase]] (VP) with a [[verb]] as head (e.g. ''eat cheese'', ''jump up and down'')@@@@1@15@@danf@17-8-2009 10690200@unknown@formal@none@1@S@*[[Adjectival phrase]] with an [[adjective]] as head (e.g. ''full of toys'')@@@@1@11@@danf@17-8-2009 10690210@unknown@formal@none@1@S@*[[Adverbial phrase]] with [[adverb]] as head (e.g. ''very carefully'')@@@@1@9@@danf@17-8-2009 10690220@unknown@formal@none@1@S@== Formal definition ==@@@@1@4@@danf@17-8-2009 10690230@unknown@formal@none@1@S@A '''phrase''' is a [[syntax|syntactic]] structure which has syntactic properties derived from its [[head (linguistics)|head]].@@@@1@15@@danf@17-8-2009 10690240@unknown@formal@none@1@S@== Complexity ==@@@@1@3@@danf@17-8-2009 10690250@unknown@formal@none@1@S@A complex phrase consists of several words, whereas a simple phrase consists of only one word.@@@@1@16@@danf@17-8-2009 10690260@unknown@formal@none@1@S@This terminology is especially often used with [[verb]] phrases:@@@@1@9@@danf@17-8-2009 10690270@unknown@formal@none@1@S@* simple past and present are simple verb, which require just one verb@@@@1@13@@danf@17-8-2009 10690280@unknown@formal@none@1@S@* complex verb have one or two [[grammatical aspect|aspect]]s added, hence require additional two or three words@@@@1@17@@danf@17-8-2009 10690290@unknown@formal@none@1@S@"Complex", which is phrase-level, is often confused with "[[compound (linguistics)|compound]]", which is [[word]]-level.@@@@1@13@@danf@17-8-2009 10690300@unknown@formal@none@1@S@However, there are certain phenomena that formally seem to be phrases but semantically are more like compounds, like "women's magazines", which has the form of a possessive noun phrase, but which refers (just like a compound) to one specific [[lexeme]] (i.e. a magazine for women and not some magazine owned by a woman).@@@@1@53@@danf@17-8-2009 10690310@unknown@formal@none@1@S@== Semiotic approaches to the concept of "phrase" ==@@@@1@9@@danf@17-8-2009 10690320@unknown@formal@none@1@S@In more [[semiotic]] approaches to language, such as the more cognitivist versions of [[construction grammar]], a phrasal structure is not only a certain formal combination of word types whose features are inherited from the head.@@@@1@35@@danf@17-8-2009 10690330@unknown@formal@none@1@S@Here each phrasal structure also expresses some type of [[concept]]ual content, be it specific or abstract.@@@@1@16@@danf@17-8-2009 10700010@unknown@formal@none@1@S@
Portuguese language
@@@@1@2@@danf@17-8-2009 10700020@unknown@formal@none@1@S@'''Portuguese''' ( or ''língua portuguesa'') is a [[Romance language]] that originated in what is now [[Galicia (Spain)]] and [[Portugal|northern Portugal]] from the [[Latin language|Latin]] spoken by [[Romanization (cultural)|romanized]] [[Pre-Roman peoples of the Iberian Peninsula]] (namely the [[Gallaeci]], the [[Lusitanians]], the [[Celtici]] and the [[Conii]]) about 2000 years ago.@@@@1@48@@danf@17-8-2009 10700030@unknown@formal@none@1@S@It spread worldwide in the 15th and 16th centuries as Portugal established a [[Portuguese Empire|colonial and commercial empire]] (1415–1999) which spanned from [[Brazil]] in the [[Americas]] to [[Goa]] in [[India]] and [[Macau]] in [[China]], in fact it was used exclusively on the island of [[Sri Lanka]] as the [[lingua franca]] for almost 350 years.@@@@1@54@@danf@17-8-2009 10700040@unknown@formal@none@1@S@During that time, many [[Portuguese Creole|creole languages based on Portuguese]] also appeared around the world, especially in [[Africa]], [[Asia]], and the [[Caribbean]].@@@@1@22@@danf@17-8-2009 10700050@unknown@formal@none@1@S@Today it is one of the world's major languages, [[List of languages by number of native speakers|ranked 6th]] according to number of native speakers (approximately 177 million).@@@@1@27@@danf@17-8-2009 10700060@unknown@formal@none@1@S@It is the language with the largest number of speakers in [[South America]], spoken by nearly all of Brazil's population, which amounts to over 51% of the continent's population even though it is the only Portuguese-speaking nation in [[the Americas]].@@@@1@40@@danf@17-8-2009 10700070@unknown@formal@none@1@S@It is also a major lingua franca in Portugal's former colonial possessions in Africa.@@@@1@14@@danf@17-8-2009 10700080@unknown@formal@none@1@S@It is the official language of ten countries (see the table on the right), also being co-official with [[Spanish language|Spanish]] and [[French language|French]] in [[Equatorial Guinea]], with [[Standard Cantonese|Cantonese]] [[Chinese language|Chinese]] in the Chinese special administrative region of [[Macau]], and with [[Tetum]] in [[East Timor]].@@@@1@45@@danf@17-8-2009 10700090@unknown@formal@none@1@S@There are sizable communities of Portuguese-speakers in various regions of North America, notably in the [[United States]] ([[New Jersey]], [[New England]] and south [[Florida]]) and in [[Ontario]], [[Canada]].@@@@1@28@@danf@17-8-2009 10700100@unknown@formal@none@1@S@[[Spain|Spanish]] author [[Miguel de Cervantes]] once called Portuguese "the sweet language", while Brazilian writer [[Olavo Bilac]] poetically described it as ''a última flor do Lácio, inculta e bela'': "the last flower of [[Latium]], wild and beautiful".@@@@1@36@@danf@17-8-2009 10700110@unknown@formal@none@1@S@==Geographic distribution==@@@@1@2@@danf@17-8-2009 10700120@unknown@formal@none@1@S@Today, Portuguese is the [[official language]] of [[Angola]], [[Brazil]], [[Cape Verde]], [[Guinea-Bissau]], [[Portugal]], [[São Tomé and Príncipe]] and [[Mozambique]].@@@@1@19@@danf@17-8-2009 10700130@unknown@formal@none@1@S@It is also one of the official languages of [[Equatorial Guinea]] (with [[Spanish language|Spanish]] and [[French language|French]]), the [[Special Administrative Region of the People's Republic of China|Chinese special administrative region]] of [[Macau]] (with [[Chinese language|Chinese]]), and [[East Timor]], (with [[Tetum]]).@@@@1@40@@danf@17-8-2009 10700140@unknown@formal@none@1@S@It is a [[First language|native language]] of most of the population in Portugal (100%), Brazil (99%), Angola (60%), and São Tomé and Príncipe (50%), and it is spoken by a [[plurality]] of the population of Mozambique (40%), though only 6.5% are native speakers.@@@@1@43@@danf@17-8-2009 10700150@unknown@formal@none@1@S@No data is available for Cape Verde, but almost all the population is bilingual, and the monolingual population speaks [[Cape Verdean Creole]].@@@@1@22@@danf@17-8-2009 10700160@unknown@formal@none@1@S@Small Portuguese-speaking communities subsist in former overseas colonies of Portugal such as Macau, where it is spoken as a first language by 0.6% of the population and East Timor.@@@@1@29@@danf@17-8-2009 10700170@unknown@formal@none@1@S@[[Uruguay]] gave Portuguese an equal status to Spanish in its educational system at the north border with Brazil.@@@@1@18@@danf@17-8-2009 10700180@unknown@formal@none@1@S@In the rest of the country, it's taught as an obligatory subject beginning by the 6th grade.@@@@1@17@@danf@17-8-2009 10700190@unknown@formal@none@1@S@It is also spoken by substantial immigrant communities, though not official, in [[Andorra]], [[France]], [[Luxembourg]], [[Jersey]] (with a statistically significant Portuguese-speaking community of approximately 10,000 people), [[Paraguay]], [[Namibia]], [[South Africa]], [[Switzerland]], [[Venezuela]] and in the [[U.S.]] states of [[California]], [[Connecticut]], [[Florida]], [[Massachusetts]], [[New Jersey]], [[New York]] and [[Rhode Island]].@@@@1@49@@danf@17-8-2009 10700200@unknown@formal@none@1@S@In some parts of India, such as [[Goa]] and [[Daman and Diu]] Portuguese is still spoken.@@@@1@16@@danf@17-8-2009 10700210@unknown@formal@none@1@S@There are also significant populations of Portuguese speakers in [[Canada]] (mainly concentrated in and around [[Toronto]]) [[Bermuda]] and [[Netherlands Antilles]].@@@@1@20@@danf@17-8-2009 10700220@unknown@formal@none@1@S@Portuguese is an official language of several international organizations.@@@@1@9@@danf@17-8-2009 10700230@unknown@formal@none@1@S@The [[Community of Portuguese Language Countries]] (with the Portuguese acronym CPLP) consists of the eight independent countries that have Portuguese as an official language.@@@@1@24@@danf@17-8-2009 10700240@unknown@formal@none@1@S@It is also an official language of the [[European Union]], [[Mercosul]], the [[Organization of American States]], the [[Organization of Ibero-American States]], the [[Union of South American Nations]], and the [[African Union]] (one of the working languages) and one of the official languages of other organizations.@@@@1@45@@danf@17-8-2009 10700250@unknown@formal@none@1@S@The Portuguese language is gaining popularity in Africa, Asia, and South America as a second language for study.@@@@1@18@@danf@17-8-2009 10700260@unknown@formal@none@1@S@Portuguese and Spanish are the fastest-growing European languages, and, according to estimates by UNESCO, Portuguese is the language with the highest potential for growth as an international language in southern Africa and South America.@@@@1@34@@danf@17-8-2009 10700270@unknown@formal@none@1@S@The Portuguese-speaking African countries are expected to have a combined population of 83 million by 2050.@@@@1@16@@danf@17-8-2009 10700280@unknown@formal@none@1@S@Since 1991, when Brazil signed into the economic market of Mercosul with other South American nations, such as Argentina, Uruguay, and Paraguay, there has been an increase in interest in the study of Portuguese in those South American countries.@@@@1@39@@danf@17-8-2009 10700290@unknown@formal@none@1@S@The demographic weight of Brazil in the continent will continue to strengthen the presence of the language in the region.@@@@1@20@@danf@17-8-2009 10700300@unknown@formal@none@1@S@Although in the early 21st century, after Macau was ceded to China in 1999, the use of Portuguese was in decline in Asia, it is becoming a language of opportunity there; mostly because of East Timor's boost in the number of speakers in the last five years but also because of increased Chinese diplomatic and financial ties with Portuguese-speaking countries.@@@@1@60@@danf@17-8-2009 10700310@unknown@formal@none@1@S@In July 2007, President Teodoro Obiang Nguema announced his government's decision to make Portuguese [[Equatorial Guinea]]'s third official language, in order to meet the requirements to apply for full membership of the [[Community of Portuguese Language Countries]].@@@@1@37@@danf@17-8-2009 10700320@unknown@formal@none@1@S@This upgrading from its current Associate Observer condition would result in Equatorial Guinea being able to access several professional and academic exchange programs and the facilitation of cross-border circulation of citizens.@@@@1@31@@danf@17-8-2009 10700330@unknown@formal@none@1@S@Its application is currently being assessed by other CPLP members.@@@@1@10@@danf@17-8-2009 10700340@unknown@formal@none@1@S@In March 1994 the [[Bosque de Portugal]] (Portugal's Woods) was founded in the Brazilian city of [[Curitiba]].@@@@1@17@@danf@17-8-2009 10700350@unknown@formal@none@1@S@The park houses the Portuguese Language Memorial, which honors the Portuguese immigrants and the countries that adopted the Portuguese language.@@@@1@20@@danf@17-8-2009 10700360@unknown@formal@none@1@S@Originally there were seven nations represented with pillars, but the independence of [[East Timor]] brought yet another pillar for that nation in 2007.@@@@1@23@@danf@17-8-2009 10700370@unknown@formal@none@1@S@In March 2006, the [[Museum of the Portuguese Language]], an interactive museum about the Portuguese language, was founded in [[São Paulo]], Brazil, the city with the largest number of Portuguese speakers in the world.@@@@1@34@@danf@17-8-2009 10700380@unknown@formal@none@1@S@==Dialects==@@@@1@1@@danf@17-8-2009 10700390@unknown@formal@none@1@S@Portuguese is a [[pluricentric language]] with two main groups of [[dialect]]s, those of [[Brazil]] and those of the [[Old World]].@@@@1@20@@danf@17-8-2009 10700400@unknown@formal@none@1@S@For historical reasons, the dialects of Africa and Asia are generally closer to those of Portugal than the Brazilian dialects, although in some aspects of their phonetics, especially the pronunciation of unstressed vowels, they resemble [[Brazilian Portuguese]] more than [[European Portuguese]].@@@@1@41@@danf@17-8-2009 10700410@unknown@formal@none@1@S@They have not been studied as widely as European and Brazilian Portuguese.@@@@1@12@@danf@17-8-2009 10700420@unknown@formal@none@1@S@Audio samples of some dialects of Portuguese are available below.@@@@1@10@@danf@17-8-2009 10700430@unknown@formal@none@1@S@There are some differences between the areas but these are the best approximations possible.@@@@1@14@@danf@17-8-2009 10700440@unknown@formal@none@1@S@For example, the ''caipira'' dialect has some differences from the one of Minas Gerais, but in general it is very close.@@@@1@21@@danf@17-8-2009 10700450@unknown@formal@none@1@S@A good example of Brazilian Portuguese may be found in the capital city, [[Brasília]], because of the generalized population from all parts of the country.@@@@1@25@@danf@17-8-2009 10700460@unknown@formal@none@1@S@'''[[Angola]]'''@@@@1@1@@danf@17-8-2009 10700470@unknown@formal@none@1@S@# ''Benguelense'' — [[Benguela]] province.@@@@1@5@@danf@17-8-2009 10700480@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som85.html ''Luandense''] — [[Luanda]] province.@@@@1@6@@danf@17-8-2009 10700490@unknown@formal@none@1@S@# ''Sulista'' — South of Angola.@@@@1@6@@danf@17-8-2009 10700500@unknown@formal@none@1@S@'''[[Brazil]]'''@@@@1@1@@danf@17-8-2009 10700510@unknown@formal@none@1@S@# ''[[Caipira]]'' — States of [[São Paulo (state)|São Paulo]] (countryside; the city of São Paulo and the eastern areas of the state have their own dialect, called ''paulistano''); southern [[Minas Gerais]], northern [[Paraná (state)|Paraná]], [[Goiás]] and [[Mato Grosso do Sul]].@@@@1@40@@danf@17-8-2009 10700520@unknown@formal@none@1@S@# ''Cearense'' — [[Ceará]].@@@@1@4@@danf@17-8-2009 10700530@unknown@formal@none@1@S@# ''Baiano'' — [[Bahia]].@@@@1@4@@danf@17-8-2009 10700540@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som90.html ''Fluminense''] — Variants spoken in the states of [[Rio de Janeiro (state)|Rio de Janeiro]] and [[Espírito Santo]] (excluding the city of Rio de Janeiro and its adjacent metropolitan areas, which have their own dialect, called ''[[carioca]]'').@@@@1@38@@danf@17-8-2009 10700550@unknown@formal@none@1@S@# ''[[Gaucho|Gaúcho]]'' — [[Rio Grande do Sul]].@@@@1@7@@danf@17-8-2009 10700560@unknown@formal@none@1@S@(There are many distinct accents in Rio Grande do Sul, mainly due to the heavy influx of European immigrants of diverse origins, those which have settled several colonies throughout the state.)@@@@1@31@@danf@17-8-2009 10700570@unknown@formal@none@1@S@# ''[[Mineiro]]'' — [[Minas Gerais]] (not prevalent in the [[Triângulo Mineiro]], southern and southeastern [[Minas Gerais]]).@@@@1@16@@danf@17-8-2009 10700580@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som91.html ''Nordestino''] — [[Northeast Region, Brazil|northeastern states of Brazil]] ([[Pernambuco]] and [[Rio Grande do Norte]] have a particular way of speaking).@@@@1@22@@danf@17-8-2009 10700590@unknown@formal@none@1@S@# ''Nortista'' — [[Amazon Basin]] states.@@@@1@6@@danf@17-8-2009 10700600@unknown@formal@none@1@S@# ''Paulistano'' — Variants spoken around [[São Paulo]] city and the eastern areas of São Paulo state.@@@@1@17@@danf@17-8-2009 10700610@unknown@formal@none@1@S@# ''Sertanejo'' — States of [[Goiás]] and [[Mato Grosso]] (the city of [[Cuiabá]] has a particular way of speaking).@@@@1@19@@danf@17-8-2009 10700620@unknown@formal@none@1@S@# ''Sulista'' — Variants spoken in the areas between the northern regions of [[Rio Grande do Sul]] and southern regions of São Paulo state.@@@@1@24@@danf@17-8-2009 10700630@unknown@formal@none@1@S@(The cities of [[Curitiba]], [[Florianópolis]], and [[Itapetininga]] have fairly distinct accents as well.)@@@@1@13@@danf@17-8-2009 10700640@unknown@formal@none@1@S@'''[[Portugal]]'''@@@@1@1@@danf@17-8-2009 10700650@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som69.html ''Açoriano''] (Azorean) — [[Azores]].@@@@1@6@@danf@17-8-2009 10700660@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som40.html ''Alentejano''] — [[Alentejo]]@@@@1@5@@danf@17-8-2009 10700670@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som44.html ''Algarvio''] — [[Algarve]] (there is a particular dialect in a small part of western Algarve).@@@@1@17@@danf@17-8-2009 10700680@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som1.html ''Alto-Minhoto''] — North of [[Braga]] (hinterland).@@@@1@8@@danf@17-8-2009 10700690@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som49.html ''Baixo-Beirão''; ''Alto-Alentejano''] — Central Portugal (hinterland).@@@@1@8@@danf@17-8-2009 10700700@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som9.html ''Beirão''] — Central Portugal.@@@@1@6@@danf@17-8-2009 10700710@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som22.html ''Estremenho''] — Regions of [[Coimbra]] and [[Lisbon]] (the Lisbon dialect has some peculiar features not shared with the one of Coimbra).@@@@1@23@@danf@17-8-2009 10700720@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som60.html ''Madeirense''] (Madeiran) — [[Madeira]].@@@@1@6@@danf@17-8-2009 10700730@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som14.html ''Nortenho''] — Regions of Braga and [[Porto]].@@@@1@9@@danf@17-8-2009 10700740@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som6.html ''Transmontano''] — [[Trás-os-Montes e Alto Douro]].@@@@1@8@@danf@17-8-2009 10700750@unknown@formal@none@1@S@Other countries@@@@1@2@@danf@17-8-2009 10700760@unknown@formal@none@1@S@* '''[[Cape Verde]]''' — [http://www.instituto-camoes.pt/cvc/hlp/geografia/som87.html ''Português cabo-verdiano''] ([[Cape Verdean Portuguese]])@@@@1@10@@danf@17-8-2009 10700770@unknown@formal@none@1@S@* '''[[Daman and Diu]]''', India — ''Damaense''.@@@@1@7@@danf@17-8-2009 10700780@unknown@formal@none@1@S@* '''[[East Timor]]''' — [http://www.instituto-camoes.pt/cvc/hlp/geografia/som84.html ''Timorense''] ([[East Timorese Portuguese|East Timorese]])@@@@1@10@@danf@17-8-2009 10700790@unknown@formal@none@1@S@* '''[[Goa]]''', India — ''Goês''.@@@@1@5@@danf@17-8-2009 10700800@unknown@formal@none@1@S@* '''[[Guinea-Bissau]]''' — [http://www.instituto-camoes.pt/cvc/hlp/geografia/som88.html ''Guineense''] ([[Guinean Portuguese]]).@@@@1@7@@danf@17-8-2009 10700810@unknown@formal@none@1@S@* '''[[Macau]]''', China — [http://www.instituto-camoes.pt/cvc/hlp/geografia/som92.html ''Macaense''] ([[Macanese Portuguese|Macanese]])@@@@1@8@@danf@17-8-2009 10700820@unknown@formal@none@1@S@* '''[[Mozambique]]''' — [http://www.instituto-camoes.pt/cvc/hlp/geografia/som89.html ''Moçambicano''] ([[Mozambican Portuguese|Mozambican]])@@@@1@7@@danf@17-8-2009 10700830@unknown@formal@none@1@S@* '''[[São Tomé and Príncipe]]''' — [http://www.instituto-camoes.pt/cvc/hlp/geografia/som83.html ''Santomense'']@@@@1@8@@danf@17-8-2009 10700840@unknown@formal@none@1@S@* '''[[Uruguay]]''' — [[Riverense Portuñol language|''Dialectos Portugueses del Uruguay (DPU)'']].@@@@1@10@@danf@17-8-2009 10700850@unknown@formal@none@1@S@Differences between dialects are mostly of [[accent (linguistics)|accent]] and [[vocabulary]], but between the Brazilian dialects and other dialects, especially in their most coloquial forms, there can also be some grammatical differences.@@@@1@31@@danf@17-8-2009 10700860@unknown@formal@none@1@S@The [[Portuguese creole|Portuguese-based creole]]s spoken in various parts of Africa, Asia, and the Americas are independent languages which should not be confused with Portuguese itself.@@@@1@25@@danf@17-8-2009 10700870@unknown@formal@none@1@S@==History==@@@@1@1@@danf@17-8-2009 10700880@unknown@formal@none@1@S@Arriving in the Iberian Peninsula in 216 BC, the Romans brought with them the [[Latin language]], from which all Romance languages descend.@@@@1@22@@danf@17-8-2009 10700890@unknown@formal@none@1@S@The language was spread by arriving Roman soldiers, settlers and merchants, who built Roman cities mostly near the settlements of previous civilizations.@@@@1@22@@danf@17-8-2009 10700900@unknown@formal@none@1@S@Between AD 409 and 711, as the Roman Empire collapsed in Western Europe, the Iberian Peninsula was conquered by Germanic peoples ([[Migration Period]]).@@@@1@23@@danf@17-8-2009 10700910@unknown@formal@none@1@S@The occupiers, mainly [[Suebi]] and [[Visigoths]], quickly adopted late Roman culture and the [[Vulgar Latin]] dialects of the peninsula.@@@@1@19@@danf@17-8-2009 10700920@unknown@formal@none@1@S@After the [[Moors|Moorish]] invasion of 711, [[Arabic language|Arabic]] became the administrative language in the conquered regions, but most of the population continued to speak a form of [[Romance languages|Romance]] commonly known as [[Mozarabic]].@@@@1@33@@danf@17-8-2009 10700930@unknown@formal@none@1@S@The influence exerted by Arabic on the Romance dialects spoken in the Christian kingdoms of the north was small, affecting mainly their lexicon.@@@@1@23@@danf@17-8-2009 10700940@unknown@formal@none@1@S@The earliest surviving records of a distinctively Portuguese language are administrative documents of the 9th century, still interspersed with many Latin phrases.@@@@1@22@@danf@17-8-2009 10700950@unknown@formal@none@1@S@Today this phase is known as Proto-Portuguese (between the 9th and the 12th centuries).@@@@1@14@@danf@17-8-2009 10700960@unknown@formal@none@1@S@In the first period of Old Portuguese — [[Galician-Portuguese]] Period (from the 12th to the 14th century) — the language gradually came into general use.@@@@1@25@@danf@17-8-2009 10700970@unknown@formal@none@1@S@For some time, it was the language of preference for [[lyric poetry]] in Christian Hispania, much like [[Occitan]] was the language of the [[Occitan literature#Poetry_of_the_troubadours|poetry of the troubadours]].@@@@1@28@@danf@17-8-2009 10700980@unknown@formal@none@1@S@Portugal was formally recognized as an independent kingdom by the [[Kingdom of Leon]] in 1143, with [[Afonso I of Portugal|Afonso Henriques]] as king.@@@@1@23@@danf@17-8-2009 10700990@unknown@formal@none@1@S@In 1290, king [[Denis of Portugal|Dinis]] created the first Portuguese university in Lisbon (the ''Estudos Gerais'', later moved to [[Coimbra]]) and decreed that Portuguese, then simply called the "common language" should be known as the Portuguese language and used officially.@@@@1@40@@danf@17-8-2009 10701000@unknown@formal@none@1@S@In the second period of Old Portuguese, from the 14th to the 16th century, with the [[Age of discovery|Portuguese discoveries]], the language was taken to many regions of [[Asia]], [[Africa]] and the [[Americas]] (nowadays, the great majority of Portuguese speakers live in Brazil, in South America).@@@@1@46@@danf@17-8-2009 10701010@unknown@formal@none@1@S@By the 16th century it had become a ''[[lingua franca]]'' in Asia and Africa, used not only for colonial administration and trade but also for communication between local officials and Europeans of all nationalities.@@@@1@34@@danf@17-8-2009 10701020@unknown@formal@none@1@S@Its spread was helped by mixed marriages between Portuguese and local people, and by its association with [[Roman Catholic]] [[missionary]] efforts, which led to the formation of a [[creole language]] called [[Kristang language|Kristang]] in many parts of Asia (from the word ''cristão'', "Christian").@@@@1@43@@danf@17-8-2009 10701030@unknown@formal@none@1@S@The language continued to be popular in parts of Asia until the 19th century.@@@@1@14@@danf@17-8-2009 10701040@unknown@formal@none@1@S@Some Portuguese-speaking Christian communities in [[India]], [[Sri Lanka]], [[Malaysia]], and [[Indonesia]] preserved their language even after they were isolated from Portugal.@@@@1@21@@danf@17-8-2009 10701050@unknown@formal@none@1@S@The end of the Old Portuguese period was marked by the publication of the ''Cancioneiro Geral'' by [[Garcia de Resende]], in 1516.@@@@1@22@@danf@17-8-2009 10701060@unknown@formal@none@1@S@The early times of Modern Portuguese, which spans from the 16th century to present day, were characterized by an increase in the number of learned words borrowed from Classical Latin and Classical Greek since the Renaissance, which greatly enriched the lexicon.@@@@1@41@@danf@17-8-2009 10701070@unknown@formal@none@1@S@===Characterization===@@@@1@1@@danf@17-8-2009 10701080@unknown@formal@none@1@S@A distinctive feature of Portuguese is that it preserved the stressed vowels of [[Vulgar Latin]], which became diphthongs in other Romance languages; cf. Fr. ''pierre'', Sp. ''piedra'', It. ''pietra'', Port. ''pedra'', from Lat. ''petra''; or Sp. ''fuego'', It. ''fuoco'', Port. ''fogo'', from Lat. ''focum''.@@@@1@44@@danf@17-8-2009 10701090@unknown@formal@none@1@S@Another characteristic of early Portuguese was the loss of [[:wiktionary:intervocalic|intervocalic]] ''l'' and ''n'', sometimes followed by the merger of the two surrounding vowels, or by the insertion of an [[epenthesis|epenthetic vowel]] between them: cf. Lat. ''salire'', ''tenere'', ''catena'', Sp. ''salir'', ''tener'', ''cadena'', Port. ''sair'', ''ter'', ''cadeia''.@@@@1@46@@danf@17-8-2009 10701100@unknown@formal@none@1@S@When the [[elision|elided]] consonant was ''n'', it often [[nasalization|nasalized]] the preceding vowel: cf. Lat. ''manum'', ''rana'', ''bonum'', Port. ''mão'', ''rãa'', ''bõo'' (now ''mão'', ''rã'', ''bom'').@@@@1@25@@danf@17-8-2009 10701110@unknown@formal@none@1@S@This process was the source of most of the nasal diphthongs which are typical of Portuguese.@@@@1@16@@danf@17-8-2009 10701120@unknown@formal@none@1@S@In particular, the Latin endings ''-anem'', ''-anum'' and ''-onem'' became ''-ão'' in most cases, cf. Lat. ''canem'', ''germanum'', ''rationem'' with Modern Port. ''cão'', ''irmão'', ''razão'', and their plurals ''-anes'', ''-anos'', ''-ones'' normally became ''-ães'', ''-ãos'', ''-ões'', cf. ''cães'', ''irmãos'', ''razões''.@@@@1@40@@danf@17-8-2009 10701130@unknown@formal@none@1@S@===Movement to make Portuguese an official language of the UN===@@@@1@10@@danf@17-8-2009 10701140@unknown@formal@none@1@S@There is a growing number of people in the Portuguese speaking media and the internet who are presenting the case to the CPLP and other organizations to run a debate in the [[Lusophone]] community with the purpose of bringing forward a petition to make Portuguese an official language of the United Nations.@@@@1@52@@danf@17-8-2009 10701150@unknown@formal@none@1@S@In October 2005, during the international Convention of the [http://www.elosinternacional.com.br/index.htm Elos Club International ] that took place in Tavira, Portugal a petition was written and unanimously approved whose text can be found on the internet with the title ''Petição Para Tornar Oficial o Idioma Português na ONU''.@@@@1@47@@danf@17-8-2009 10701160@unknown@formal@none@1@S@Romulo Alexandre Soares, president of the Brazil-Portugal Chamber highlights that the positioning of Brazil in the international arena as one of the emergent powers of the 21 century, the size of its population, and the presence of the language around the world provides legitimacy and justifies a petition to the UN to make the Portuguese an official language at the UN.@@@@1@61@@danf@17-8-2009 10701170@unknown@formal@none@1@S@==Vocabulary==@@@@1@1@@danf@17-8-2009 10701180@unknown@formal@none@1@S@Most of the lexicon of Portuguese is derived from Latin.@@@@1@10@@danf@17-8-2009 10701190@unknown@formal@none@1@S@Nevertheless, because of the [[Moors|Moorish]] occupation of the [[Iberian Peninsula]] during the Middle Ages, and the participation of Portugal in the [[Age of Discovery]], it has adopted loanwords from all over the world.@@@@1@33@@danf@17-8-2009 10701200@unknown@formal@none@1@S@Very few Portuguese words can be traced to the [[Pre-Roman peoples of the Iberian Peninsula|pre-Roman inhabitants of Portugal]], which included the [[Gallaeci]], [[Lusitanians]], [[Celtici]] and [[Cynetes]].@@@@1@26@@danf@17-8-2009 10701210@unknown@formal@none@1@S@The [[Phoenicians]] and [[Carthaginians]], briefly present, also left some scarce traces.@@@@1@11@@danf@17-8-2009 10701220@unknown@formal@none@1@S@Some notable examples are ''abóbora'' "pumpkin" and ''bezerro'' "year-old calf", from the nearby [[Celtiberian language]] (probably through the Celtici); ''cerveja'' "beer", from [[Celtic languages|Celtic]]; ''saco'' "bag", from [[Phoenician language|Phoenician]]; and ''cachorro'' "dog, puppy", from [[Basque language|Basque]].@@@@1@36@@danf@17-8-2009 10701230@unknown@formal@none@1@S@In the 5th century, the Iberian Peninsula (the [[Ancient Rome|Roman]] [[Hispania]]) was conquered by the [[Germanic peoples|Germanic]] [[Suevi]] and [[Visigoths]].@@@@1@20@@danf@17-8-2009 10701240@unknown@formal@none@1@S@As they adopted the Roman civilization and language, however, these people contributed only a few words to the lexicon, mostly related to warfare — such as ''espora'' "spur", ''estaca'' "stake", and ''guerra'' "war", from [[Gothic language|Gothic]] ''*spaúra'', ''*stakka'', and ''*wirro'', respectively.@@@@1@41@@danf@17-8-2009 10701250@unknown@formal@none@1@S@Between the 9th and 15th centuries Portuguese acquired about 1000 words from [[Arabic language|Arabic]] by influence of [[al-Andalus|Moorish Iberia]].@@@@1@19@@danf@17-8-2009 10701260@unknown@formal@none@1@S@They are often recognizable by the initial Arabic article ''a''(''l'')''-'', and include many common words such as ''aldeia'' "village" from الضيعة ''aldaya'', ''alface'' "lettuce" from الخس ''alkhass'', ''armazém'' "warehouse" from المخزن ''almahazan'', and ''azeite'' "olive oil" from زيت ''azzait''.@@@@1@39@@danf@17-8-2009 10701270@unknown@formal@none@1@S@From Arabic came also the grammatically peculiar word [[Insha'Allah|''oxalá'']] "hopefully".@@@@1@10@@danf@17-8-2009 10701280@unknown@formal@none@1@S@The Mozambican currency name [[Mozambican Metical|''metical'']] was derived from the word مطقال ''miṭqāl'', a unit of weight.@@@@1@17@@danf@17-8-2009 10701290@unknown@formal@none@1@S@The word Mozambique itself is from the Arabic name of sultan Muça Alebique (Musa Alibiki).@@@@1@15@@danf@17-8-2009 10701300@unknown@formal@none@1@S@The name of the Portuguese town of [[Fátima, Portugal|Fátima]] comes from the name of one of the daughters of the prophet [[Muhammad]].@@@@1@22@@danf@17-8-2009 10701310@unknown@formal@none@1@S@Starting in the 15th century, the Portuguese maritime explorations led to the introduction of many loanwords from [[Asia]]n languages.@@@@1@19@@danf@17-8-2009 10701320@unknown@formal@none@1@S@For instance, ''catana'' "cutlass" from Japanese ''katana''; ''corja'' "rabble" from Malay ''kórchchu''; and ''chá'' "tea" from [[Chinese language|Chinese]] ''[[Tea#The word tea|''chá'']]''.@@@@1@21@@danf@17-8-2009 10701330@unknown@formal@none@1@S@From South America came ''batata'' "[[potato]]", from [[Taino]]; ''ananás'' and ''abacaxi'', from [[Tupi-Guarani]] ''naná'' and [[Tupi language|Tupi]] ''ibá cati'', respectively (two species of [[pineapple]]), and ''tucano'' "[[toucan]]" from [[Guarani language|Guarani]] ''tucan''.@@@@1@31@@danf@17-8-2009 10701340@unknown@formal@none@1@S@See [[List of Brazil state name etymologies]], for some more examples.@@@@1@11@@danf@17-8-2009 10701350@unknown@formal@none@1@S@From the 16th to the 19th century, the role of Portugal as intermediary in the [[Atlantic slave trade]], with the establishment of large Portuguese colonies in Angola, Mozambique, and Brazil, Portuguese got several words of African and [[indigenous peoples of Brazil|Amerind]] origin, especially names for most of the animals and plants found in those territories.@@@@1@55@@danf@17-8-2009 10701360@unknown@formal@none@1@S@While those terms are mostly used in the former colonies, many became current in European Portuguese as well.@@@@1@18@@danf@17-8-2009 10701370@unknown@formal@none@1@S@From [[Kimbundu language|Kimbundu]], for example, came ''kifumate'' → ''cafuné'' "head caress", ''kusula'' → ''caçula'' "youngest child", ''marimbondo'' "tropical wasp", and ''kubungula'' → ''bungular'' "to dance like a wizard".@@@@1@28@@danf@17-8-2009 10701380@unknown@formal@none@1@S@Finally, it has received a steady influx of loanwords from other European languages.@@@@1@13@@danf@17-8-2009 10701390@unknown@formal@none@1@S@For example, ''melena'' "hair lock", ''fiambre'' "wet-cured ham" (in contrast with ''presunto'' "dry-cured ham" from Latin ''prae-exsuctus'' "dehydrated"), and ''castelhano'' "Castilian", from Spanish; ''colchete''/''crochê'' "bracket"/"crochet", ''paletó'' "jacket", ''batom'' "lipstick", and ''filé''/''filete'' "steak"/"slice" respectively, from French ''crochet'', ''paletot'', ''bâton'', ''filet''; ''macarrão'' "pasta", ''piloto'' "pilot", ''carroça'' "carriage", and ''barraca'' "barrack", from Italian ''maccherone'', ''pilota'', ''carrozza'', ''baracca''; and ''bife'' "steak", ''futebol'', ''revólver'', ''estoque'', ''folclore'', from English ''beef'', ''football'', ''revolver'', ''stock'', ''folklore''.@@@@1@68@@danf@17-8-2009 10701400@unknown@formal@none@1@S@==Classification and related languages==@@@@1@4@@danf@17-8-2009 10701410@unknown@formal@none@1@S@Portuguese belongs to the [[West Iberian languages|West Iberian]] branch of the [[Romance language]]s, and it has special ties with the following members of this group:@@@@1@25@@danf@17-8-2009 10701420@unknown@formal@none@1@S@* [[Galician language|Galician]] and the [[Fala language|Fala]], its closest relatives.@@@@1@10@@danf@17-8-2009 10701430@unknown@formal@none@1@S@See below.@@@@1@2@@danf@17-8-2009 10701440@unknown@formal@none@1@S@* [[Spanish language|Spanish]], the major language closest to Portuguese.@@@@1@9@@danf@17-8-2009 10701450@unknown@formal@none@1@S@(See also [[Differences between Spanish and Portuguese]].)@@@@1@7@@danf@17-8-2009 10701460@unknown@formal@none@1@S@* [[Mirandese language|Mirandese]], another West Iberian language spoken in Portugal.@@@@1@10@@danf@17-8-2009 10701470@unknown@formal@none@1@S@* [[Judeo-Portuguese]] and [[Ladino language|Judeo-Spanish]], languages spoken by [[Sephardic Jew]]s, which remained close to Portuguese and Spanish.@@@@1@17@@danf@17-8-2009 10701480@unknown@formal@none@1@S@Despite the obvious lexical and grammatical similarities between Portuguese and other Romance languages, it is not [[mutually intelligible]] with most of them.@@@@1@22@@danf@17-8-2009 10701490@unknown@formal@none@1@S@Apart from Galician, Portuguese speakers will usually need some formal study of basic grammar and vocabulary, before attaining a reasonable level of comprehension of those languages, and vice-versa.@@@@1@28@@danf@17-8-2009 10701500@unknown@formal@none@1@S@===Galician and the Fala===@@@@1@4@@danf@17-8-2009 10701510@unknown@formal@none@1@S@The closest language to Portuguese is Galician, spoken in the autonomous community of Galicia (northwestern Spain).@@@@1@16@@danf@17-8-2009 10701520@unknown@formal@none@1@S@The two were at one time a single language, known today as [[Galician-Portuguese]], but since the political separation of Portugal from Galicia they have diverged somewhat, especially in pronunciation and vocabulary.@@@@1@31@@danf@17-8-2009 10701530@unknown@formal@none@1@S@Nevertheless, the core vocabulary and grammar of Galician are still noticeably closer to Portuguese than to Spanish.@@@@1@17@@danf@17-8-2009 10701540@unknown@formal@none@1@S@In particular, like Portuguese, it uses the future subjunctive, the personal infinitive, and the synthetic pluperfect (see the section on the grammar of Portuguese, below).@@@@1@25@@danf@17-8-2009 10701550@unknown@formal@none@1@S@Mutual intelligibility (estimated at 85% by R. A. Hall, Jr., 1989) is good between Galicians and northern Portuguese, but poorer between Galicians and speakers from central Portugal.@@@@1@27@@danf@17-8-2009 10701560@unknown@formal@none@1@S@The Fala language is another descendant of Galician-Portuguese, spoken by a small number of people in the Spanish towns of Valverdi du Fresnu, As Ellas and Sa Martín de Trebellu (autonomous community of [[Extremadura]], near the border with Portugal).@@@@1@39@@danf@17-8-2009 10701570@unknown@formal@none@1@S@===Influence on other languages===@@@@1@4@@danf@17-8-2009 10701580@unknown@formal@none@1@S@Many languages have [[loanword|borrowed words]] from Portuguese, such as [[Bahasa Indonesia|Indonesian]], [[Sri Lanka]]n [[Sri Lanka Tamils (native)|Tamil]] and [[Sinhalese language|Sinhalese]] (see [[Sri Lanka Indo-Portuguese language|Sri Lanka Indo-Portuguese]]), [[Malay language|Malay]], [[Bengali language|Bengali]], [[English (language)|English]], [[Hindi]], [[Konkani language|Konkani]], [[Marathi language|Marathi]], [[Tetum language|Tetum]], [[Tsonga language|Xitsonga]], [[Papiamentu]], [[Japanese language|Japanese]], [[Barbadian|Bajan Creole]] (Spoken in Barbados), [[Lanc-Patuá]] (spoken in northern Brazil) and [[Sranan Tongo]] (spoken in Suriname).@@@@1@61@@danf@17-8-2009 10701590@unknown@formal@none@1@S@It left a strong influence on the ''[[Old Tupi|língua brasílica]]'', a [[Tupi-Guarani|Tupi-Guarani language]] which was the most widely spoken in [[Brazil]] until the 18th century, and on the language spoken around [[Sikka]] in [[Flores|Flores Island]], [[Indonesia]].@@@@1@36@@danf@17-8-2009 10701600@unknown@formal@none@1@S@In nearby [[Larantuka]], Portuguese is used for prayers in [[Holy Week]] rituals.@@@@1@12@@danf@17-8-2009 10701610@unknown@formal@none@1@S@The Japanese-Portuguese dictionary ''[[Nippo Jisho]]'' (1603) was the first dictionary of Japanese in a European language, a product of [[Society of Jesus|Jesuit]] missionary activity in [[Japan]].@@@@1@26@@danf@17-8-2009 10701620@unknown@formal@none@1@S@Building on the work of earlier Portuguese missionaries, the ''Dictionarium Anamiticum, Lusitanum et Latinum'' (Annamite-Portuguese-Latin dictionary) of [[Alexandre de Rhodes]] (1651) introduced the modern [[Vietnamese alphabet|orthography of Vietnamese]], which is based on the orthography of 17th-century Portuguese.@@@@1@37@@danf@17-8-2009 10701630@unknown@formal@none@1@S@The [[Romanization]] of [[Chinese language|Chinese]] was also influenced by the Portuguese language (among others), particularly regarding [[List of common Chinese surnames|Chinese surnames]]; one example is ''Mei''.@@@@1@26@@danf@17-8-2009 10701640@unknown@formal@none@1@S@See also [[List of English words of Portuguese origin]], [[Loan words in Indonesian]], [[Japanese words of Portuguese origin]], [[Malay_language#Borrowed_words|Borrowed words in Malay]], [[Sinhala words of Portuguese origin]], [[Loan words in Sri Lankan Tamil#Portuguese|Loan words from Portuguese in Sri Lankan Tamil]].@@@@1@40@@danf@17-8-2009 10701650@unknown@formal@none@1@S@===Derived languages===@@@@1@2@@danf@17-8-2009 10701660@unknown@formal@none@1@S@Beginning in the 16th century, the extensive contacts between Portuguese travelers and settlers, African slaves, and local populations led to the appearance of many [[pidgin]]s with varying amounts of Portuguese influence.@@@@1@31@@danf@17-8-2009 10701670@unknown@formal@none@1@S@As these pidgins became the mother tongue of succeeding generations, they evolved into fully fledged [[creole language]]s, which remained in use in many parts of Asia and Africa until the 18th century.@@@@1@32@@danf@17-8-2009 10701680@unknown@formal@none@1@S@Some Portuguese-based or Portuguese-influenced creoles are still spoken today, by over 3 million people worldwide, especially people of partial [[Portuguese people|Portuguese]] ancestry.@@@@1@22@@danf@17-8-2009 10701690@unknown@formal@none@1@S@== Phonology ==@@@@1@3@@danf@17-8-2009 10701700@unknown@formal@none@1@S@There is a maximum of 9 oral vowels and 19 consonants, though some varieties of the language have fewer phonemes (Brazilian Portuguese has only 8 oral vowel [[phone]]s).@@@@1@28@@danf@17-8-2009 10701710@unknown@formal@none@1@S@There are also five nasal vowels, which some linguists regard as allophones of the oral vowels, ten oral [[diphthong]]s, and five nasal diphthongs.@@@@1@23@@danf@17-8-2009 10701720@unknown@formal@none@1@S@===Vowels===@@@@1@1@@danf@17-8-2009 10701730@unknown@formal@none@1@S@To the seven vowels of [[Vulgar Latin]], European Portuguese has added two [[Mid-centralized vowel|near central vowels]], one of which tends to be [[elision|elided]] in [[relaxed pronunciation|rapid speech]], like the ''e caduc'' of [[French language|French]] (represented either as {{IPA|/ɯ̽/}}, or {{IPA|/ɨ/}}, or {{IPA|/ə/}}).@@@@1@42@@danf@17-8-2009 10701740@unknown@formal@none@1@S@The high vowels {{IPA|/e o/}} and the low vowels {{IPA|/ɛ ɔ/}} are four distinct phonemes, and they alternate in various forms of [[apophony]].@@@@1@23@@danf@17-8-2009 10701750@unknown@formal@none@1@S@Like [[Catalan language|Catalan]], Portuguese uses vowel quality to contrast stressed syllables with unstressed syllables: isolated vowels tend to be [[Vowel#Height|raised]], and in some cases centralized, when unstressed.@@@@1@27@@danf@17-8-2009 10701760@unknown@formal@none@1@S@Nasal diphthongs occur mostly at the end of words.@@@@1@9@@danf@17-8-2009 10701770@unknown@formal@none@1@S@===Consonants===@@@@1@1@@danf@17-8-2009 10701780@unknown@formal@none@1@S@The consonant inventory of Portuguese is fairly conservative.@@@@1@8@@danf@17-8-2009 10701790@unknown@formal@none@1@S@The medieval affricates {{IPA|/ts/}}, {{IPA|/dz/}}, {{IPA|/tʃ/}}, {{IPA|/dʒ/}} merged with the fricatives {{IPA|/s/}}, {{IPA|/z/}}, {{IPA|/ʃ/}}, {{IPA|/ʒ/}}, respectively, but not with each other, and there were no other significant changes to the consonant phonemes since then.@@@@1@34@@danf@17-8-2009 10701800@unknown@formal@none@1@S@However, some remarkable dialectal variants and [[allophone]]s have appeared, among which:@@@@1@11@@danf@17-8-2009 10701810@unknown@formal@none@1@S@*In many regions of Brazil, {{IPA|/t/}} and {{IPA|/d/}} have the affricate allophones {{IPA|[tʃ]}} and {{IPA|[dʒ]}}, respectively, before {{IPA|/i/}} and {{IPA|/ĩ/}}.@@@@1@20@@danf@17-8-2009 10701820@unknown@formal@none@1@S@([[Quebec French]] has a similar phenomenon, with alveolar affricates instead of postalveolars.@@@@1@12@@danf@17-8-2009 10701830@unknown@formal@none@1@S@[[Japanese language|Japanese]] is another example).@@@@1@5@@danf@17-8-2009 10701840@unknown@formal@none@1@S@*At the end of a syllable, the phoneme {{IPA|/l/}} has the allophone {{IPA|[u̯]}} in Brazilian Portuguese (''[[L-vocalization#L-vocalization|L-vocalization]]'').@@@@1@17@@danf@17-8-2009 10701850@unknown@formal@none@1@S@*In many parts of Brazil and Angola, intervocalic {{IPA|/ɲ/}} is pronounced as a [[nasalization|nasalized]] [[palatal approximant]] {{IPA|[j̃]}} which nasalizes the preceding vowel, so that for instance {{IPA|/ˈniɲu/}} is pronounced {{IPA|[ˈnĩj̃u]}}.@@@@1@30@@danf@17-8-2009 10701860@unknown@formal@none@1@S@*In most of Brazil, the alveolar sibilants {{IPA|/s/}} and {{IPA|/z/}} occur in complementary distribution at the end of syllables, depending on whether the consonant that follows is voiceless or voiced, as in English.@@@@1@33@@danf@17-8-2009 10701870@unknown@formal@none@1@S@But in most of Portugal and parts of Brazil sibilants are postalveolar at the end of syllables, {{IPA|/ʃ/}} before voiceless consonants, and {{IPA|/ʒ/}} before voiced consonants (in [[Ladino language|Judeo-Spanish]], {{IPA|/s/}} is often replaced with {{IPA|/ʃ/}} at the end of syllables, too).@@@@1@41@@danf@17-8-2009 10701880@unknown@formal@none@1@S@*There is considerable dialectal variation in the value of the [[Rhotic consonant|rhotic]] phoneme {{IPA|/ʁ/}}.@@@@1@14@@danf@17-8-2009 10701890@unknown@formal@none@1@S@See [[Guttural R#Portuguese|Guttural R in Portuguese]], for details.@@@@1@8@@danf@17-8-2009 10701900@unknown@formal@none@1@S@==Grammar==@@@@1@1@@danf@17-8-2009 10701910@unknown@formal@none@1@S@A particularly interesting aspect of the grammar of Portuguese is the verb.@@@@1@12@@danf@17-8-2009 10701920@unknown@formal@none@1@S@Morphologically, more verbal inflections from classical Latin have been preserved by Portuguese than any other major Romance language.@@@@1@18@@danf@17-8-2009 10701930@unknown@formal@none@1@S@See [[Romance copula#Morphological comparison|Romance copula]], for a detailed comparison.@@@@1@9@@danf@17-8-2009 10701940@unknown@formal@none@1@S@It has also some innovations not found in other Romance languages (except Galician and the Fala):@@@@1@16@@danf@17-8-2009 10701950@unknown@formal@none@1@S@* The [[present perfect tense]] has an iterative sense unique among the Romance languages.@@@@1@14@@danf@17-8-2009 10701960@unknown@formal@none@1@S@It denotes an action or a series of actions which began in the past and are expected to keep repeating in the future.@@@@1@23@@danf@17-8-2009 10701970@unknown@formal@none@1@S@For instance, the sentence ''Tenho tentado falar com ela'' would be translated to "I have been trying to talk to her", not "I have tried to talk to her".@@@@1@29@@danf@17-8-2009 10701980@unknown@formal@none@1@S@On the other hand, the correct translation of the question "Have you heard the latest news?" is not ''*Tem ouvido a última notícia?'', but ''Ouviu a última notícia?'', since no repetition is implied.@@@@1@33@@danf@17-8-2009 10701990@unknown@formal@none@1@S@* The future [[Subjunctive mood|subjunctive]] tense, which was developed by medieval [[West Iberian languages|West Iberian Romance]], but has now fallen into disuse in Spanish, is still used in [[vernacular]] Portuguese.@@@@1@30@@danf@17-8-2009 10702000@unknown@formal@none@1@S@It appears in dependent clauses that denote a condition which must be fulfilled in the future, so that the independent clause will occur.@@@@1@23@@danf@17-8-2009 10702010@unknown@formal@none@1@S@Other languages normally employ the present tense under the same circumstances:@@@@1@11@@danf@17-8-2009 10702020@unknown@formal@none@1@S@:''Se ''for'' eleito presidente, mudarei a lei.''@@@@1@7@@danf@17-8-2009 10702030@unknown@formal@none@1@S@:If ''I am'' elected president, I will change the law.@@@@1@10@@danf@17-8-2009 10702040@unknown@formal@none@1@S@:''Quando ''fores'' mais velho, vais entender.''@@@@1@6@@danf@17-8-2009 10702050@unknown@formal@none@1@S@:When ''you are'' older, you will understand.@@@@1@7@@danf@17-8-2009 10702060@unknown@formal@none@1@S@* The personal [[infinitive]]: infinitives can [[inflection|inflect]] according to their subject in [[Grammatical person|person]] and [[Grammatical number|number]], often showing who is expected to perform a certain action; cf. ''É melhor voltares'' "It is better [for you] to go back," ''É melhor voltarmos'' "It is better [for us] to go back."@@@@1@50@@danf@17-8-2009 10702070@unknown@formal@none@1@S@Perhaps for this reason, infinitive clauses replace subjunctive clauses more often in Portuguese than in other Romance languages.@@@@1@18@@danf@17-8-2009 10702080@unknown@formal@none@1@S@==Writing system==@@@@1@2@@danf@17-8-2009 10702090@unknown@formal@none@1@S@Portuguese is written with the [[Latin alphabet]], making use of five [[diacritic]]s to denote stress, vowel height, contraction, nasalization, and other sound changes (acute accent, grave accent, circumflex accent, tilde, and cedilla).@@@@1@32@@danf@17-8-2009 10702100@unknown@formal@none@1@S@[[Brazilian Portuguese]] also uses the diaeresis mark.@@@@1@7@@danf@17-8-2009 10702110@unknown@formal@none@1@S@Accented characters and digraphs are not counted as separate letters for [[collation]] purposes.@@@@1@13@@danf@17-8-2009 10702120@unknown@formal@none@1@S@===Brazilian vs. European spelling===@@@@1@4@@danf@17-8-2009 10702130@unknown@formal@none@1@S@There are some minor differences between the orthographies of Brazil and other Portuguese language countries.@@@@1@15@@danf@17-8-2009 10702140@unknown@formal@none@1@S@One of the most pervasive is the use of acute accents in the European/African/Asian orthography in many words such as ''sinónimo'', where the Brazilian orthography has a circumflex accent, ''sinônimo''.@@@@1@30@@danf@17-8-2009 10702150@unknown@formal@none@1@S@Another important difference is that Brazilian spelling often lacks ''c'' or ''p'' before ''c'', ''ç'', or ''t'', where the European orthography has them; for example, cf. Brazilian ''fato'' with European ''facto'', "fact", or Brazilian ''objeto'' with European ''objecto'', "object".@@@@1@39@@danf@17-8-2009 10702160@unknown@formal@none@1@S@Some of these spelling differences reflect differences in the pronunciation of the words, but others are merely graphic.@@@@1@18@@danf@17-8-2009 10702170@unknown@formal@none@1@S@==Examples==@@@@1@1@@danf@17-8-2009 10702180@unknown@formal@none@1@S@;Excerpt from the Portuguese [[national epic]] ''[[Os Lusíadas]]'', by author [[Luís de Camões]] (I, 33)@@@@1@15@@danf@17-8-2009