10640010@unknown@formal@none@1@S@<article>OpenOffice.org</article>@@@@1@1@@danf@17-8-2009
10640020@unknown@formal@none@1@S@'''OpenOffice.org''' ('''OO.o''' or '''OOo''') is a [[cross-platform]] [[office suite|office application suite]] available for a number of different computer [[operating system]]s.@@@@1@20@@danf@17-8-2009
10640030@unknown@formal@none@1@S@It supports the ISO standard '''[[OpenDocument]] Format (ODF)''' for data interchange as its default [[file format]], as well as [[Microsoft Office]] 97–2003 formats, [[Microsoft Office]] 2007 format (in version 3), among many others.@@@@1@33@@danf@17-8-2009
10640040@unknown@formal@none@1@S@OpenOffice.org was originally derived from [[StarOffice]], an office suite developed by [[StarDivision]] and acquired by [[Sun Microsystems]] in August 1999.@@@@1@20@@danf@17-8-2009
10640050@unknown@formal@none@1@S@The [[source code]] of the suite was released in July 2000 with the aim of reducing the dominant [[market share]] of [[Microsoft Office]] by providing a free, open and high-quality alternative; later versions of StarOffice are based upon OpenOffice.org with additional proprietary components.@@@@1@43@@danf@17-8-2009
10640060@unknown@formal@none@1@S@OpenOffice.org is [[free software]], available under the [[GNU Lesser General Public License]] (LGPL).@@@@1@13@@danf@17-8-2009
10640070@unknown@formal@none@1@S@The project and software are informally referred to as ''OpenOffice'', but this term is a [[trademark]] held by another party, requiring the project to adopt ''OpenOffice.org'' as its formal name.@@@@1@30@@danf@17-8-2009
10640080@unknown@formal@none@1@S@== History ==@@@@1@3@@danf@17-8-2009
10640090@unknown@formal@none@1@S@Originally developed as the [[proprietary software]] application suite StarOffice by the German company [[StarDivision]], the code was purchased in 1999 by Sun Microsystems.@@@@1@23@@danf@17-8-2009
10640100@unknown@formal@none@1@S@In August 1999 version 5.2 of StarOffice was made available free of charge.@@@@1@13@@danf@17-8-2009
10640110@unknown@formal@none@1@S@On [[July 19]], [[2000]], Sun Microsystems announced that it was making the source code of StarOffice available for download under both the LGPL and the [[Sun Industry Standards Source License]] (SISSL) with the intention of building an open source development community around the software.@@@@1@44@@danf@17-8-2009
10640120@unknown@formal@none@1@S@The new project was known as OpenOffice.org, and its website went live on [[October 13]], [[2000]].@@@@1@16@@danf@17-8-2009
10640130@unknown@formal@none@1@S@Work on version 2.0 began in early 2003 with the following goals: better interoperability with Microsoft Office; better performance, with improved speed and lower memory usage; greater [[Scripting language|scripting]] capabilities; better integration, particularly with [[GNOME]]; an easier-to-find and use database front-end for creating reports, forms and queries; a new built-in [[SQL]] database; and improved [[usability]].@@@@1@55@@danf@17-8-2009
10640140@unknown@formal@none@1@S@A [[beta version]] was released on [[March 4]], [[2005]].@@@@1@9@@danf@17-8-2009
10640150@unknown@formal@none@1@S@On [[September 2]], [[2005]] Sun announced that it was retiring the SISSL.@@@@1@12@@danf@17-8-2009
10640160@unknown@formal@none@1@S@As a consequence, the OpenOffice.org Community Council announced that it would no longer [[dual license]] the office suite, and future versions would use only the LGPL.@@@@1@26@@danf@17-8-2009
10640170@unknown@formal@none@1@S@On [[October 20]], [[2005]], OpenOffice.org 2.0 was formally released to the public.@@@@1@12@@danf@17-8-2009
10640180@unknown@formal@none@1@S@Eight weeks after the release of Version 2.0, an update, OpenOffice.org 2.0.1, was released.@@@@1@14@@danf@17-8-2009
10640190@unknown@formal@none@1@S@It fixed minor bugs and introduced new features.@@@@1@8@@danf@17-8-2009
10640200@unknown@formal@none@1@S@As of the 2.0.3 release, OpenOffice.org changed its release cycle from every 18 months to every three months for updates, feature enhancements and bug fixes.@@@@1@25@@danf@17-8-2009
10640210@unknown@formal@none@1@S@Currently, new versions that include new features (so-called "feature releases") are released every six months, alternating with so-called "bug fix releases", which appear between two feature releases (every three months).@@@@1@30@@danf@17-8-2009
10640220@unknown@formal@none@1@S@=== StarOffice ===@@@@1@3@@danf@17-8-2009
10640230@unknown@formal@none@1@S@Sun subsidizes the development of OpenOffice.org in order to use it as a base for its commercial [[proprietary software|proprietary]] StarOffice application software.@@@@1@22@@danf@17-8-2009
10640240@unknown@formal@none@1@S@Releases of StarOffice since version 6.0 have been based on the OpenOffice.org source code, with some additional proprietary components, including:@@@@1@20@@danf@17-8-2009
10640250@unknown@formal@none@1@S@* Additional bundled fonts (especially [[CJK|East Asian language]] fonts).@@@@1@9@@danf@17-8-2009
10640260@unknown@formal@none@1@S@* [[Adabas D]] database.@@@@1@4@@danf@17-8-2009
10640270@unknown@formal@none@1@S@* Additional document [[Template (word processing)|templates]].@@@@1@6@@danf@17-8-2009
10640280@unknown@formal@none@1@S@* [[Clip art]].@@@@1@3@@danf@17-8-2009
10640290@unknown@formal@none@1@S@* Sorting functionality for Asian versions.@@@@1@6@@danf@17-8-2009
10640300@unknown@formal@none@1@S@* Additional file filters.@@@@1@4@@danf@17-8-2009
10640310@unknown@formal@none@1@S@* Migration assessment tool (Enterprise Edition).@@@@1@6@@danf@17-8-2009
10640320@unknown@formal@none@1@S@* Macro migration tool (Enterprise Edition).@@@@1@6@@danf@17-8-2009
10640330@unknown@formal@none@1@S@* Configuration management tool (Enterprise Edition).@@@@1@6@@danf@17-8-2009
10640340@unknown@formal@none@1@S@OpenOffice.org, therefore, inherited many features from the original StarOffice upon which it was based, including the [[OpenOffice.org XML]] file format, which it retained until version 2, when it was replaced by the ISO standard [[OpenDocument]] Format (ODF).@@@@1@37@@danf@17-8-2009
10640350@unknown@formal@none@1@S@== Features ==@@@@1@3@@danf@17-8-2009
10640360@unknown@formal@none@1@S@According to its [[mission statement]], the OpenOffice.org project aims "''To create, as a community, the leading international office suite that will run on all major platforms and provide access to all functionality and data through open-component based APIs and an XML-based file format.''"@@@@1@43@@danf@17-8-2009
10640370@unknown@formal@none@1@S@OpenOffice.org aims to compete with Microsoft Office and emulate its look and feel where suitable.@@@@1@15@@danf@17-8-2009
10640380@unknown@formal@none@1@S@It can read and write most of the [[file formats]] found in Microsoft Office, and many other applications, an essential feature of the suite for many users.@@@@1@27@@danf@17-8-2009
10640390@unknown@formal@none@1@S@OpenOffice.org has been found to be able to open files of older versions of Microsoft Office and damaged files that newer versions of Microsoft Office itself cannot open.@@@@1@28@@danf@17-8-2009
10640400@unknown@formal@none@1@S@However, it cannot open older Word for Macintosh (MCW) files.@@@@1@10@@danf@17-8-2009
10640410@unknown@formal@none@1@S@=== Platforms ===@@@@1@3@@danf@17-8-2009
10640420@unknown@formal@none@1@S@Platforms for which OO.o is available include [[Microsoft Windows]], [[Linux]], [[Solaris Operating System|Solaris]], [[BSD]], [[OpenVMS]], [[OS/2]] and [[IRIX]].@@@@1@18@@danf@17-8-2009
10640430@unknown@formal@none@1@S@The current primary development platforms are Microsoft Windows, Linux and Solaris.@@@@1@11@@danf@17-8-2009
10640440@unknown@formal@none@1@S@A port for [[Mac OS X]] exists for OS X machines which have the [[X Window System]] component installed.@@@@1@19@@danf@17-8-2009
10640450@unknown@formal@none@1@S@A port to OS X's native [[Aqua (user interface)|Aqua user interface]] is in progress, and is scheduled for completion for the 3.0 milestone.@@@@1@23@@danf@17-8-2009
10640460@unknown@formal@none@1@S@[[NeoOffice]] is an independent [[Fork (software development)|fork]] of OpenOffice.org, specially adapted for Mac OS X.@@@@1@15@@danf@17-8-2009
10640470@unknown@formal@none@1@S@=== Version compatibility ===@@@@1@4@@danf@17-8-2009
10640480@unknown@formal@none@1@S@*Windows 95: up to v1.1.5@@@@1@5@@danf@17-8-2009
10640490@unknown@formal@none@1@S@*Windows 98-Vista: up to v2.4, development releases of v3.0@@@@1@9@@danf@17-8-2009
10640500@unknown@formal@none@1@S@*Mac OS 10.2: up to v1.1.2@@@@1@6@@danf@17-8-2009
10640510@unknown@formal@none@1@S@*Mac OS 10.3: up to v2.1@@@@1@6@@danf@17-8-2009
10640520@unknown@formal@none@1@S@*Mac OS 10.4-10.5: up to v2.4, development releases of v3.0 ([[Apple-Intel architecture|Intel]] only)@@@@1@13@@danf@17-8-2009
10640530@unknown@formal@none@1@S@*OS/2 and eComStation: up to v2.0.4@@@@1@6@@danf@17-8-2009
10640540@unknown@formal@none@1@S@=== Components ===@@@@1@3@@danf@17-8-2009
10640550@unknown@formal@none@1@S@OpenOffice.org is a collection of applications that work together closely to provide the features expected from a modern office suite.@@@@1@20@@danf@17-8-2009
10640560@unknown@formal@none@1@S@Many of the components are designed to mirror those available in Microsoft Office.@@@@1@13@@danf@17-8-2009
10640570@unknown@formal@none@1@S@The components available include:@@@@1@4@@danf@17-8-2009
10640580@unknown@formal@none@1@S@*[[QuickStart]]er@@@@1@1@@danf@17-8-2009
10640590@unknown@formal@none@1@S@:A small program for Windows and Linux that runs when the computer starts.@@@@1@13@@danf@17-8-2009
10640600@unknown@formal@none@1@S@It loads the core files and libraries for OpenOffice.org during computer startup and allows the suite applications to start more quickly when selected later.@@@@1@24@@danf@17-8-2009
10640610@unknown@formal@none@1@S@The amount of time it takes to open OpenOffice.org applications was a common complaint in version 1.0 of the suite.@@@@1@20@@danf@17-8-2009
10640620@unknown@formal@none@1@S@Substantial improvements were made in this area for version 2.2.@@@@1@10@@danf@17-8-2009
10640630@unknown@formal@none@1@S@*The [[Macro (computer science)|macro]] recorder@@@@1@5@@danf@17-8-2009
10640640@unknown@formal@none@1@S@:Records user actions and replays them later to help automate tasks, using [[OpenOffice.org Basic]] (see [[OpenOffice.org#OpenOffice.org Basic|below]]).@@@@1@17@@danf@17-8-2009
10640650@unknown@formal@none@1@S@It is not possible to download these components individually on Windows, though they can be installed separately.@@@@1@17@@danf@17-8-2009
10640660@unknown@formal@none@1@S@Most Linux distributions break the components into individual packages which may be downloaded and installed separately.@@@@1@16@@danf@17-8-2009
10640670@unknown@formal@none@1@S@=== OpenOffice.org Basic ===@@@@1@4@@danf@17-8-2009
10640680@unknown@formal@none@1@S@OpenOffice.org Basic is a programming language similar to Microsoft [[Visual Basic for Applications]] (VBA) based on [[StarOffice Basic]].@@@@1@18@@danf@17-8-2009
10640690@unknown@formal@none@1@S@In addition to the macros, the upcoming Novell edition of OpenOffice.org 2.0 supports running Microsoft VBA macros, a feature expected to be incorporated into the mainstream version soon.@@@@1@28@@danf@17-8-2009
10640700@unknown@formal@none@1@S@OpenOffice.org Basic is available in the Writer and Calc applications.@@@@1@10@@danf@17-8-2009
10640710@unknown@formal@none@1@S@It is written in functions called subroutines or macros, with each macro performing a different task, such as counting the words in a paragraph.@@@@1@24@@danf@17-8-2009
10640720@unknown@formal@none@1@S@OpenOffice.org Basic is especially useful in doing repetitive tasks that have not been integrated in the program.@@@@1@17@@danf@17-8-2009
10640730@unknown@formal@none@1@S@As the OpenOffice.org database, called "Base", uses documents created under the Writer application for reports and forms, one could say that Base can also be programmed with OpenOffice.org Basic.@@@@1@29@@danf@17-8-2009
10640740@unknown@formal@none@1@S@== File formats ==@@@@1@4@@danf@17-8-2009
10640750@unknown@formal@none@1@S@OpenOffice.org pioneered the ISO/IEC standard [[OpenDocument]] file formats (ODF), which it uses natively, by default.@@@@1@15@@danf@17-8-2009
10640760@unknown@formal@none@1@S@It also supports reading (and in some cases writing) a large number of legacy proprietary file formats (e.g. [[WordPerfect]] through libwpd, [[StarOffice]], [[Lotus software]], [[Microsoft Works|MS Works]] through libwps, [[Rich Text Format]]), most notably including [[Microsoft Office]] formats; the OpenDocument specification was "approved for release as an ISO and IEC International Standard" under the name ISO/IEC 26300:2006.@@@@1@57@@danf@17-8-2009
10640770@unknown@formal@none@1@S@=== Microsoft Office interoperability ===@@@@1@5@@danf@17-8-2009
10640780@unknown@formal@none@1@S@In response to Microsoft's recent movement towards using the [[Office Open XML]] format in [[Microsoft Office 2007]], [[Novell]] has released an [[Office Open XML]] converter for OOo under a liberal [[BSD license]] (along with [[GNU GPL]] and [[LGPL]] licensed libraries), that will be submitted for inclusion into the OpenOffice.org project.@@@@1@50@@danf@17-8-2009
10640790@unknown@formal@none@1@S@This allows OOo to read and write Microsoft Office Open XML word processing documents (.docx).@@@@1@15@@danf@17-8-2009
10640800@unknown@formal@none@1@S@Currently it works only with the latest Novell edition of OpenOffice.org.@@@@1@11@@danf@17-8-2009
10640810@unknown@formal@none@1@S@[[Sun Microsystems]] has developed an ODF plugin for Microsoft Office which enables users of Microsoft Office Word, Excel and PowerPoint to read and write ODF documents.@@@@1@26@@danf@17-8-2009
10640820@unknown@formal@none@1@S@The plugin currently works with Microsoft Office 2003, Microsoft Office XP and Microsoft Office 2000.@@@@1@15@@danf@17-8-2009
10640830@unknown@formal@none@1@S@Support for Microsoft Office 2007 is only available in combination with Microsoft Office 2007 SP1.@@@@1@15@@danf@17-8-2009
10640840@unknown@formal@none@1@S@Several software companies (including Microsoft and Novell) are working on an add-in for Microsoft Office that allows reading and writing ODF files.@@@@1@22@@danf@17-8-2009
10640850@unknown@formal@none@1@S@Currently it works only for Microsoft Word 2007 / XP / 2003.@@@@1@12@@danf@17-8-2009
10640860@unknown@formal@none@1@S@Microsoft provides a compatibility pack to read and write Office Open XML files with Office 2000, XP and 2003.@@@@1@19@@danf@17-8-2009
10640870@unknown@formal@none@1@S@The compatibility pack can also be used as a stand-alone converter with Microsoft Office 97.@@@@1@15@@danf@17-8-2009
10640880@unknown@formal@none@1@S@This might be helpful to convert older Microsoft Office files via Office Open XML to ODF if a direct conversion doesn't work as expected.@@@@1@24@@danf@17-8-2009
10640890@unknown@formal@none@1@S@The Office compatibility pack, however, does not install for Office 2000 or Office XP on [[Windows 9x]].@@@@1@17@@danf@17-8-2009
10640900@unknown@formal@none@1@S@Note that some office applications built with Microsoft components may refuse to import OpenOffice data.@@@@1@15@@danf@17-8-2009
10640910@unknown@formal@none@1@S@[[The Sage Group]]'s Simply Accounting, for example, can import Excel's .xls files, but refuses to accept OpenOffice.org-generated .xls files for the reason that the OOo .xls files are not "genuine Microsoft" .xls files.@@@@1@33@@danf@17-8-2009
10640920@unknown@formal@none@1@S@== Development ==@@@@1@3@@danf@17-8-2009
10640930@unknown@formal@none@1@S@=== Overview ===@@@@1@3@@danf@17-8-2009
10640940@unknown@formal@none@1@S@The OpenOffice.org [[Application Programming Interface|API]] is based on a component technology known as [[Universal Network Objects]] (UNO).@@@@1@17@@danf@17-8-2009
10640950@unknown@formal@none@1@S@It consists of a wide range of interfaces defined in a [[CORBA]]-like [[interface description language]].@@@@1@15@@danf@17-8-2009
10640960@unknown@formal@none@1@S@The [[document file format]] used is based on [[XML]] and several export and import filters.@@@@1@15@@danf@17-8-2009
10640970@unknown@formal@none@1@S@All external formats read by OpenOffice.org are converted back and forth from an internal XML representation.@@@@1@16@@danf@17-8-2009
10640980@unknown@formal@none@1@S@By using [[data compression|compression]] when saving [[XML]] to disk, files are generally smaller than the equivalent binary Microsoft Office documents.@@@@1@20@@danf@17-8-2009
10640990@unknown@formal@none@1@S@The native file format for storing documents in version 1.0 was used as the basis of the [[OASIS (organization)|OASIS]] OpenDocument file format standard, which has become the default file format in version 2.0.@@@@1@33@@danf@17-8-2009
10641000@unknown@formal@none@1@S@Development versions of the suite are released every few weeks on the developer zone of the OpenOffice.org website.@@@@1@18@@danf@17-8-2009
10641010@unknown@formal@none@1@S@The releases are meant for those who wish to test new features or are simply curious about forthcoming changes; they are not suitable for production use.@@@@1@26@@danf@17-8-2009
10641020@unknown@formal@none@1@S@=== Native desktop integration ===@@@@1@5@@danf@17-8-2009
10641030@unknown@formal@none@1@S@OpenOffice.org 1.0 was criticized for not having the [[look and feel]] of applications developed natively for the platforms on which it runs.@@@@1@22@@danf@17-8-2009
10641040@unknown@formal@none@1@S@Starting with version 2.0, OpenOffice.org uses native [[widget toolkit]]s, icons, and font-rendering libraries across a variety of platforms, to better match native applications and provide a smoother experience for the user.@@@@1@31@@danf@17-8-2009
10641050@unknown@formal@none@1@S@There are projects underway to further improve this integration on both [[GNOME]] and [[KDE]].@@@@1@14@@danf@17-8-2009
10641060@unknown@formal@none@1@S@This issue has been particularly pronounced on Mac OS X, whose standard user interface looks noticeably different from either Windows or [[X11]]-based desktop environments and requires the use of programming toolkits unfamiliar to most OpenOffice.org developers.@@@@1@36@@danf@17-8-2009
10641070@unknown@formal@none@1@S@There are two implementations of OpenOffice.org available for OS X:@@@@1@10@@danf@17-8-2009
10641080@unknown@formal@none@1@S@;OpenOffice.org Mac OS X (X11):@@@@1@5@@danf@17-8-2009
10641090@unknown@formal@none@1@S@This official implementation requires the installation of [[X11.app]] or [[XDarwin]], and is a close port of the well-tested Unix version.@@@@1@20@@danf@17-8-2009
10641100@unknown@formal@none@1@S@It is functionally equivalent to the Unix version, and its user interface resembles the [[look and feel]] of that version; for example, the application uses its own [[menu bar]] instead of the OS X menu at the top of the screen.@@@@1@41@@danf@17-8-2009
10641110@unknown@formal@none@1@S@It also requires system fonts to be converted to X11 format for OpenOffice.org to use them (which can be done during application installation).@@@@1@23@@danf@17-8-2009
10641120@unknown@formal@none@1@S@;OpenOffice.org Aqua:@@@@1@2@@danf@17-8-2009
10641130@unknown@formal@none@1@S@After a completed first step using [[Carbon (API)|Carbon]], the [[Aqua (GUI)|Aqua]] version switched to [[Cocoa (API)|Cocoa]] technology; it is being developed under the aegis of OpenOffice.org, and a beta version is currently available.@@@@1@33@@danf@17-8-2009
10641140@unknown@formal@none@1@S@Sun Microsystems is collaborating with OOo to further development of the Aqua version of OpenOffice.org for Mac.@@@@1@17@@danf@17-8-2009
10641150@unknown@formal@none@1@S@=== Future ===@@@@1@3@@danf@17-8-2009
10641160@unknown@formal@none@1@S@Currently, a developer preview of OpenOffice.org 3 (OOo-dev 3.0) is available for download.@@@@1@13@@danf@17-8-2009
10641170@unknown@formal@none@1@S@Among the planned features for OOo 3.0, set to be released by September 2008, are:@@@@1@15@@danf@17-8-2009
10641180@unknown@formal@none@1@S@* Personal Information Manager ([[Personal Information Manager|PIM]]), probably based on [[Mozilla Thunderbird|Thunderbird]]/[[Lightning (software)|Lightning]]@@@@1@13@@danf@17-8-2009
10641190@unknown@formal@none@1@S@* PDF import into Draw (to maintain correct layout of the original PDF)@@@@1@13@@danf@17-8-2009
10641200@unknown@formal@none@1@S@* [[OOXML]] document support for opening documents created in [[Office 2007]]@@@@1@11@@danf@17-8-2009
10641210@unknown@formal@none@1@S@* Support for [[Mac OS X]] [[Aqua (user interface)|Aqua]] platform@@@@1@10@@danf@17-8-2009
10641220@unknown@formal@none@1@S@* Extensions, to add third party functionality.@@@@1@7@@danf@17-8-2009
10641230@unknown@formal@none@1@S@* Presenter screen in Impress with multi-screen support@@@@1@8@@danf@17-8-2009
10641240@unknown@formal@none@1@S@=== Other projects ===@@@@1@4@@danf@17-8-2009
10641250@unknown@formal@none@1@S@A number of products are [http://wiki.services.openoffice.org/wiki/DerivedWorks derived from OpenOffice.org].@@@@1@9@@danf@17-8-2009
10641260@unknown@formal@none@1@S@Among the more well-known ones are Sun StarOffice and NeoOffice.@@@@1@10@@danf@17-8-2009
10641270@unknown@formal@none@1@S@The OpenOffice.org site also lists a large variety of [http://wiki.services.openoffice.org/wiki/OpenOffice.org_Solutions complementary products] including groupware solutions.@@@@1@15@@danf@17-8-2009
10641280@unknown@formal@none@1@S@==== NeoOffice ====@@@@1@3@@danf@17-8-2009
10641290@unknown@formal@none@1@S@[[NeoOffice]] is an independent [[porting|port]] that integrates with [[Mac OS X|OS X]]’s [[Aqua (GUI)|Aqua]] user interface using [[Java platform|Java]], [[Carbon (API)|Carbon]] and (increasingly) [[Cocoa (API)|Cocoa]] toolkits.@@@@1@26@@danf@17-8-2009
10641300@unknown@formal@none@1@S@NeoOffice adheres fairly closely to OS X UI standards (for example, using native pull-down menus), and has direct access to OS X’s installed fonts and printers.@@@@1@26@@danf@17-8-2009
10641310@unknown@formal@none@1@S@Its releases lag behind the official OpenOffice.org X11 releases, due to its small development team and the concurrent development of the technology used to port the user interface.@@@@1@28@@danf@17-8-2009
10641320@unknown@formal@none@1@S@Other projects run alongside the main OpenOffice.org project and are easier to contribute to.@@@@1@14@@danf@17-8-2009
10641330@unknown@formal@none@1@S@These include documentation, [[internationalisation and localisation]] and the API.@@@@1@9@@danf@17-8-2009
10641340@unknown@formal@none@1@S@==== OpenGroupware.org ====@@@@1@3@@danf@17-8-2009
10641350@unknown@formal@none@1@S@[[OpenGroupware.org]] is a set of extension programs to allow the sharing of OpenOffice.org documents, calendars, address books, [[e-mail]]s, [[instant messenger|instant messaging]] and blackboards, and provide access to other [[collaborative software|groupware]] applications.@@@@1@31@@danf@17-8-2009
10641360@unknown@formal@none@1@S@There is also an effort to create and share assorted document templates and other useful additions at OOExtras.@@@@1@18@@danf@17-8-2009
10641370@unknown@formal@none@1@S@A set of [[Perl]] extensions is available through the [[CPAN]] in order to allow OpenOffice.org document processing by external programs.@@@@1@20@@danf@17-8-2009
10641380@unknown@formal@none@1@S@These libraries do not use the OpenOffice.org API.@@@@1@8@@danf@17-8-2009
10641390@unknown@formal@none@1@S@They directly read or write the OpenOffice.org files using Perl standard file [[codec|compression/decompression]], XML access and [[UTF-8]] encoding modules.@@@@1@19@@danf@17-8-2009
10641400@unknown@formal@none@1@S@==== Portable ====@@@@1@3@@danf@17-8-2009
10641410@unknown@formal@none@1@S@A distribution of OpenOffice.org called OpenOffice.org Portable is designed to run the suite from a [[USB flash drive]].@@@@1@18@@danf@17-8-2009
10641420@unknown@formal@none@1@S@==== OxygenOffice Professional ====@@@@1@4@@danf@17-8-2009
10641430@unknown@formal@none@1@S@An enhancement of OpenOffice.org (current version: 2.4), providing:@@@@1@8@@danf@17-8-2009
10641440@unknown@formal@none@1@S@* Possibility to run Visual Basic for Applications (VBA) macros in Calc (for testing)@@@@1@14@@danf@17-8-2009
10641450@unknown@formal@none@1@S@* Improved Calc HTML export@@@@1@5@@danf@17-8-2009
10641460@unknown@formal@none@1@S@* Enhanced Access support for Base@@@@1@6@@danf@17-8-2009
10641470@unknown@formal@none@1@S@* Security fixes@@@@1@3@@danf@17-8-2009
10641480@unknown@formal@none@1@S@* Enhanced performance@@@@1@3@@danf@17-8-2009
10641490@unknown@formal@none@1@S@* Enhanced color-palette@@@@1@3@@danf@17-8-2009
10641500@unknown@formal@none@1@S@* Enhanced help menu, additional User’s Manual, and extended tips for beginners@@@@1@12@@danf@17-8-2009
10641510@unknown@formal@none@1@S@Optionally it provides, free for personal and professional use:@@@@1@9@@danf@17-8-2009
10641520@unknown@formal@none@1@S@* More than 3,200 graphics, both clip art and photos.@@@@1@10@@danf@17-8-2009
10641530@unknown@formal@none@1@S@* Several templates and sample documents@@@@1@6@@danf@17-8-2009
10641540@unknown@formal@none@1@S@* Over 90 free fonts.@@@@1@5@@danf@17-8-2009
10641550@unknown@formal@none@1@S@* Additional tools like OOoWikipedia@@@@1@5@@danf@17-8-2009
10641560@unknown@formal@none@1@S@==== Extensions ====@@@@1@3@@danf@17-8-2009
10641570@unknown@formal@none@1@S@Since version 2.0.4, OpenOffice.org has supported extensions in a similar manner to [[Mozilla Firefox]].@@@@1@14@@danf@17-8-2009
10641580@unknown@formal@none@1@S@Extensions make it easy to add new functionality to an existing OpenOffice.org installation.@@@@1@13@@danf@17-8-2009
10641590@unknown@formal@none@1@S@The [http://extensions.services.openoffice.org/most_pop_ext OpenOffice.org Extension Repository] already lists more than 80 extensions.@@@@1@11@@danf@17-8-2009
10641600@unknown@formal@none@1@S@Developers can easily build new extensions for OpenOffice.org, for example by using the [http://wiki.services.openoffice.org/wiki/OpenOffice_NetBeans_Integration OpenOffice.org API Plugin for NetBeans].@@@@1@19@@danf@17-8-2009
10641610@unknown@formal@none@1@S@==== The OpenOffice.org Bibliographic Project ====@@@@1@6@@danf@17-8-2009
10641620@unknown@formal@none@1@S@This project aims to incorporate powerful [[reference management software]] into the suite.@@@@1@12@@danf@17-8-2009
10641630@unknown@formal@none@1@S@This major new addition is slated for inclusion with the standard OpenOffice.org release between late 2007 and mid-2008, or possibly later depending upon the availability of programmers.@@@@1@27@@danf@17-8-2009
10641640@unknown@formal@none@1@S@=== Security ===@@@@1@3@@danf@17-8-2009
10641650@unknown@formal@none@1@S@OpenOffice.org includes a security team, and as of June 2008 the security organization [[Secunia]] reports no known unpatched security flaws for the software.@@@@1@23@@danf@17-8-2009
10641660@unknown@formal@none@1@S@[[Kaspersky Lab]] has shown a [[proof of concept]] virus for OpenOffice.org.@@@@1@11@@danf@17-8-2009
10641670@unknown@formal@none@1@S@This shows OOo viruses are possible, but there is no known virus "in the wild".@@@@1@15@@danf@17-8-2009
10641680@unknown@formal@none@1@S@In a private meeting of the French Ministry of Defense, macro-related security issues were raised.@@@@1@15@@danf@17-8-2009
10641690@unknown@formal@none@1@S@OpenOffice.org developers have responded and noted that the supposed vulnerability had not been announced through "well defined procedures" for disclosure and that the ministry had revealed nothing specific.@@@@1@28@@danf@17-8-2009
10641700@unknown@formal@none@1@S@However, the developers have been in talks with the researcher concerning the supposed vulnerability.@@@@1@14@@danf@17-8-2009
10641710@unknown@formal@none@1@S@As with Microsoft Word, documents created in OpenOffice.org can contain [[metadata]] which may include a complete history of what was changed, when and by whom.@@@@1@25@@danf@17-8-2009
10641720@unknown@formal@none@1@S@== Ownership ==@@@@1@3@@danf@17-8-2009
10641730@unknown@formal@none@1@S@The project and software are informally referred to as ''OpenOffice'', but project organizers report that this term is a [[trademark]] held by another party, requiring them to adopt ''OpenOffice.org'' as the formal name.@@@@1@33@@danf@17-8-2009
10641740@unknown@formal@none@1@S@(Due to a similar trademark issue, the [[Brazilian Portuguese]] version of the suite is distributed under the name ''BrOffice.org''.)@@@@1@19@@danf@17-8-2009
10641750@unknown@formal@none@1@S@Development is managed by staff members of StarOffice.@@@@1@8@@danf@17-8-2009
10641760@unknown@formal@none@1@S@Some delay and difficulty in implementing external contributions to the core codebase (even those from the project's corporate sponsors) has been noted.@@@@1@22@@danf@17-8-2009
10641770@unknown@formal@none@1@S@Currently, there are [http://wiki.services.openoffice.org/wiki/DerivedWorks several derived and/or proprietary works based on OOo], with some of them being:@@@@1@17@@danf@17-8-2009
10641780@unknown@formal@none@1@S@* Sun Microsystems' [[StarOffice]], with various complementary add-ons.@@@@1@8@@danf@17-8-2009
10641790@unknown@formal@none@1@S@* IBM's [[Lotus Symphony]], with a new interface based on [[Eclipse (software)|Eclipse]] (based on OO.o 1.x).@@@@1@16@@danf@17-8-2009
10641800@unknown@formal@none@1@S@* OpenOffice.org Novell edition, integrated with [[Novell Evolution|Evolution]] and with a [[OOXML]] filter.@@@@1@13@@danf@17-8-2009
10641810@unknown@formal@none@1@S@* Beijing [[Redflag]] Chinese 2000's [[RedOffice]], fully localized in Chinese characters.@@@@1@11@@danf@17-8-2009
10641820@unknown@formal@none@1@S@* Planamesa's [[NeoOffice]] for [[Mac OS X]] with Aqua support via Java.@@@@1@12@@danf@17-8-2009
10641830@unknown@formal@none@1@S@On [[May 23]], [[2007]], the OpenOffice.org community and Redflag Chinese 2000 Software Co, Ltd. announced a joint development effort focused on integrating the new features that have been added in the RedOffice localization of OpenOffice.org, as well as quality assurance and work on the core applications.@@@@1@46@@danf@17-8-2009
10641840@unknown@formal@none@1@S@Additionally, Redflag Chinese 2000 made public its commitment to the global OO.o community stating it would "strengthen its support of the development of the world's leading free and open source productivity suite", adding around 50 engineers (that have been working on RedOffice since 2006) to the project.@@@@1@47@@danf@17-8-2009
10641850@unknown@formal@none@1@S@On [[September 10]], [[2007]], the OO.o community announced that [[IBM]] had joined to support the development of OpenOffice.org.@@@@1@18@@danf@17-8-2009
10641860@unknown@formal@none@1@S@"IBM will be making initial code contributions that it has been developing as part of its Lotus Notes product, including accessibility enhancements, and will be making ongoing contributions to the feature richness and code quality of OpenOffice.org.@@@@1@37@@danf@17-8-2009
10641870@unknown@formal@none@1@S@Besides working with the community on the free productivity suite's software, IBM will also leverage OpenOffice.org technology in its products" as has been seen with [[Lotus Symphony]].@@@@1@27@@danf@17-8-2009
10641880@unknown@formal@none@1@S@Sean Poulley, the vice president of business and strategy in IBM's [[Lotus Software]] division said that IBM plans to take a leadership role in the OpenOffice.org community together with other companies such as Sun Microsystems.@@@@1@35@@danf@17-8-2009
10641890@unknown@formal@none@1@S@IBM will work within the leadership structure that exists.@@@@1@9@@danf@17-8-2009
10641900@unknown@formal@none@1@S@On [[October 02]], [[2007]], [[Michael Meeks]] announced (prompting a response from Sun's [[Simon Phipps]] and Mathias Bauer) a derived OpenOffice.org work, under the wing of his employer [[Novell]], with the purpose of including new features and fixes that do not get easily integrated in the OOo-build upstream core.@@@@1@48@@danf@17-8-2009
10641910@unknown@formal@none@1@S@The work is called Go-OO (http://go-oo.org/), a name under which alternative OO.o software has been available for five years.@@@@1@19@@danf@17-8-2009
10641920@unknown@formal@none@1@S@The new features are shared with Novell's edition of OOo and include:@@@@1@12@@danf@17-8-2009
10641930@unknown@formal@none@1@S@* [[Visual Basic for Applications|VBA]] macros support.@@@@1@7@@danf@17-8-2009
10641940@unknown@formal@none@1@S@* Faster start up time.@@@@1@5@@danf@17-8-2009
10641950@unknown@formal@none@1@S@* "A [[Linear programming|linear optimization]] solver to optimize a cell value based on arbitrary constraints built into Calc".@@@@1@18@@danf@17-8-2009
10641960@unknown@formal@none@1@S@* Multimedia content support in documents, using the [[gstreamer]] multimedia framework.@@@@1@11@@danf@17-8-2009
10641970@unknown@formal@none@1@S@* Support for [[Microsoft Works]] formats, [[WordPerfect]] graphics (WPG format) and T602 file imports.@@@@1@14@@danf@17-8-2009
10641980@unknown@formal@none@1@S@[http://wiki.services.openoffice.org/wiki/Contributing_Patches Details about the patch handling including metrics] can be found on the OpenOffice.org site.@@@@1@15@@danf@17-8-2009
10641990@unknown@formal@none@1@S@== Reactions ==@@@@1@3@@danf@17-8-2009
10642000@unknown@formal@none@1@S@''Federal Computer Week'' listed OpenOffice.org as one of the "5 stars of open-source products."@@@@1@14@@danf@17-8-2009
10642010@unknown@formal@none@1@S@In contrast, OpenOffice.org was used in [[2005]] by ''[[The Guardian]]'' newspaper to illustrate what it claimed are the limitations of open-source software, although the article does finish by stating that the software may be better than MS Word for books.@@@@1@40@@danf@17-8-2009
10642020@unknown@formal@none@1@S@=== Market share ===@@@@1@4@@danf@17-8-2009
10642030@unknown@formal@none@1@S@It is extremely difficult to estimate the market share of OpenOffice.org because it can be freely distributed via download sites including mirrors, peer-to-peer networks, CDs, Linux distros, etc.@@@@1@28@@danf@17-8-2009
10642040@unknown@formal@none@1@S@Nevertheless, the OpenOffice.org project tries to capture key adoption data in a market share analysis.@@@@1@15@@danf@17-8-2009
10642050@unknown@formal@none@1@S@Although Microsoft Office retains 95% of the general market as measured by revenue, OpenOffice.org and StarOffice have secured 14% of the large enterprise market as of 2004 and 19% of the small to midsize business market in 2005.@@@@1@38@@danf@17-8-2009
10642060@unknown@formal@none@1@S@The OpenOffice.org web site reports more than 98 million downloads.@@@@1@10@@danf@17-8-2009
10642070@unknown@formal@none@1@S@Other large scale users of OpenOffice.org include [[Ministry of Defence (Singapore)|Singapore’s Ministry of Defence]], and [[Bristol]] City Council in the UK.@@@@1@21@@danf@17-8-2009
10642080@unknown@formal@none@1@S@In [[France]], OpenOffice.org has attracted the attention of both local and national government administrations who wish to rationalize their software procurement, as well as have stable, standard file formats for archival purposes.@@@@1@32@@danf@17-8-2009
10642090@unknown@formal@none@1@S@It is now the official office suite for the [[French Gendarmerie]].@@@@1@11@@danf@17-8-2009
10642100@unknown@formal@none@1@S@Several government organizations in India, such as [[IIT Bombay]] (a renowned technical institute), the [[Supreme Court of India]], and the [[Allahabad High Court]], which use Linux, rely completely on OpenOffice.org for their administration.@@@@1@33@@danf@17-8-2009
10642110@unknown@formal@none@1@S@On [[October 4]], [[2005]], Sun and [[Google]] announced a strategic partnership.@@@@1@11@@danf@17-8-2009
10642120@unknown@formal@none@1@S@As part of this agreement, Sun would add a Google search bar to OpenOffice.org, Sun and Google would engage in joint marketing activities as well as joint research and development, and Google would help distribute OpenOffice.org.@@@@1@36@@danf@17-8-2009
10642130@unknown@formal@none@1@S@Google is currently distributing StarOffice as part of the [[Google Pack]].@@@@1@11@@danf@17-8-2009
10642140@unknown@formal@none@1@S@Besides StarOffice, there are still a number of OpenOffice.org-derived commercial products.@@@@1@11@@danf@17-8-2009
10642150@unknown@formal@none@1@S@Most of them are developed under the [[SISSL]] license (which is valid up to OpenOffice.org 2.0 Beta 2).@@@@1@18@@danf@17-8-2009
10642160@unknown@formal@none@1@S@In general they are targeted at local or niche markets, with proprietary add-ons such as speech recognition modules, automatic database connection, or better [[CJK]] support.@@@@1@25@@danf@17-8-2009
10642170@unknown@formal@none@1@S@In July 2007 Everex, a division of First International Computer and the 9th largest PC supplier in the U.S., began shipping systems preloaded with OpenOffice.org 2.2 to Wal-Mart and Sam's Club throughout North America.@@@@1@34@@danf@17-8-2009
10642180@unknown@formal@none@1@S@In September 2007 IBM announced that it would supply and support OpenOffice.org branded as [[Lotus Symphony]], and integrated into Lotus Notes.@@@@1@21@@danf@17-8-2009
10642190@unknown@formal@none@1@S@IBM also announced that 35 developers would be assigned to work on OpenOffice.org, and that it would join the OpenOffice.org foundation.@@@@1@21@@danf@17-8-2009
10642200@unknown@formal@none@1@S@Commentators noted parallels between IBM's 2000 support of Linux and this announcement.@@@@1@12@@danf@17-8-2009
10642210@unknown@formal@none@1@S@=== Java controversy ===@@@@1@4@@danf@17-8-2009
10642220@unknown@formal@none@1@S@In the past, OpenOffice.org was criticized for an increasing dependency on the [[Java Runtime Environment]], which was not [[free software]].@@@@1@20@@danf@17-8-2009
10642230@unknown@formal@none@1@S@That Sun Microsystems is both the creator of Java and the chief supporter of OpenOffice.org drew accusations of ulterior motives for this technology choice.@@@@1@24@@danf@17-8-2009
10642240@unknown@formal@none@1@S@Version 1 depended on the [[Java Runtime Environment]] (JRE) being present on the user’s computer for some auxiliary functions, but version 2 increased the suite’s use of Java, requiring a JRE.@@@@1@31@@danf@17-8-2009
10642250@unknown@formal@none@1@S@In response, [[Red Hat]] increased their efforts to improve [[free Java implementations]].@@@@1@12@@danf@17-8-2009
10642260@unknown@formal@none@1@S@Red Hat’s [[Fedora (Linux distribution)|Fedora Core]] 4 (released on [[June 13]], [[2005]]) included a beta version of OpenOffice.org version 2, running on [[GNU Compiler for Java|GCJ]] and [[GNU Classpath]].@@@@1@29@@danf@17-8-2009
10642270@unknown@formal@none@1@S@The issue of OpenOffice.org’s use of Java came to the fore in May 2005, when [[Richard Stallman]] appeared to call for a [[fork (software)|fork]] of the application in a posting on the [[Free Software Foundation]] website.@@@@1@36@@danf@17-8-2009
10642280@unknown@formal@none@1@S@This led to discussions within the OpenOffice.org community and between Sun staff and developers involved in [[GNU Classpath]], a free replacement for Sun’s Java implementation.@@@@1@25@@danf@17-8-2009
10642290@unknown@formal@none@1@S@Later that year, the OpenOffice.org developers also placed into their development guidelines various requirements to ensure that future versions of OpenOffice.org could be run on free implementations of Java, and fixed the issues that previously prevented OpenOffice.org 2.0 from using free software Java implementations.@@@@1@44@@danf@17-8-2009
10642300@unknown@formal@none@1@S@On [[November 13]], [[2006]], Sun committed to releasing Java under the [[GNU General Public License]] in the near future.@@@@1@19@@danf@17-8-2009
10642310@unknown@formal@none@1@S@This process would end OpenOffice.org's dependence on [[non-free]] software.@@@@1@9@@danf@17-8-2009
10642320@unknown@formal@none@1@S@Between November 2006 and May 2007, Sun Microsystems made available most of their Java technologies under the GNU General Public License, in compliance with the specifications of the Java Community Process, thus making almost all of Sun's Java also free software.@@@@1@41@@danf@17-8-2009
10642330@unknown@formal@none@1@S@The following areas of OpenOffice.org 2.0 depend on the JRE being present:@@@@1@12@@danf@17-8-2009
10642340@unknown@formal@none@1@S@* The [[media player (application software)|media player]] on Unix-like systems@@@@1@10@@danf@17-8-2009
10642350@unknown@formal@none@1@S@* All document wizards in Writer@@@@1@6@@danf@17-8-2009
10642360@unknown@formal@none@1@S@* Accessibility tools@@@@1@3@@danf@17-8-2009
10642370@unknown@formal@none@1@S@* Report Autopilot@@@@1@3@@danf@17-8-2009
10642380@unknown@formal@none@1@S@* [[JDBC]] driver support@@@@1@4@@danf@17-8-2009
10642390@unknown@formal@none@1@S@* [[Hsqldb|HSQL]] database engine, which is used in OpenOffice.org Base@@@@1@10@@danf@17-8-2009
10642400@unknown@formal@none@1@S@* [[XSLT]] filters@@@@1@3@@danf@17-8-2009
10642410@unknown@formal@none@1@S@* [[BeanShell]], the [[NetBeans]] scripting language and the Java UNO bridge@@@@1@11@@danf@17-8-2009
10642420@unknown@formal@none@1@S@* Export filters to the AportisDoc (.pdb) format for the [[Palm OS]] or [[Pocket Word]] (.psw) format for the [[Pocket PC]]@@@@1@21@@danf@17-8-2009
10642430@unknown@formal@none@1@S@* Export filter to [[LaTeX]]@@@@1@5@@danf@17-8-2009
10642440@unknown@formal@none@1@S@* Export filter to [[MediaWiki]]'s [[wikitext]]@@@@1@6@@danf@17-8-2009
10642450@unknown@formal@none@1@S@A common point of confusion is that [[mail merge]] to generate emails requires the Java API JavaMail in [[StarOffice]]; however, as of version 2.0.1, OpenOffice.org uses a [[Python (programming language)|Python]] component instead.@@@@1@32@@danf@17-8-2009
10642460@unknown@formal@none@1@S@=== Complementary software ===@@@@1@4@@danf@17-8-2009
10642470@unknown@formal@none@1@S@OpenOffice.org provides replacements for MS Office's [[Microsoft Word]], [[Microsoft Excel]], [[Microsoft PowerPoint]], [[Microsoft Access]], [[Equation Editor|Microsoft Equation Editor]] and [[Microsoft Visio]].@@@@1@21@@danf@17-8-2009
10642480@unknown@formal@none@1@S@To match the functionality of the rest of MS Office, OOo can be complemented with other open source programs such as:@@@@1@21@@danf@17-8-2009
10642490@unknown@formal@none@1@S@* [[Novell Evolution|Evolution]] or [[Mozilla Thunderbird|Thunderbird]]/[[Lightning (software)|Lightning]] for a PIM like [[Microsoft Outlook]].@@@@1@13@@danf@17-8-2009
10642500@unknown@formal@none@1@S@* [[OpenProj]] (which seeks integration with OOo, but might be limited due to licensing issues) for [[Microsoft Project]].@@@@1@18@@danf@17-8-2009
10642510@unknown@formal@none@1@S@* [[Scribus]] for [[Microsoft Publisher]]@@@@1@5@@danf@17-8-2009
10642520@unknown@formal@none@1@S@* [[O3spaces]] for [[Sharepoint]]@@@@1@4@@danf@17-8-2009
10642530@unknown@formal@none@1@S@Microsoft also provides Administrative Template Files ("adm files") that allow MS Office to be configured using Windows Group Policy.@@@@1@19@@danf@17-8-2009
10642540@unknown@formal@none@1@S@Equivalent functionality for OpenOffice.org is provided by [http://openoffice-enterprise.com/ OpenOffice-Enterprise], a commercial product from Open Office Technology, Inc.@@@@1@17@@danf@17-8-2009
10642550@unknown@formal@none@1@S@=== Issues ===@@@@1@3@@danf@17-8-2009
10642560@unknown@formal@none@1@S@OpenOffice.org has been criticized for slow start times and extensive CPU and RAM usage compared with competing software such as Microsoft Office.@@@@1@22@@danf@17-8-2009
10642570@unknown@formal@none@1@S@Tests comparing OpenOffice.org&nbsp;2.2 and Microsoft Office&nbsp;2007 found that OpenOffice.org took approximately twice the processing time and memory to load itself along with a blank file, and approximately 4.7 times the processing time and 3.9 times the memory to open an extremely large spreadsheet file.@@@@1@44@@danf@17-8-2009
10642580@unknown@formal@none@1@S@Critics have pointed to excessive code bloat and OpenOffice.org's loading of the [[Java Virtual Machine|Java Runtime Environment]] as possible reasons for the slow speeds and excessive memory usage.@@@@1@28@@danf@17-8-2009
10642590@unknown@formal@none@1@S@However, since OpenOffice.org 2.2 the performance of OpenOffice.org has been improved dramatically.@@@@1@12@@danf@17-8-2009
10642600@unknown@formal@none@1@S@One of the greatest challenges for OpenOffice.org is achieving true cross-compatibility with other applications.@@@@1@14@@danf@17-8-2009
10642610@unknown@formal@none@1@S@Since OpenOffice.org is forced to reverse engineer proprietary binary formats due to unavailability of open specifications, slight formatting incompatibilities tend to exist when files are saved in a non-native format.@@@@1@30@@danf@17-8-2009
10642620@unknown@formal@none@1@S@For example, a complex .doc document formatted under OpenOffice.org is usually not displayed with the correct formatting when opened with Microsoft Office.@@@@1@22@@danf@17-8-2009
10642630@unknown@formal@none@1@S@== Retail ==@@@@1@3@@danf@17-8-2009
10642640@unknown@formal@none@1@S@The [[free software license]] under which OpenOffice.org is distributed allows unlimited use of the software for both home and business use, including unlimited redistribution of the software.@@@@1@27@@danf@17-8-2009
10642650@unknown@formal@none@1@S@Several businesses sell the OpenOffice.org suite on auction websites such as [[eBay]], offering value-added services such as 24/7 technical support, download mirrors, and CD mailing.@@@@1@25@@danf@17-8-2009
10642660@unknown@formal@none@1@S@However, often the 24/7 support offered is not provided by the company selling the software, but rather by the official OpenOffice.org mailing list.@@@@1@23@@danf@17-8-2009
10650010@unknown@formal@none@1@S@<article>Parsing</article>@@@@1@1@@danf@17-8-2009
10650020@unknown@formal@none@1@S@In [[computer science]] and [[linguistics]], '''parsing''', or, more formally, '''syntactic analysis''', is the process of analyzing a sequence of [[Token (parser)|tokens]] to determine grammatical structure with respect to a given (more or less) [[formal grammar]].@@@@1@35@@danf@17-8-2009
10650030@unknown@formal@none@1@S@A '''parser''' is thus one of the components in an [[interpreter]] or [[compiler]], where it captures the implied hierarchy of the input text and transforms it into a form suitable for further processing (often some kind of [[parse tree]], [[abstract syntax tree]] or other hierarchical structure) and normally checks for syntax errors at the same time.@@@@1@56@@danf@17-8-2009
10650040@unknown@formal@none@1@S@The parser often uses a separate [[lexical analyser]] to create tokens from the sequence of input characters.@@@@1@17@@danf@17-8-2009
10650050@unknown@formal@none@1@S@Parsers may be programmed by hand or may be semi-automatically generated (in some programming language) by a tool (such as [[Yet Another Compiler Compiler|Yacc]]) from a grammar written in [[Backus-Naur form]].@@@@1@31@@danf@17-8-2009
10650060@unknown@formal@none@1@S@Parsing is also an earlier term for the diagramming of sentences of natural languages, and is still used for the diagramming of [[Inflection|inflected]] languages, such as the [[Romance languages|Romance languages]] or [[Latin]].@@@@1@32@@danf@17-8-2009
10650070@unknown@formal@none@1@S@Parsers can also be constructed as executable specifications of grammars in functional programming languages.@@@@1@14@@danf@17-8-2009
10650080@unknown@formal@none@1@S@Frost, Hafiz and Callaghan have built on the work of others to construct a set of [[higher-order function]]s (called [[parser combinators]]) which allow polynomial time and space complexity top-down parsers to be constructed as executable specifications of ambiguous grammars containing left-recursive productions.@@@@1@42@@danf@17-8-2009
10650090@unknown@formal@none@1@S@The [http://www.cs.uwindsor.ca/~hafiz/proHome.html X-SAIGA] site has more about the algorithms and implementation details.@@@@1@12@@danf@17-8-2009
10650100@unknown@formal@none@1@S@== Human languages ==@@@@1@4@@danf@17-8-2009
10650110@unknown@formal@none@1@S@:''Also see [[:Category:Natural language parsing]]''@@@@1@5@@danf@17-8-2009
10650120@unknown@formal@none@1@S@In some [[machine translation]] and [[natural language processing]] systems, human languages are parsed by computer programs.@@@@1@16@@danf@17-8-2009
10650130@unknown@formal@none@1@S@Human sentences are not easily parsed by programs, as there is substantial [[syntactic ambiguity|ambiguity]] in the structure of human language.@@@@1@20@@danf@17-8-2009
10650140@unknown@formal@none@1@S@In order to parse natural language data, researchers must first agree on the [[grammar]] to be used.@@@@1@17@@danf@17-8-2009
10650150@unknown@formal@none@1@S@The choice of syntax is affected by both [[linguistic]] and computational concerns; for instance some parsing systems use [[lexical functional grammar]], but in general, parsing for grammars of this type is known to be [[NP-complete]].@@@@1@35@@danf@17-8-2009
10650160@unknown@formal@none@1@S@[[Head-driven phrase structure grammar]] is another linguistic formalism which has been popular in the parsing community, but other research efforts have focused on less complex formalisms such as the one used in the Penn [[Treebank]].@@@@1@35@@danf@17-8-2009
10650170@unknown@formal@none@1@S@[[Shallow parsing]] aims to find only the boundaries of major constituents such as noun phrases.@@@@1@15@@danf@17-8-2009
10650180@unknown@formal@none@1@S@Another popular strategy for avoiding linguistic controversy is [[dependency grammar]] parsing.@@@@1@11@@danf@17-8-2009
10650190@unknown@formal@none@1@S@Most modern parsers are at least partly [[statistics|statistical]]; that is, they rely on a corpus of training data which has already been annotated (parsed by hand).@@@@1@26@@danf@17-8-2009
10650200@unknown@formal@none@1@S@This approach allows the system to gather information about the frequency with which various constructions occur in specific contexts.@@@@1@19@@danf@17-8-2009
10650210@unknown@formal@none@1@S@''(See [[machine learning]].)''@@@@1@3@@danf@17-8-2009
10650220@unknown@formal@none@1@S@Approaches which have been used include straightforward [[PCFG]]s (probabilistic context free grammars), [[maximum entropy]], and [[neural net]]s.@@@@1@17@@danf@17-8-2009
10650230@unknown@formal@none@1@S@Most of the more successful systems use ''lexical'' statistics (that is, they consider the identities of the words involved, as well as their [[part of speech]]).@@@@1@26@@danf@17-8-2009
10650240@unknown@formal@none@1@S@However, such systems are vulnerable to [[overfitting]] and require some kind of smoothing to be effective.@@@@1@16@@danf@17-8-2009
10650250@unknown@formal@none@1@S@Parsing algorithms for natural language cannot rely on the grammar having 'nice' properties as with manually-designed grammars for programming languages.@@@@1@20@@danf@17-8-2009
10650260@unknown@formal@none@1@S@As mentioned earlier, some grammar formalisms are very computationally difficult to parse; in general, even if the desired structure is not [[context-free]], some kind of context-free approximation to the grammar is used to perform a first pass.@@@@1@37@@danf@17-8-2009
10650265@unknown@formal@none@1@S@Algorithms which use context-free grammars often rely on some variant of the [[CKY algorithm]], usually with some [[heuristic (computer science)|heuristic]] to prune away unlikely analyses to save time.@@@@1@28@@danf@17-8-2009
10650270@unknown@formal@none@1@S@''(See [[chart parsing]].)''@@@@1@3@@danf@17-8-2009
10650280@unknown@formal@none@1@S@However, some systems trade accuracy for speed using, e.g., linear-time versions of the [[Shift-reduce parsing|shift-reduce]] algorithm.@@@@1@16@@danf@17-8-2009
10650290@unknown@formal@none@1@S@A somewhat recent development has been [[parse reranking]] in which the parser proposes some large number of analyses, and a more complex system selects the best option.@@@@1@27@@danf@17-8-2009
10650300@unknown@formal@none@1@S@The resulting analysis is normally a branching structure in which each part divides into its subparts.@@@@1@16@@danf@17-8-2009
10650310@unknown@formal@none@1@S@== Programming languages ==@@@@1@4@@danf@17-8-2009
10650320@unknown@formal@none@1@S@The most common use of a parser is as a component of a [[compiler]] or [[interpreter]].@@@@1@16@@danf@17-8-2009
10650330@unknown@formal@none@1@S@This parses the [[source code]] of a [[computer programming language]] to create some form of internal representation.@@@@1@17@@danf@17-8-2009
10650340@unknown@formal@none@1@S@Programming languages tend to be specified in terms of a [[context-free grammar]] because fast and efficient parsers can be written for them.@@@@1@22@@danf@17-8-2009
10650350@unknown@formal@none@1@S@Parsers are written by hand or generated by [[parser generator]]s.@@@@1@10@@danf@17-8-2009
10650360@unknown@formal@none@1@S@Context-free grammars are limited in the extent to which they can express all of the requirements of a language.@@@@1@19@@danf@17-8-2009
10650370@unknown@formal@none@1@S@Informally, the reason is that the memory of such a language is limited.@@@@1@13@@danf@17-8-2009
10650380@unknown@formal@none@1@S@The grammar cannot remember the presence of a construct over an arbitrarily long input; this is necessary for a language in which, for example, a name must be declared before it may be referenced.@@@@1@34@@danf@17-8-2009
10650390@unknown@formal@none@1@S@More powerful grammars that can express this constraint, however, cannot be parsed efficiently.@@@@1@13@@danf@17-8-2009
10650400@unknown@formal@none@1@S@Thus, it is a common strategy to create a relaxed parser for a context-free grammar which accepts a superset of the desired language constructs (that is, it accepts some invalid constructs); later, the unwanted constructs can be filtered out.@@@@1@39@@danf@17-8-2009
10650410@unknown@formal@none@1@S@===Overview of process===@@@@1@3@@danf@17-8-2009
10650420@unknown@formal@none@1@S@[[image:Parser_Flow.gif|right|Flow of data in a typical parser]] The following example demonstrates the common case of parsing a computer language with two levels of grammar: lexical and syntactic.@@@@1@27@@danf@17-8-2009
10650430@unknown@formal@none@1@S@The first stage is the token generation, or [[lexical analysis]], by which the input character stream is split into meaningful symbols defined by a grammar of [[regular expression]]s.@@@@1@28@@danf@17-8-2009
10650440@unknown@formal@none@1@S@For example, a calculator program would look at an input such as "<code>12*(3+4)^2</code>" and split it into the tokens <code>12</code>, <code>*</code>, <code>(</code>, <code>3</code>, <code>+</code>, <code>4</code>, <code>)</code>, <code>^</code>, and <code>2</code>, each of which is a meaningful symbol in the context of an arithmetic expression.@@@@1@43@@danf@17-8-2009
10650450@unknown@formal@none@1@S@The lexer would contain rules to tell it that the characters <code>*</code>, <code>+</code>, <code>^</code>, <code>(</code> and <code>)</code> mark the start of a new token, so meaningless tokens like "<code>12*</code>" or "<code>(3</code>" will not be generated.@@@@1@35@@danf@17-8-2009
10650460@unknown@formal@none@1@S@The next stage is parsing or syntactic analysis, which is checking that the tokens form an allowable expression.@@@@1@18@@danf@17-8-2009
10650470@unknown@formal@none@1@S@This is usually done with reference to a [[context-free grammar]] which recursively defines components that can make up an expression and the order in which they must appear.@@@@1@28@@danf@17-8-2009
10650480@unknown@formal@none@1@S@However, not all rules defining programming languages can be expressed by context-free grammars alone, for example type validity and proper declaration of identifiers.@@@@1@23@@danf@17-8-2009
10650490@unknown@formal@none@1@S@These rules can be formally expressed with [[attribute grammar]]s.@@@@1@9@@danf@17-8-2009
10650500@unknown@formal@none@1@S@The final phase is [[Semantic analysis (computer science)|semantic parsing]] or analysis, which is working out the implications of the expression just validated and taking the appropriate action.@@@@1@27@@danf@17-8-2009
10650510@unknown@formal@none@1@S@In the case of a calculator or interpreter, the action is to evaluate the expression or program; a compiler, on the other hand, would generate some kind of code.@@@@1@29@@danf@17-8-2009
10650520@unknown@formal@none@1@S@Attribute grammars can also be used to define these actions.@@@@1@10@@danf@17-8-2009
10650530@unknown@formal@none@1@S@==Types of parsers==@@@@1@3@@danf@17-8-2009
10650540@unknown@formal@none@1@S@The task of the parser is essentially to determine if and how the input can be derived from the start symbol of the grammar.@@@@1@24@@danf@17-8-2009
10650550@unknown@formal@none@1@S@This can be done in essentially two ways:@@@@1@8@@danf@17-8-2009
10650560@unknown@formal@none@1@S@*[[Top-down parsing]] - Top-down parsing can be viewed as an attempt to find left-most derivations of an input-stream by searching for [[parse tree|parse-trees]] using a top-down expansion of the given [[formal grammar]] rules.@@@@1@33@@danf@17-8-2009
10650570@unknown@formal@none@1@S@Tokens are consumed from left to right.@@@@1@7@@danf@17-8-2009
10650580@unknown@formal@none@1@S@Inclusive choice is used to accommodate [[ambiguity]] by expanding all alternative right-hand-sides of grammar rules.@@@@1@15@@danf@17-8-2009
10650590@unknown@formal@none@1@S@[[LL parser]]s and [[recursive-descent parser]]s are examples of top-down parsers, which cannot accommodate [[left recursion | left recursive]] productions.@@@@1@19@@danf@17-8-2009
10650600@unknown@formal@none@1@S@Although it has been believed that simple implementations of top-down parsing cannot accommodate direct and indirect left-recursion and may require exponential time and space complexity while parsing ambiguous [[context-free grammar]]s, a more sophisticated algorithm for top-down parsing has been created by Frost, Hafiz, and Callaghan which accommodates [[ambiguity]] and [[left recursion]] in polynomial time and which generates polynomial-size representations of the potentially-exponential number of parse trees.@@@@1@66@@danf@17-8-2009
10650610@unknown@formal@none@1@S@Their algorithm is able to produce both left-most and right-most derivations of an input with respect to a given CFG.@@@@1@20@@danf@17-8-2009
10650620@unknown@formal@none@1@S@*[[Bottom-up parsing]] - A parser can start with the input and attempt to rewrite it to the start symbol.@@@@1@19@@danf@17-8-2009
10650630@unknown@formal@none@1@S@Intuitively, the parser attempts to locate the most basic elements, then the elements containing these, and so on.@@@@1@18@@danf@17-8-2009
10650640@unknown@formal@none@1@S@[[LR parser]]s are examples of bottom-up parsers.@@@@1@7@@danf@17-8-2009
10650650@unknown@formal@none@1@S@Another term used for this type of parser is Shift-Reduce parsing.@@@@1@11@@danf@17-8-2009
10650660@unknown@formal@none@1@S@Another important distinction is whether the parser generates a ''leftmost derivation'' or a ''rightmost derivation'' (see [[context-free grammar]]).@@@@1@18@@danf@17-8-2009
10650670@unknown@formal@none@1@S@LL parsers will generate a leftmost [[derivation]] and LR parsers will generate a rightmost derivation (although usually in reverse).@@@@1@19@@danf@17-8-2009
10650680@unknown@formal@none@1@S@== Examples of parsers ==@@@@1@5@@danf@17-8-2009
10650690@unknown@formal@none@1@S@=== Top-down parsers ===@@@@1@4@@danf@17-8-2009
10650700@unknown@formal@none@1@S@Some of the parsers that use [[top-down parsing]] include:@@@@1@9@@danf@17-8-2009
10650710@unknown@formal@none@1@S@* [[Recursive descent parser]]@@@@1@4@@danf@17-8-2009
10650720@unknown@formal@none@1@S@* [[LL parser]] ('''L'''eft-to-right, '''L'''eftmost derivation)@@@@1@6@@danf@17-8-2009
10650730@unknown@formal@none@1@S@* [http://www.cs.uwindsor.ca/~hafiz/proHome.html X-SAIGA] - eXecutable SpecificAtIons of GrAmmars.@@@@1@8@@danf@17-8-2009
10650740@unknown@formal@none@1@S@Contains publications related to a top-down parsing algorithm that supports left-recursion and ambiguity in polynomial time and space.@@@@1@18@@danf@17-8-2009
10650750@unknown@formal@none@1@S@=== Bottom-up parsers ===@@@@1@4@@danf@17-8-2009
10650760@unknown@formal@none@1@S@Some of the parsers that use [[bottom-up parsing]] include:@@@@1@9@@danf@17-8-2009
10650770@unknown@formal@none@1@S@* Precedence parser@@@@1@3@@danf@17-8-2009
10650780@unknown@formal@none@1@S@** [[Operator-precedence parser]]@@@@1@3@@danf@17-8-2009
10650790@unknown@formal@none@1@S@** [[Simple precedence parser]]@@@@1@4@@danf@17-8-2009
10650800@unknown@formal@none@1@S@* BC (bounded context) parsing@@@@1@5@@danf@17-8-2009
10650810@unknown@formal@none@1@S@* [[LR parser]] ('''L'''eft-to-right, '''R'''ightmost derivation)@@@@1@6@@danf@17-8-2009
10650820@unknown@formal@none@1@S@** [[SLR parser|Simple LR (SLR) parser]]@@@@1@6@@danf@17-8-2009
10650830@unknown@formal@none@1@S@** [[LALR parser]]@@@@1@3@@danf@17-8-2009
10650840@unknown@formal@none@1@S@** [[Canonical LR parser|Canonical LR (LR(1)) parser]]@@@@1@7@@danf@17-8-2009
10650850@unknown@formal@none@1@S@** [[GLR parser]]@@@@1@3@@danf@17-8-2009
10650860@unknown@formal@none@1@S@* [[CYK algorithm|CYK parser]]@@@@1@4@@danf@17-8-2009
10660010@unknown@formal@none@1@S@<article>Lexical category</article>@@@@1@2@@danf@17-8-2009
10660020@unknown@formal@none@1@S@In [[grammar]], a '''lexical category''' (also '''word class''', '''lexical class''', or in traditional grammar '''part of speech''') is a linguistic category of words (or more precisely ''lexical items''), which is generally defined by the [[syntactic]] or [[morphology (linguistics)|morphological]] behaviour of the lexical item in question.@@@@1@45@@danf@17-8-2009
10660030@unknown@formal@none@1@S@Common linguistic categories include ''noun'' and ''verb'', among others.@@@@1@9@@danf@17-8-2009
10660040@unknown@formal@none@1@S@There are [[open class word|open word classes]], which constantly acquire new members, and [[closed class word|closed word classes]], which acquire new members infrequently if at all.@@@@1@26@@danf@17-8-2009
10660050@unknown@formal@none@1@S@Different languages may have different lexical categories, or they might associate different properties with the same one.@@@@1@17@@danf@17-8-2009
10660060@unknown@formal@none@1@S@For example, [[Japanese language|Japanese]] has at least three classes of adjectives where English has one; Chinese and Japanese have [[measure word]]s while European languages have nothing resembling them; many languages do not have a distinction between adjectives and adverbs, or adjectives and nouns, etc.@@@@1@44@@danf@17-8-2009
10660070@unknown@formal@none@1@S@Many linguists argue that the formal distinctions between parts of speech must be made within the framework of a specific language or language family, and should not be carried over to other languages or language families.@@@@1@36@@danf@17-8-2009
10660080@unknown@formal@none@1@S@==History==@@@@1@1@@danf@17-8-2009
10660090@unknown@formal@none@1@S@The classification of words into lexical categories is found from the earliest moments in the [[history of linguistics]].@@@@1@18@@danf@17-8-2009
10660100@unknown@formal@none@1@S@In the ''[[Nirukta]]'', written in the [[5th century BCE|5th]] or [[6th century BCE]], the [[Sanskrit grammarian]] [[Yāska]] defined four main categories of words:@@@@1@23@@danf@17-8-2009
10660110@unknown@formal@none@1@S@# nāma - [[noun]]s or substantives@@@@1@6@@danf@17-8-2009
10660120@unknown@formal@none@1@S@# ākhyāta - [[verb]]s@@@@1@4@@danf@17-8-2009
10660130@unknown@formal@none@1@S@# upasarga - pre-verbs or [[prefix]]es@@@@1@6@@danf@17-8-2009
10660140@unknown@formal@none@1@S@# nipāta - [[Grammatical particle|particle]]s, invariant words (perhaps [[prepositions]])@@@@1@9@@danf@17-8-2009
10660150@unknown@formal@none@1@S@These four were grouped into two large classes: [[inflection|inflected]] (nouns and verbs) and uninflected (pre-verbs and particles).@@@@1@17@@danf@17-8-2009
10660160@unknown@formal@none@1@S@A century or two later, the [[Classical Greece|Greek]] scholar [[Plato]] wrote in the [[Cratylus (dialogue)|''Cratylus'' dialog]] that "... sentences are, I conceive, a combination of verbs [''rhēma''] and nouns [''ónoma'']".@@@@1@30@@danf@17-8-2009
10660170@unknown@formal@none@1@S@Another class, "conjunctions" (covering [[Grammatical conjunction|conjunction]]s, [[pronoun]]s, and the [[article (grammar)|article]]), was later added by [[Aristotle]].@@@@1@16@@danf@17-8-2009
10660180@unknown@formal@none@1@S@By the end of the [[2nd century BCE]], the classification scheme had been expanded into eight categories, seen in the ''[[Art of Grammar|Tékhnē grammatiké]]'':@@@@1@24@@danf@17-8-2009
10660190@unknown@formal@none@1@S@# Noun: a part of speech inflected for case, signifying a concrete or abstract entity@@@@1@15@@danf@17-8-2009
10660200@unknown@formal@none@1@S@# Verb: a part of speech without case inflection, but inflected for tense, person and number, signifying an activity or process performed or undergone@@@@1@24@@danf@17-8-2009
10660210@unknown@formal@none@1@S@# Participle: a part of speech sharing the features of the verb and the noun@@@@1@15@@danf@17-8-2009
10660220@unknown@formal@none@1@S@# Article: a part of speech inflected for case and preposed or postposed to nouns (the relative pronoun is meant by the postposed article)@@@@1@24@@danf@17-8-2009
10660230@unknown@formal@none@1@S@# Pronoun: a part of speech substitutable for a noun and marked for person@@@@1@14@@danf@17-8-2009
10660240@unknown@formal@none@1@S@# Preposition: a part of speech placed before other words in composition and in syntax@@@@1@15@@danf@17-8-2009
10660250@unknown@formal@none@1@S@# Adverb: a part of speech without inflection, in modification of or in addition to a verb@@@@1@17@@danf@17-8-2009
10660260@unknown@formal@none@1@S@# Conjunction: a part of speech binding together the discourse and filling gaps in its interpretation@@@@1@16@@danf@17-8-2009
10660270@unknown@formal@none@1@S@The [[Latin grammar]]ian [[Priscian]] ([[floruit|fl.]] [[500 CE]]) modified the above eight-fold system, substituting "[[interjection]]" for "article".@@@@1@16@@danf@17-8-2009
10660280@unknown@formal@none@1@S@It wasn't until 1767 that the [[adjective]] was taken as a separate class.@@@@1@13@@danf@17-8-2009
10660290@unknown@formal@none@1@S@Traditional English grammar is patterned after the European tradition above, and is still taught in schools and used in [[dictionaries]].@@@@1@20@@danf@17-8-2009
10660300@unknown@formal@none@1@S@It names eight parts of speech: [[noun]], [[verb]], [[adjective]], [[adverb]], [[pronoun]], [[preposition]], [[Grammatical conjunction|conjunction]], and [[interjection]] (sometimes called an exclamation).@@@@1@20@@danf@17-8-2009
10660310@unknown@formal@none@1@S@==Controversies==@@@@1@1@@danf@17-8-2009
10660320@unknown@formal@none@1@S@Since the Greek grammarians of the 2nd century BCE, parts of speech have been defined by [[morphology (linguistics)|morphological]], [[syntax|syntactic]] and [[semantics|semantic]] criteria.@@@@1@22@@danf@17-8-2009
10660330@unknown@formal@none@1@S@However, there is currently no generally agreed-upon classification scheme that can apply to all languages, or even a set of criteria upon which such a scheme should be based.@@@@1@29@@danf@17-8-2009
10660340@unknown@formal@none@1@S@Linguists recognize that the above list of eight word classes is simplified and artificial.@@@@1@14@@danf@17-8-2009
10660350@unknown@formal@none@1@S@For example, "adverb" is to some extent a catch-all class that includes words with many different functions.@@@@1@17@@danf@17-8-2009
10660360@unknown@formal@none@1@S@Some have even argued that the most basic of category distinctions, that of nouns and verbs, is unfounded, or not applicable to certain languages.@@@@1@24@@danf@17-8-2009
10660370@unknown@formal@none@1@S@==Functional classification==@@@@1@2@@danf@17-8-2009
10660380@unknown@formal@none@1@S@Common ways of delimiting words by function include:@@@@1@8@@danf@17-8-2009
10660390@unknown@formal@none@1@S@* '''[[Open word classes]]:'''@@@@1@4@@danf@17-8-2009
10660400@unknown@formal@none@1@S@**[[adjective]]s@@@@1@1@@danf@17-8-2009
10660410@unknown@formal@none@1@S@**[[adverb]]s@@@@1@1@@danf@17-8-2009
10660420@unknown@formal@none@1@S@**[[interjection]]s@@@@1@1@@danf@17-8-2009
10660430@unknown@formal@none@1@S@**[[noun]]s@@@@1@1@@danf@17-8-2009
10660440@unknown@formal@none@1@S@**[[verb]]s (except [[auxiliary verb]]s)@@@@1@4@@danf@17-8-2009
10660450@unknown@formal@none@1@S@* '''[[Closed word classes]]:'''@@@@1@4@@danf@17-8-2009
10660460@unknown@formal@none@1@S@**[[auxiliary verb]]s@@@@1@2@@danf@17-8-2009
10660470@unknown@formal@none@1@S@**[[clitic]]s@@@@1@1@@danf@17-8-2009
10660480@unknown@formal@none@1@S@**[[coverb]]s@@@@1@1@@danf@17-8-2009
10660490@unknown@formal@none@1@S@**[[Grammatical conjunction|conjunction]]s@@@@1@2@@danf@17-8-2009
10660500@unknown@formal@none@1@S@**[[determiner (class)|Determiner]]s ([[article (grammar)|article]]s, [[quantifier]]s, [[demonstrative adjective]]s, and [[possessive adjective]]s)@@@@1@10@@danf@17-8-2009
10660510@unknown@formal@none@1@S@**[[grammatical particle|particle]]s@@@@1@2@@danf@17-8-2009
10660520@unknown@formal@none@1@S@**[[measure word]]s@@@@1@2@@danf@17-8-2009
10660530@unknown@formal@none@1@S@**[[adposition]]s (prepositions, postpositions, and circumpositions)@@@@1@5@@danf@17-8-2009
10660540@unknown@formal@none@1@S@**[[preverb]]s@@@@1@1@@danf@17-8-2009
10660550@unknown@formal@none@1@S@**[[pronoun]]s@@@@1@1@@danf@17-8-2009
10660560@unknown@formal@none@1@S@**[[Contraction (grammar)|contraction]]s@@@@1@2@@danf@17-8-2009
10660570@unknown@formal@none@1@S@**[[Names of numbers in English#Cardinal numbers|cardinal numbers]]@@@@1@7@@danf@17-8-2009
10660580@unknown@formal@none@1@S@==English==@@@@1@1@@danf@17-8-2009
10660590@unknown@formal@none@1@S@[[English language|English]] frequently does not [[marker (linguistics)|mark]] words as belonging to one part of speech or another.@@@@1@17@@danf@17-8-2009
10660600@unknown@formal@none@1@S@Words like ''neigh'', ''break'', ''outlaw'', ''laser'', ''microwave'' and ''telephone'' might all be either verb forms or nouns.@@@@1@17@@danf@17-8-2009
10660610@unknown@formal@none@1@S@Although ''-ly'' is an adverb marker, not all adverbs end in ''-ly'' and not all words ending in ''-ly'' are adverbs.@@@@1@21@@danf@17-8-2009
10660620@unknown@formal@none@1@S@For instance, ''tomorrow'', ''slow'', ''fast'', ''crosswise'' can all be adverbs, while ''early'', ''friendly'', ''ugly'' are all adjectives (though ''early'' can also function as an adverb).@@@@1@25@@danf@17-8-2009
10660630@unknown@formal@none@1@S@In certain circumstances, even words with primarily grammatical functions can be used as verbs or nouns, as in "We must look to the ''hows'' and not just the ''whys''" or "Miranda was ''to-ing and fro-ing'' and not paying attention".@@@@1@39@@danf@17-8-2009
10670010@unknown@formal@none@1@S@<article>Part-of-speech tagging</article>@@@@1@2@@danf@17-8-2009
10670020@unknown@formal@none@1@S@'''Part-of-speech tagging''' ('''POS tagging''' or '''POST'''), also called '''grammatical tagging''', is the process of marking up each word in a text as corresponding to a particular [[parts of speech|part of speech]], based on both its definition and its context, i.e., its relationship with adjacent and related words in a [[phrase]], [[sentence]], or [[paragraph]].@@@@1@53@@danf@17-8-2009
10670030@unknown@formal@none@1@S@A simplified form of this is commonly taught to school-age children, in the identification of words as [[noun]]s, [[verb]]s, [[adjective]]s, [[adverb]]s, etc.@@@@1@22@@danf@17-8-2009
10670040@unknown@formal@none@1@S@Once performed by hand, POS tagging is now done in the context of [[computational linguistics]], using [[algorithms]] which associate discrete terms, as well as hidden parts of speech, in accordance with a set of descriptive tags.@@@@1@36@@danf@17-8-2009
10670050@unknown@formal@none@1@S@==History==@@@@1@1@@danf@17-8-2009
10670060@unknown@formal@none@1@S@Research on part-of-speech tagging has been closely tied to [[corpus linguistics]].@@@@1@11@@danf@17-8-2009
10670070@unknown@formal@none@1@S@The first major corpus of English for computer analysis was the [[Brown Corpus]] developed at [[Brown University]] by [[Henry Kucera]] and [[Nelson Francis]], in the mid-1960s.@@@@1@26@@danf@17-8-2009
10670080@unknown@formal@none@1@S@It consists of about 1,000,000 words of running English prose text, made up of 500 samples from randomly chosen publications.@@@@1@20@@danf@17-8-2009
10670090@unknown@formal@none@1@S@Each sample is 2,000 or more words (ending at the first sentence-end after 2,000 words, so that the corpus contains only complete sentences).@@@@1@23@@danf@17-8-2009
10670100@unknown@formal@none@1@S@The [[Brown Corpus]] was painstakingly "tagged" with part-of-speech markers over many years.@@@@1@12@@danf@17-8-2009
10670110@unknown@formal@none@1@S@A first approximation was done with a program by Greene and Rubin, which consisted of a huge handmade list of what categories could co-occur at all.@@@@1@26@@danf@17-8-2009
10670120@unknown@formal@none@1@S@For example, article then noun can occur, but article then verb (arguably) cannot.@@@@1@13@@danf@17-8-2009
10670130@unknown@formal@none@1@S@The program got about 70% correct.@@@@1@6@@danf@17-8-2009
10670140@unknown@formal@none@1@S@Its results were repeatedly reviewed and corrected by hand, and later users sent in errata, so that by the late 70s the tagging was nearly perfect (allowing for some cases even human speakers might not agree on).@@@@1@37@@danf@17-8-2009
10670150@unknown@formal@none@1@S@This corpus has been used for innumerable studies of word-frequency and of part-of-speech, and inspired the development of similar "tagged" corpora in many other languages.@@@@1@25@@danf@17-8-2009
10670160@unknown@formal@none@1@S@Statistics derived by analyzing it formed the basis for most later part-of-speech tagging systems, such as CLAWS and [[VOLSUNGA]].@@@@1@19@@danf@17-8-2009
10670170@unknown@formal@none@1@S@As of 2005, however, it has been superseded by larger corpora such as the 100 million word [[British National Corpus]].@@@@1@20@@danf@17-8-2009
10670180@unknown@formal@none@1@S@For some time, part-of-speech tagging was considered an inseparable part of [[natural language processing]], because there are certain cases where the correct part of speech cannot be decided without understanding the [[semantics]] or even the [[pragmatics]] of the context.@@@@1@39@@danf@17-8-2009
10670190@unknown@formal@none@1@S@This is extremely expensive, especially because analyzing the higher levels is much harder when multiple part-of-speech possibilities must be considered for each word.@@@@1@23@@danf@17-8-2009
10670200@unknown@formal@none@1@S@In the mid 1980s, researchers in Europe began to use [[hidden Markov model]]s (HMMs) to disambiguate parts of speech, when working to tag the [[Lancaster-Oslo-Bergen Corpus]] of British English.@@@@1@29@@danf@17-8-2009
10670210@unknown@formal@none@1@S@HMMs involve counting cases (such as from the Brown Corpus), and making a table of the probabilities of certain sequences.@@@@1@20@@danf@17-8-2009
10670220@unknown@formal@none@1@S@For example, once you've seen an article such as 'the', perhaps the next word is a noun 40% of the time, an adjective 40%, and a number 20%.@@@@1@28@@danf@17-8-2009
10670230@unknown@formal@none@1@S@Knowing this, a program can decide that "can" in "the can" is far more likely to be a noun than a verb or a modal.@@@@1@25@@danf@17-8-2009
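The counting step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `transition_probabilities`, the tag labels, and the five-sentence toy corpus are hypothetical and are not taken from the Brown Corpus or from any particular tagger.

```python
from collections import Counter, defaultdict


def transition_probabilities(tagged_sentences):
    """Estimate P(next_tag | tag) by counting adjacent tag pairs."""
    pair_counts = defaultdict(Counter)
    for sentence in tagged_sentences:
        tags = [tag for _word, tag in sentence]
        for prev_tag, next_tag in zip(tags, tags[1:]):
            pair_counts[prev_tag][next_tag] += 1
    table = {}
    for prev_tag, counter in pair_counts.items():
        total = sum(counter.values())
        table[prev_tag] = {tag: count / total for tag, count in counter.items()}
    return table


# Hypothetical hand-tagged toy corpus (a real table would be built
# from a large tagged corpus).
toy_corpus = [
    [("the", "ART"), ("can", "NOUN"), ("fell", "VERB")],
    [("the", "ART"), ("old", "ADJ"), ("can", "NOUN")],
    [("the", "ART"), ("dog", "NOUN"), ("can", "MODAL"), ("run", "VERB")],
    [("a", "ART"), ("red", "ADJ"), ("can", "NOUN")],
    [("the", "ART"), ("two", "NUM"), ("dogs", "NOUN")],
]

probs = transition_probabilities(toy_corpus)
print(probs["ART"])
```

In this toy corpus an article is followed by a noun in 2 of 5 cases, by an adjective in 2 of 5, and by a number in 1 of 5, and never by a verb or modal, which is why a tagger using such a table would prefer the noun reading of "can" after "the".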
10670240@unknown@formal@none@1@S@The same method can of course be used to benefit from knowledge about following words.@@@@1@15@@danf@17-8-2009
10670250@unknown@formal@none@1@S@More advanced ("higher order") HMMs learn the probabilities not only of pairs, but triples or even larger sequences.@@@@1@18@@danf@17-8-2009
10670260@unknown@formal@none@1@S@So, for example, if you've just seen an article and a verb, the next item may be very likely a preposition, article, or noun, but much less likely another verb.@@@@1@30@@danf@17-8-2009
10670270@unknown@formal@none@1@S@When several ambiguous words occur together, the possibilities multiply.@@@@1@9@@danf@17-8-2009
10670280@unknown@formal@none@1@S@However, it is easy to enumerate every combination and to assign a relative probability to each one, by multiplying together the probabilities of each choice in turn.@@@@1@27@@danf@17-8-2009
10670290@unknown@formal@none@1@S@The combination with highest probability is then chosen.@@@@1@8@@danf@17-8-2009
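The enumerate-and-multiply procedure just described can be sketched as follows. This is a minimal illustration, not the CLAWS implementation: the function name, the `<s>` start marker, and every probability in the toy tables are assumptions invented for the example.

```python
from itertools import product


def best_tagging_by_enumeration(words, candidates, transition, emission, start="<s>"):
    """Score every possible tag sequence and return the most probable one.

    The score of a sequence is the product, over all positions, of
    P(tag | previous tag) * P(word | tag). Missing entries count as 0.
    """
    best_tags, best_score = None, 0.0
    for tags in product(*(candidates[word] for word in words)):
        score, prev = 1.0, start
        for word, tag in zip(words, tags):
            score *= transition.get((prev, tag), 0.0) * emission.get((word, tag), 0.0)
            prev = tag
        if score > best_score:
            best_tags, best_score = tags, score
    return best_tags, best_score


# Hypothetical toy tables; real values would come from corpus counts.
candidates = {"the": ["ART"], "can": ["NOUN", "VERB", "MODAL"], "rusts": ["VERB", "NOUN"]}
transition = {
    ("<s>", "ART"): 0.8,
    ("ART", "NOUN"): 0.4, ("ART", "VERB"): 0.01, ("ART", "MODAL"): 0.001,
    ("NOUN", "VERB"): 0.5, ("NOUN", "NOUN"): 0.2,
    ("VERB", "VERB"): 0.1, ("VERB", "NOUN"): 0.3,
    ("MODAL", "VERB"): 0.7, ("MODAL", "NOUN"): 0.05,
}
emission = {
    ("the", "ART"): 0.6,
    ("can", "NOUN"): 0.01, ("can", "VERB"): 0.005, ("can", "MODAL"): 0.3,
    ("rusts", "VERB"): 0.002, ("rusts", "NOUN"): 0.001,
}

enumerated_tags, enumerated_score = best_tagging_by_enumeration(
    ["the", "can", "rusts"], candidates, transition, emission
)
print(enumerated_tags, enumerated_score)
```

The number of sequences examined is the product of the number of candidate tags per word (here 1 × 3 × 2 = 6), so the cost grows exponentially with the length of an ambiguous run.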
10670300@unknown@formal@none@1@S@The European group developed CLAWS, a tagging program that did exactly this, and achieved accuracy in the 93-95% range.@@@@1@19@@danf@17-8-2009
10670310@unknown@formal@none@1@S@It is worth remembering, as [[Eugene Charniak]] points out in ''Statistical techniques for natural language parsing'' [http://www.cs.brown.edu/people/ec/home.html], that merely assigning the most common tag to each known word and the tag "proper noun" to all unknowns, will approach 90% accuracy because many words are unambiguous.@@@@1@45@@danf@17-8-2009
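The baseline mentioned above (most common tag for known words, "proper noun" for unknown ones) is simple enough to sketch directly. The function names, the `PROPN` label, the lower-casing of words, and the tiny training list are assumptions made for illustration only.

```python
from collections import Counter, defaultdict


def train_baseline(tagged_words):
    """Map each known word to the tag it carries most often in the training data."""
    counts = defaultdict(Counter)
    for word, tag in tagged_words:
        counts[word.lower()][tag] += 1
    return {word: counter.most_common(1)[0][0] for word, counter in counts.items()}


def tag_baseline(words, most_common_tag, unknown_tag="PROPN"):
    """Tag known words with their most common tag, unknown words as proper nouns."""
    return [(word, most_common_tag.get(word.lower(), unknown_tag)) for word in words]


# Hypothetical training data.
training = [
    ("the", "ART"), ("can", "MODAL"), ("can", "MODAL"), ("can", "NOUN"),
    ("dog", "NOUN"), ("runs", "VERB"),
]
model = train_baseline(training)
baseline_result = tag_baseline(["The", "can", "Zanzibar"], model)
print(baseline_result)
```

Note that this baseline ignores context entirely: in the toy data "can" is always tagged as a modal, even after "the", which is exactly the kind of error the sequence-based methods are meant to correct.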
10670320@unknown@formal@none@1@S@CLAWS pioneered the field of HMM-based part of speech tagging, but was quite expensive since it enumerated all possibilities.@@@@1@19@@danf@17-8-2009
10670330@unknown@formal@none@1@S@It sometimes had to resort to backup methods when there were simply too many (the [[Brown Corpus]] contains a case with 17 ambiguous words in a row, and there are words such as "still" that can represent as many as 7 distinct parts of speech).@@@@1@45@@danf@17-8-2009
10670340@unknown@formal@none@1@S@In 1987, [[Steve DeRose]] and [[Ken Church]] independently developed [[dynamic programming]] algorithms to solve the same problem in vastly less time.@@@@1@21@@danf@17-8-2009
10670350@unknown@formal@none@1@S@Their methods were similar to the [[Viterbi algorithm]] known for some time in other fields.@@@@1@15@@danf@17-8-2009
10670360@unknown@formal@none@1@S@DeRose used a table of pairs, while Church used a table of triples and an ingenious method of estimating the values for triples that were rare or nonexistent in the Brown Corpus (actual measurement of triple probabilities would require a much larger corpus).@@@@1@43@@danf@17-8-2009
10670370@unknown@formal@none@1@S@Both methods achieved accuracy over 95%.@@@@1@6@@danf@17-8-2009
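The dynamic programming idea can be illustrated with a textbook Viterbi search over a table of tag pairs. This sketch is not a reconstruction of DeRose's or Church's programs; the function name, the `<s>` start marker, and all probabilities in the toy tables are hypothetical, and the input is assumed to be a non-empty list of words.

```python
def viterbi(words, tags, transition, emission, start="<s>"):
    """Return the most probable tag sequence and its probability.

    best[i][t] holds (probability of the best path that ends with tag t
    at position i, the previous tag on that path).
    """
    best = [{
        t: (transition.get((start, t), 0.0) * emission.get((words[0], t), 0.0), None)
        for t in tags
    }]
    for i in range(1, len(words)):
        column = {}
        for t in tags:
            emit = emission.get((words[i], t), 0.0)
            column[t] = max(
                ((best[i - 1][p][0] * transition.get((p, t), 0.0) * emit, p) for p in tags),
                key=lambda pair: pair[0],
            )
        best.append(column)
    # Follow the back-pointers from the best final tag.
    last = max(tags, key=lambda t: best[-1][t][0])
    path = [last]
    for i in range(len(words) - 1, 0, -1):
        path.append(best[i][path[-1]][1])
    path.reverse()
    return path, best[-1][last][0]


# Hypothetical toy tables; real values would come from corpus counts.
tagset = ["ART", "NOUN", "VERB", "MODAL"]
transition = {
    ("<s>", "ART"): 0.8,
    ("ART", "NOUN"): 0.4, ("ART", "VERB"): 0.01, ("ART", "MODAL"): 0.001,
    ("NOUN", "VERB"): 0.5, ("NOUN", "NOUN"): 0.2,
    ("VERB", "VERB"): 0.1, ("VERB", "NOUN"): 0.3,
    ("MODAL", "VERB"): 0.7, ("MODAL", "NOUN"): 0.05,
}
emission = {
    ("the", "ART"): 0.6,
    ("can", "NOUN"): 0.01, ("can", "VERB"): 0.005, ("can", "MODAL"): 0.3,
    ("rusts", "VERB"): 0.002, ("rusts", "NOUN"): 0.001,
}

viterbi_path, viterbi_prob = viterbi(["the", "can", "rusts"], tagset, transition, emission)
print(viterbi_path, viterbi_prob)
```

Because each position only keeps the best path per tag, the work grows linearly with sentence length (times the square of the tagset size) rather than exponentially, while still finding the same highest-probability sequence that full enumeration would.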
10670380@unknown@formal@none@1@S@DeRose's 1990 dissertation at [[Brown University]] included analyses of the specific error types, probabilities, and other related data, and replicated his work for Greek, where it proved similarly effective.@@@@1@29@@danf@17-8-2009
10670390@unknown@formal@none@1@S@These findings were surprisingly disruptive to the field of [[Natural Language Processing]].@@@@1@12@@danf@17-8-2009
10670400@unknown@formal@none@1@S@The accuracy reported was higher than the typical accuracy of very sophisticated algorithms that integrated part of speech choice with many higher levels of linguistic analysis: syntax, morphology, semantics, and so on.@@@@1@32@@danf@17-8-2009
10670410@unknown@formal@none@1@S@CLAWS, DeRose's and Church's methods did fail for some of the known cases where semantics is required, but those proved negligibly rare.@@@@1@22@@danf@17-8-2009
10670420@unknown@formal@none@1@S@This convinced many in the field that part-of-speech tagging could usefully be separated out from the other levels of processing; this in turn simplified the theory and practice of computerized language analysis, and encouraged researchers to find ways to separate out other pieces as well.@@@@1@45@@danf@17-8-2009
10670430@unknown@formal@none@1@S@Markov Models are now the standard method for part-of-speech assignment.@@@@1@10@@danf@17-8-2009
10670440@unknown@formal@none@1@S@The methods already discussed involve working from a pre-existing corpus to learn tag probabilities.@@@@1@14@@danf@17-8-2009
10670450@unknown@formal@none@1@S@It is, however, also possible to [[Bootstrapping (linguistics)|bootstrap]] using "unsupervised" tagging.@@@@1@11@@danf@17-8-2009
10670460@unknown@formal@none@1@S@Unsupervised tagging techniques use an untagged corpus for their training data and produce the tagset by induction.@@@@1@17@@danf@17-8-2009
10670470@unknown@formal@none@1@S@That is, they observe patterns in word use, and derive part-of-speech categories themselves.@@@@1@13@@danf@17-8-2009
10670480@unknown@formal@none@1@S@For example, statistics readily reveal that "the", "a", and "an" occur in similar contexts, while "eat" occurs in very different ones.@@@@1@21@@danf@17-8-2009
10670490@unknown@formal@none@1@S@With sufficient iteration, similarity classes of words emerge that are remarkably similar to those human linguists would expect; and the differences themselves sometimes suggest valuable new insights.@@@@1@27@@danf@17-8-2009
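The observation that words of the same category occur in similar contexts can be made concrete with a small distributional sketch. It is only a first step toward category induction, not a full unsupervised tagger; the function names, the boundary markers `<s>` and `</s>`, and the five toy sentences are assumptions made for the example.

```python
from collections import Counter, defaultdict
from math import sqrt


def context_vectors(sentences):
    """Count, for every word, which words appear immediately to its left and right."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        padded = ["<s>"] + sentence + ["</s>"]
        for i in range(1, len(padded) - 1):
            vectors[padded[i]][("L", padded[i - 1])] += 1
            vectors[padded[i]][("R", padded[i + 1])] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[key] * v[key] for key in u if key in v)
    norm = sqrt(sum(x * x for x in u.values())) * sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0


# Hypothetical untagged toy corpus.
sentences = [
    ["we", "eat", "the", "apple"],
    ["they", "eat", "a", "pear"],
    ["the", "dog", "ran"],
    ["a", "dog", "ran"],
    ["we", "eat", "an", "apple"],
]
vectors = context_vectors(sentences)
sim_the_a = cosine(vectors["the"], vectors["a"])
sim_the_eat = cosine(vectors["the"], vectors["eat"])
print(sim_the_a, sim_the_eat)
```

On this toy data "the" and "a" share most of their neighbours and score far higher than "the" and "eat", which share none. Clustering words by such similarity scores is one way word classes could be induced without a tagged corpus.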
10670500@unknown@formal@none@1@S@Both supervised and unsupervised tagging can be further subdivided into rule-based, stochastic, and neural approaches.@@@@1@15@@danf@17-8-2009
10670510@unknown@formal@none@1@S@Some current major algorithms for '''part-of-speech tagging''' include the [[Viterbi algorithm]], [[Brill Tagger]], and the [[Baum-Welch algorithm]] (also known as the forward-backward algorithm).@@@@1@23@@danf@17-8-2009
10670520@unknown@formal@none@1@S@[[Hidden Markov model]] and [[visible Markov model]] taggers can both be implemented using the [[Viterbi algorithm]].@@@@1@16@@danf@17-8-2009
10680010@unknown@formal@none@1@S@<article>Pattern recognition</article>@@@@1@2@@danf@17-8-2009
10680020@unknown@formal@none@1@S@'''Pattern recognition''' is a sub-topic of [[machine learning]].@@@@1@8@@danf@17-8-2009
10680030@unknown@formal@none@1@S@It can be defined as@@@@1@5@@danf@17-8-2009
10680040@unknown@formal@none@1@S@:"the act of taking in raw data and taking an action based on the [[Category (taxonomy)|category]] of the data".@@@@1@19@@danf@17-8-2009
10680050@unknown@formal@none@1@S@Most research in pattern recognition is about methods for [[supervised learning]] and [[unsupervised learning]].@@@@1@14@@danf@17-8-2009
10680060@unknown@formal@none@1@S@Pattern recognition aims to classify [[data]] ([[pattern]]s) based on either ''[[A priori and a posteriori (philosophy)|a priori]]'' knowledge or on [[statistics|statistical]] information extracted from the patterns.@@@@1@26@@danf@17-8-2009
10680070@unknown@formal@none@1@S@The patterns to be classified are usually groups of measurements or observations, defining points in an appropriate [[space (mathematics)|multidimensional space]].@@@@1@20@@danf@17-8-2009
10680080@unknown@formal@none@1@S@This is in contrast to '''[[pattern matching]]''', where the pattern is rigidly specified.@@@@1@13@@danf@17-8-2009
10680090@unknown@formal@none@1@S@==Overview==@@@@1@1@@danf@17-8-2009
10680100@unknown@formal@none@1@S@A complete pattern recognition system consists of a [[sensor]] that gathers the observations to be classified or described; a [[feature extraction]] mechanism that computes numeric or symbolic information from the observations; and a [[statistical classification|classification]] or description scheme that does the actual job of classifying or describing observations, relying on the extracted features.@@@@1@53@@danf@17-8-2009
10680110@unknown@formal@none@1@S@The classification or description scheme is usually based on the availability of a set of patterns that have already been classified or described.@@@@1@23@@danf@17-8-2009
10680120@unknown@formal@none@1@S@This set of patterns is termed the [[training set]] and the resulting learning strategy is characterized as [[supervised learning]].@@@@1@19@@danf@17-8-2009
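As a small illustration of learning from a training set, the following sketch trains a nearest-centroid classifier on already-labelled feature vectors. The choice of nearest-centroid classification, the function names, and the four training points are assumptions made for the example; the text does not prescribe any particular classifier.

```python
from collections import defaultdict


def train_centroids(training_set):
    """Compute the mean feature vector (centroid) of each labelled class."""
    sums = {}
    counts = defaultdict(int)
    for features, label in training_set:
        if label not in sums:
            sums[label] = [0.0] * len(features)
        sums[label] = [s + f for s, f in zip(sums[label], features)]
        counts[label] += 1
    return {label: [s / counts[label] for s in total] for label, total in sums.items()}


def classify(features, centroids):
    """Assign the label whose centroid is closest in squared Euclidean distance."""
    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda label: squared_distance(features, centroids[label]))


# Hypothetical training set: (feature vector, class label) pairs.
training_set = [
    ([1.0, 1.0], "A"), ([1.0, 2.0], "A"),
    ([5.0, 5.0], "B"), ([6.0, 5.0], "B"),
]
centroids = train_centroids(training_set)
label_near_a = classify([2.0, 2.0], centroids)
label_near_b = classify([5.0, 4.0], centroids)
print(centroids, label_near_a, label_near_b)
```

Here the feature vectors stand in for the output of the feature extraction stage, and the labelled pairs play the role of the training set.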
10680130@unknown@formal@none@1@S@Learning can also be [[unsupervised learning|unsupervised]], in the sense that the system is not given an ''a priori'' labeling of patterns; instead it establishes the classes itself based on the statistical regularities of the patterns.@@@@1@35@@danf@17-8-2009
10680140@unknown@formal@none@1@S@The classification or description scheme usually uses one of the following approaches: [[statistical classification|statistical]] (or decision theoretic) or [[syntactic pattern recognition|syntactic]] (or structural).@@@@1@23@@danf@17-8-2009
10680150@unknown@formal@none@1@S@Statistical pattern recognition is based on statistical characterisations of patterns, assuming that the patterns are generated by a [[probabilistic]] system.@@@@1@20@@danf@17-8-2009
10680160@unknown@formal@none@1@S@Syntactical (or structural) pattern recognition is based on the structural interrelationships of features.@@@@1@13@@danf@17-8-2009
10680170@unknown@formal@none@1@S@A wide range of algorithms can be applied for pattern recognition, from very simple [[Naive Bayes classifier|Bayesian classifiers]] to much more powerful [[Artificial neural network|neural networks]].@@@@1@26@@danf@17-8-2009
10680180@unknown@formal@none@1@S@An intriguing problem in pattern recognition yet to be solved is the relationship between the problem to be solved (data to be classified) and the performance of various pattern recognition algorithms (classifiers).@@@@1@32@@danf@17-8-2009
10680190@unknown@formal@none@1@S@Pattern recognition is more complex when templates are used to generate variants.@@@@1@12@@danf@17-8-2009
10680200@unknown@formal@none@1@S@For example, in English, sentences often follow the "N-VP" (noun - verb phrase) pattern, but some knowledge of the English language is required to detect the pattern.@@@@1@27@@danf@17-8-2009
10680210@unknown@formal@none@1@S@Pattern recognition is studied in many fields, including [[psychology]], [[ethology]], and [[computer science]].@@@@1@13@@danf@17-8-2009
10680220@unknown@formal@none@1@S@[[Holographic associative memory]] is another type of pattern matching scheme where a small target pattern can be searched for in a large set of learned patterns based on cognitive meta-weight.@@@@1@30@@danf@17-8-2009
10680230@unknown@formal@none@1@S@==Uses==@@@@1@1@@danf@17-8-2009
10680240@unknown@formal@none@1@S@Within medical science pattern recognition creates the basis for [[computer-aided diagnosis]] (CAD) systems.@@@@1@13@@danf@17-8-2009
10680250@unknown@formal@none@1@S@CAD describes a procedure that supports the doctor's interpretations and findings.@@@@1@11@@danf@17-8-2009
10680260@unknown@formal@none@1@S@Typical applications are automatic [[speech recognition]], [[document classification|classification of text into several categories]] (e.g. spam/non-spam email messages), the [[handwriting recognition|automatic recognition of handwritten postal codes]] on postal envelopes, or the [[facial recognition system|automatic recognition of images]] of human faces.@@@@1@39@@danf@17-8-2009
10680270@unknown@formal@none@1@S@The last two examples form the subtopic [[image analysis]] of pattern recognition that deals with digital images as input to pattern recognition systems.@@@@1@23@@danf@17-8-2009
10690010@unknown@formal@none@1@S@<article>Phrase</article>@@@@1@1@@danf@17-8-2009
10690020@unknown@formal@none@1@S@In [[grammar]], a '''phrase''' is a group of [[word]]s that functions as a single unit in the [[syntax]] of a [[Sentence (linguistics)|sentence]].@@@@1@22@@danf@17-8-2009
10690030@unknown@formal@none@1@S@For example ''the house at the end of the street'' (example 1) is a phrase.@@@@1@15@@danf@17-8-2009
10690040@unknown@formal@none@1@S@It acts like a noun.@@@@1@5@@danf@17-8-2009
10690050@unknown@formal@none@1@S@It contains the phrase ''at the end of the street'' (example 2), a prepositional phrase which acts like an adjective.@@@@1@20@@danf@17-8-2009
10690060@unknown@formal@none@1@S@Example 2 could be replaced by ''white'', to make the phrase ''the white house''.@@@@1@14@@danf@17-8-2009
10690070@unknown@formal@none@1@S@Examples 1 and 2 contain the phrase ''the end of the street'' (example 3) which acts like a noun.@@@@1@19@@danf@17-8-2009
10690080@unknown@formal@none@1@S@It could be replaced by ''the cross-roads'' to give ''the house at the cross-roads''.@@@@1@14@@danf@17-8-2009
10690090@unknown@formal@none@1@S@Most phrases have a central word which defines the type of phrase.@@@@1@12@@danf@17-8-2009
10690100@unknown@formal@none@1@S@This word is called the [[head (linguistics)|head]] of the phrase.@@@@1@10@@danf@17-8-2009
10690110@unknown@formal@none@1@S@In English the head is often the first word of the phrase.@@@@1@12@@danf@17-8-2009
10690120@unknown@formal@none@1@S@Some phrases, however, can be headless.@@@@1@6@@danf@17-8-2009
10690130@unknown@formal@none@1@S@For example, ''the rich'' is a noun phrase composed of a determiner and an adjective, but no noun.@@@@1@18@@danf@17-8-2009
10690140@unknown@formal@none@1@S@Phrases may be classified by the type of head they take:@@@@1@11@@danf@17-8-2009
10690150@unknown@formal@none@1@S@*[[Prepositional phrase]] (PP) with a [[preposition]] as head (e.g. ''in love'', ''over the rainbow'').@@@@1@14@@danf@17-8-2009
10690160@unknown@formal@none@1@S@Languages that use [[postposition]]s instead have [[postpositional phrase]]s.@@@@1@8@@danf@17-8-2009
10690170@unknown@formal@none@1@S@The two types are sometimes collectively referred to as [[adpositional phrase]]s.@@@@1@11@@danf@17-8-2009
10690180@unknown@formal@none@1@S@*[[Noun phrase]] (NP) with a [[noun]] as head (e.g. ''the black cat'', ''a cat on the mat'')@@@@1@17@@danf@17-8-2009
10690190@unknown@formal@none@1@S@*[[Verb phrase]] (VP) with a [[verb]] as head (e.g. ''eat cheese'', ''jump up and down'')@@@@1@15@@danf@17-8-2009
10690200@unknown@formal@none@1@S@*[[Adjectival phrase]] with an [[adjective]] as head (e.g. ''full of toys'')@@@@1@11@@danf@17-8-2009
10690210@unknown@formal@none@1@S@*[[Adverbial phrase]] with an [[adverb]] as head (e.g. ''very carefully'')@@@@1@10@@danf@17-8-2009
10690220@unknown@formal@none@1@S@== Formal definition ==@@@@1@4@@danf@17-8-2009
10690230@unknown@formal@none@1@S@A '''phrase''' is a [[syntax|syntactic]] structure which has syntactic properties derived from its [[head (linguistics)|head]].@@@@1@15@@danf@17-8-2009
10690240@unknown@formal@none@1@S@== Complexity ==@@@@1@3@@danf@17-8-2009
10690250@unknown@formal@none@1@S@A complex phrase consists of several words, whereas a simple phrase consists of only one word.@@@@1@16@@danf@17-8-2009
10690260@unknown@formal@none@1@S@This terminology is especially often used with [[verb]] phrases:@@@@1@9@@danf@17-8-2009
10690270@unknown@formal@none@1@S@* simple past and present are simple verbs, which require just one verb@@@@1@13@@danf@17-8-2009
10690280@unknown@formal@none@1@S@* complex verbs have one or two [[grammatical aspect|aspect]]s added, hence require two or three additional words@@@@1@17@@danf@17-8-2009
10690290@unknown@formal@none@1@S@"Complex", which is phrase-level, is often confused with "[[compound (linguistics)|compound]]", which is [[word]]-level.@@@@1@13@@danf@17-8-2009
10690300@unknown@formal@none@1@S@However, there are certain phenomena that formally seem to be phrases but semantically are more like compounds, like "women's magazines", which has the form of a possessive noun phrase, but which refers (just like a compound) to one specific [[lexeme]] (i.e. a magazine for women and not some magazine owned by a woman).@@@@1@53@@danf@17-8-2009
10690310@unknown@formal@none@1@S@== Semiotic approaches to the concept of "phrase" ==@@@@1@9@@danf@17-8-2009
10690320@unknown@formal@none@1@S@In more [[semiotic]] approaches to language, such as the more cognitivist versions of [[construction grammar]], a phrasal structure is not only a certain formal combination of word types whose features are inherited from the head.@@@@1@35@@danf@17-8-2009
10690330@unknown@formal@none@1@S@Here each phrasal structure also expresses some type of [[concept]]ual content, be it specific or abstract.@@@@1@16@@danf@17-8-2009
10700010@unknown@formal@none@1@S@<article>Portuguese language</article>@@@@1@2@@danf@17-8-2009
10700020@unknown@formal@none@1@S@'''Portuguese''' (''língua portuguesa'') is a [[Romance language]] that originated in what is now [[Galicia (Spain)]] and [[Portugal|northern Portugal]] from the [[Latin language|Latin]] spoken by [[Romanization (cultural)|romanized]] [[Pre-Roman peoples of the Iberian Peninsula]] (namely the [[Gallaeci]], the [[Lusitanians]], the [[Celtici]] and the [[Conii]]) about 2000 years ago.@@@@1@46@@danf@17-8-2009
10700030@unknown@formal@none@1@S@It spread worldwide in the 15th and 16th centuries as Portugal established a [[Portuguese Empire|colonial and commercial empire]] (1415–1999) which spanned from [[Brazil]] in the [[Americas]] to [[Goa]] in [[India]] and [[Macau]] in [[China]]; in fact, it was used exclusively on the island of [[Sri Lanka]] as the [[lingua franca]] for almost 350 years.@@@@1@54@@danf@17-8-2009
10700040@unknown@formal@none@1@S@During that time, many [[Portuguese Creole|creole languages based on Portuguese]] also appeared around the world, especially in [[Africa]], [[Asia]], and the [[Caribbean]].@@@@1@22@@danf@17-8-2009
10700050@unknown@formal@none@1@S@Today it is one of the world's major languages, [[List of languages by number of native speakers|ranked 6th]] according to number of native speakers (approximately 177 million).@@@@1@27@@danf@17-8-2009
10700060@unknown@formal@none@1@S@It is the language with the largest number of speakers in [[South America]], spoken by nearly all of Brazil's population, which amounts to over 51% of the continent's population even though it is the only Portuguese-speaking nation in [[the Americas]].@@@@1@40@@danf@17-8-2009
10700070@unknown@formal@none@1@S@It is also a major lingua franca in Portugal's former colonial possessions in Africa.@@@@1@14@@danf@17-8-2009
10700080@unknown@formal@none@1@S@It is the official language of ten countries (see the table on the right), also being co-official with [[Spanish language|Spanish]] and [[French language|French]] in [[Equatorial Guinea]], with [[Standard Cantonese|Cantonese]] [[Chinese language|Chinese]] in the Chinese special administrative region of [[Macau]], and with [[Tetum]] in [[East Timor]].@@@@1@45@@danf@17-8-2009
10700090@unknown@formal@none@1@S@There are sizable communities of Portuguese-speakers in various regions of North America, notably in the [[United States]] ([[New Jersey]], [[New England]] and south [[Florida]]) and in [[Ontario]], [[Canada]].@@@@1@28@@danf@17-8-2009
10700100@unknown@formal@none@1@S@[[Spain|Spanish]] author [[Miguel de Cervantes]] once called Portuguese "the sweet language", while Brazilian writer [[Olavo Bilac]] poetically described it as ''a última flor do Lácio, inculta e bela'': "the last flower of [[Latium]], wild and beautiful".@@@@1@36@@danf@17-8-2009
10700110@unknown@formal@none@1@S@==Geographic distribution==@@@@1@2@@danf@17-8-2009
10700120@unknown@formal@none@1@S@Today, Portuguese is the [[official language]] of [[Angola]], [[Brazil]], [[Cape Verde]], [[Guinea-Bissau]], [[Portugal]], [[São Tomé and Príncipe]] and [[Mozambique]].@@@@1@19@@danf@17-8-2009
10700130@unknown@formal@none@1@S@It is also one of the official languages of [[Equatorial Guinea]] (with [[Spanish language|Spanish]] and [[French language|French]]), the [[Special Administrative Region of the People's Republic of China|Chinese special administrative region]] of [[Macau]] (with [[Chinese language|Chinese]]), and [[East Timor]], (with [[Tetum]]).@@@@1@40@@danf@17-8-2009
10700140@unknown@formal@none@1@S@It is a [[First language|native language]] of most of the population in Portugal (100%), Brazil (99%), Angola (60%), and São Tomé and Príncipe (50%), and it is spoken by a [[plurality]] of the population of Mozambique (40%), though only 6.5% are native speakers.@@@@1@43@@danf@17-8-2009
10700150@unknown@formal@none@1@S@No data is available for Cape Verde, but almost all the population is bilingual, and the monolingual population speaks [[Cape Verdean Creole]].@@@@1@22@@danf@17-8-2009
10700160@unknown@formal@none@1@S@Small Portuguese-speaking communities subsist in former overseas colonies of Portugal such as Macau, where it is spoken as a first language by 0.6% of the population, and East Timor.@@@@1@29@@danf@17-8-2009
10700170@unknown@formal@none@1@S@[[Uruguay]] gave Portuguese an equal status to Spanish in its educational system at the north border with Brazil.@@@@1@18@@danf@17-8-2009
10700180@unknown@formal@none@1@S@In the rest of the country, it's taught as an obligatory subject beginning by the 6th grade.@@@@1@17@@danf@17-8-2009
10700190@unknown@formal@none@1@S@It is also spoken by substantial immigrant communities, though not official, in [[Andorra]], [[France]], [[Luxembourg]], [[Jersey]] (with a statistically significant Portuguese-speaking community of approximately 10,000 people), [[Paraguay]], [[Namibia]], [[South Africa]], [[Switzerland]], [[Venezuela]] and in the [[U.S.]] states of [[California]], [[Connecticut]], [[Florida]], [[Massachusetts]], [[New Jersey]], [[New York]] and [[Rhode Island]].@@@@1@49@@danf@17-8-2009
10700200@unknown@formal@none@1@S@In some parts of India, such as [[Goa]] and [[Daman and Diu]], Portuguese is still spoken.@@@@1@16@@danf@17-8-2009
10700210@unknown@formal@none@1@S@There are also significant populations of Portuguese speakers in [[Canada]] (mainly concentrated in and around [[Toronto]]), [[Bermuda]] and the [[Netherlands Antilles]].@@@@1@21@@danf@17-8-2009
10700220@unknown@formal@none@1@S@Portuguese is an official language of several international organizations.@@@@1@9@@danf@17-8-2009
10700230@unknown@formal@none@1@S@The [[Community of Portuguese Language Countries]] (with the Portuguese acronym CPLP) consists of the eight independent countries that have Portuguese as an official language.@@@@1@24@@danf@17-8-2009
10700240@unknown@formal@none@1@S@It is also an official language of the [[European Union]], [[Mercosul]], the [[Organization of American States]], the [[Organization of Ibero-American States]], the [[Union of South American Nations]], and the [[African Union]] (one of the working languages) and one of the official languages of other organizations.@@@@1@45@@danf@17-8-2009
10700250@unknown@formal@none@1@S@The Portuguese language is gaining popularity in Africa, Asia, and South America as a second language for study.@@@@1@18@@danf@17-8-2009
10700260@unknown@formal@none@1@S@Portuguese and Spanish are the fastest-growing European languages, and, according to estimates by UNESCO, Portuguese is the language with the highest potential for growth as an international language in southern Africa and South America.@@@@1@34@@danf@17-8-2009
10700270@unknown@formal@none@1@S@The Portuguese-speaking African countries are expected to have a combined population of 83 million by 2050.@@@@1@16@@danf@17-8-2009
10700280@unknown@formal@none@1@S@Since 1991, when Brazil joined the economic market of Mercosul with other South American nations, such as Argentina, Uruguay, and Paraguay, there has been an increase in interest in the study of Portuguese in those South American countries.@@@@1@38@@danf@17-8-2009
10700290@unknown@formal@none@1@S@The demographic weight of Brazil in the continent will continue to strengthen the presence of the language in the region.@@@@1@20@@danf@17-8-2009
10700300@unknown@formal@none@1@S@Although the use of Portuguese was in decline in Asia in the early 21st century, after Macau was returned to China in 1999, it is becoming a language of opportunity there, mostly because of the growth in the number of speakers in East Timor in the last five years but also because of increased Chinese diplomatic and financial ties with Portuguese-speaking countries.@@@@1@62@@danf@17-8-2009
10700310@unknown@formal@none@1@S@In July 2007, President Teodoro Obiang Nguema announced his government's decision to make Portuguese [[Equatorial Guinea]]'s third official language, in order to meet the requirements to apply for full membership of the [[Community of Portuguese Language Countries]].@@@@1@37@@danf@17-8-2009
10700320@unknown@formal@none@1@S@This upgrade from its current Associate Observer status would result in Equatorial Guinea being able to access several professional and academic exchange programs and the facilitation of cross-border circulation of citizens.@@@@1@31@@danf@17-8-2009
10700330@unknown@formal@none@1@S@Its application is currently being assessed by other CPLP members.@@@@1@10@@danf@17-8-2009
10700340@unknown@formal@none@1@S@In March 1994 the [[Bosque de Portugal]] (Portugal's Woods) was founded in the Brazilian city of [[Curitiba]].@@@@1@17@@danf@17-8-2009
10700350@unknown@formal@none@1@S@The park houses the Portuguese Language Memorial, which honors the Portuguese immigrants and the countries that adopted the Portuguese language.@@@@1@20@@danf@17-8-2009
10700360@unknown@formal@none@1@S@Originally there were seven nations represented with pillars, but the independence of [[East Timor]] brought yet another pillar for that nation in 2007.@@@@1@23@@danf@17-8-2009
10700370@unknown@formal@none@1@S@In March 2006, the [[Museum of the Portuguese Language]], an interactive museum about the Portuguese language, was founded in [[São Paulo]], Brazil, the city with the largest number of Portuguese speakers in the world.@@@@1@34@@danf@17-8-2009
10700380@unknown@formal@none@1@S@==Dialects==@@@@1@1@@danf@17-8-2009
10700390@unknown@formal@none@1@S@Portuguese is a [[pluricentric language]] with two main groups of [[dialect]]s, those of [[Brazil]] and those of the [[Old World]].@@@@1@20@@danf@17-8-2009
10700400@unknown@formal@none@1@S@For historical reasons, the dialects of Africa and Asia are generally closer to those of Portugal than the Brazilian dialects, although in some aspects of their phonetics, especially the pronunciation of unstressed vowels, they resemble [[Brazilian Portuguese]] more than [[European Portuguese]].@@@@1@41@@danf@17-8-2009
10700410@unknown@formal@none@1@S@They have not been studied as widely as European and Brazilian Portuguese.@@@@1@12@@danf@17-8-2009
10700420@unknown@formal@none@1@S@Audio samples of some dialects of Portuguese are available below.@@@@1@10@@danf@17-8-2009
10700430@unknown@formal@none@1@S@There are some differences between the areas but these are the best approximations possible.@@@@1@14@@danf@17-8-2009
10700440@unknown@formal@none@1@S@For example, the ''caipira'' dialect has some differences from that of Minas Gerais, but in general it is very close.@@@@1@20@@danf@17-8-2009
10700450@unknown@formal@none@1@S@A good example of Brazilian Portuguese may be found in the capital city, [[Brasília]], because its population comes from all parts of the country.@@@@1@24@@danf@17-8-2009
10700460@unknown@formal@none@1@S@'''[[Angola]]'''@@@@1@1@@danf@17-8-2009
10700470@unknown@formal@none@1@S@# ''Benguelense'' &mdash; [[Benguela]] province.@@@@1@5@@danf@17-8-2009
10700480@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som85.html ''Luandense''] &mdash; [[Luanda]] province.@@@@1@6@@danf@17-8-2009
10700490@unknown@formal@none@1@S@# ''Sulista'' &mdash; South of Angola.@@@@1@6@@danf@17-8-2009
10700500@unknown@formal@none@1@S@'''[[Brazil]]'''@@@@1@1@@danf@17-8-2009
10700510@unknown@formal@none@1@S@# ''[[Caipira]]'' &mdash; States of [[São Paulo (state)|São Paulo]] (countryside; the city of São Paulo and the eastern areas of the state have their own dialect, called ''paulistano''); southern [[Minas Gerais]], northern [[Paraná (state)|Paraná]], [[Goiás]] and [[Mato Grosso do Sul]].@@@@1@40@@danf@17-8-2009
10700520@unknown@formal@none@1@S@# ''Cearense'' &mdash; [[Ceará]].@@@@1@4@@danf@17-8-2009
10700530@unknown@formal@none@1@S@# ''Baiano'' &mdash; [[Bahia]].@@@@1@4@@danf@17-8-2009
10700540@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som90.html ''Fluminense''] &mdash; Variants spoken in the states of [[Rio de Janeiro (state)|Rio de Janeiro]] and [[Espírito Santo]] (excluding the city of Rio de Janeiro and its adjacent metropolitan areas, which have their own dialect, called ''[[carioca]]'').@@@@1@38@@danf@17-8-2009
10700550@unknown@formal@none@1@S@# ''[[Gaucho|Gaúcho]]'' &mdash; [[Rio Grande do Sul]].@@@@1@7@@danf@17-8-2009
10700560@unknown@formal@none@1@S@(There are many distinct accents in Rio Grande do Sul, mainly due to the heavy influx of European immigrants of diverse origins, who settled several colonies throughout the state.)@@@@1@29@@danf@17-8-2009
10700570@unknown@formal@none@1@S@# ''[[Mineiro]]'' &mdash; [[Minas Gerais]] (not prevalent in the [[Triângulo Mineiro]], southern and southeastern [[Minas Gerais]]).@@@@1@16@@danf@17-8-2009
10700580@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som91.html ''Nordestino''] &mdash; [[Northeast Region, Brazil|northeastern states of Brazil]] ([[Pernambuco]] and [[Rio Grande do Norte]] have a particular way of speaking).@@@@1@22@@danf@17-8-2009
10700590@unknown@formal@none@1@S@# ''Nortista'' &mdash; [[Amazon Basin]] states.@@@@1@6@@danf@17-8-2009
10700600@unknown@formal@none@1@S@# ''Paulistano'' &mdash; Variants spoken around [[São Paulo]] city and the eastern areas of São Paulo state.@@@@1@17@@danf@17-8-2009
10700610@unknown@formal@none@1@S@# ''Sertanejo'' &mdash; States of [[Goiás]] and [[Mato Grosso]] (the city of [[Cuiabá]] has a particular way of speaking).@@@@1@19@@danf@17-8-2009
10700620@unknown@formal@none@1@S@# ''Sulista'' &mdash; Variants spoken in the areas between the northern regions of [[Rio Grande do Sul]] and southern regions of São Paulo state.@@@@1@24@@danf@17-8-2009
10700630@unknown@formal@none@1@S@(The cities of [[Curitiba]], [[Florianópolis]], and [[Itapetininga]] have fairly distinct accents as well.)@@@@1@13@@danf@17-8-2009
10700640@unknown@formal@none@1@S@'''[[Portugal]]'''@@@@1@1@@danf@17-8-2009
10700650@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som69.html ''Açoriano''] (Azorean) &mdash; [[Azores]].@@@@1@6@@danf@17-8-2009
10700660@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som40.html ''Alentejano''] &mdash; [[Alentejo]]@@@@1@5@@danf@17-8-2009
10700670@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som44.html ''Algarvio''] &mdash; [[Algarve]] (there is a particular dialect in a small part of western Algarve).@@@@1@17@@danf@17-8-2009
10700680@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som1.html ''Alto-Minhoto''] &mdash; North of [[Braga]] (hinterland).@@@@1@8@@danf@17-8-2009
10700690@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som49.html ''Baixo-Beirão''; ''Alto-Alentejano''] &mdash; Central Portugal (hinterland).@@@@1@8@@danf@17-8-2009
10700700@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som9.html ''Beirão''] &mdash; Central Portugal.@@@@1@6@@danf@17-8-2009
10700710@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som22.html ''Estremenho''] &mdash; Regions of [[Coimbra]] and [[Lisbon]] (the Lisbon dialect has some peculiar features not shared with that of Coimbra).@@@@1@22@@danf@17-8-2009
10700720@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som60.html ''Madeirense''] (Madeiran) &mdash; [[Madeira]].@@@@1@6@@danf@17-8-2009
10700730@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som14.html ''Nortenho''] &mdash; Regions of Braga and [[Porto]].@@@@1@9@@danf@17-8-2009
10700740@unknown@formal@none@1@S@# [http://www.instituto-camoes.pt/cvc/hlp/geografia/som6.html ''Transmontano''] &mdash; [[Trás-os-Montes e Alto Douro]].@@@@1@8@@danf@17-8-2009
10700750@unknown@formal@none@1@S@'''Other countries'''@@@@1@2@@danf@17-8-2009
10700760@unknown@formal@none@1@S@* '''[[Cape Verde]]''' &mdash; [http://www.instituto-camoes.pt/cvc/hlp/geografia/som87.html ''Português cabo-verdiano''] ([[Cape Verdean Portuguese]])@@@@1@10@@danf@17-8-2009
10700770@unknown@formal@none@1@S@* '''[[Daman and Diu]]''', India &mdash; ''Damaense''.@@@@1@7@@danf@17-8-2009
10700780@unknown@formal@none@1@S@* '''[[East Timor]]''' &mdash; [http://www.instituto-camoes.pt/cvc/hlp/geografia/som84.html ''Timorense''] ([[East Timorese Portuguese|East Timorese]])@@@@1@10@@danf@17-8-2009
10700790@unknown@formal@none@1@S@* '''[[Goa]]''', India &mdash; ''Goês''.@@@@1@5@@danf@17-8-2009
10700800@unknown@formal@none@1@S@* '''[[Guinea-Bissau]]''' &mdash; [http://www.instituto-camoes.pt/cvc/hlp/geografia/som88.html ''Guineense''] ([[Guinean Portuguese]]).@@@@1@7@@danf@17-8-2009
10700810@unknown@formal@none@1@S@* '''[[Macau]]''', China &mdash; [http://www.instituto-camoes.pt/cvc/hlp/geografia/som92.html ''Macaense''] ([[Macanese Portuguese|Macanese]])@@@@1@8@@danf@17-8-2009
10700820@unknown@formal@none@1@S@* '''[[Mozambique]]''' &mdash; [http://www.instituto-camoes.pt/cvc/hlp/geografia/som89.html ''Moçambicano''] ([[Mozambican Portuguese|Mozambican]])@@@@1@7@@danf@17-8-2009
10700830@unknown@formal@none@1@S@* '''[[São Tomé and Príncipe]]''' &mdash; [http://www.instituto-camoes.pt/cvc/hlp/geografia/som83.html ''Santomense'']@@@@1@8@@danf@17-8-2009
10700840@unknown@formal@none@1@S@* '''[[Uruguay]]''' &mdash; [[Riverense Portuñol language|''Dialectos Portugueses del Uruguay (DPU)'']].@@@@1@10@@danf@17-8-2009
10700850@unknown@formal@none@1@S@Differences between dialects are mostly of [[accent (linguistics)|accent]] and [[vocabulary]], but between the Brazilian dialects and other dialects, especially in their most colloquial forms, there can also be some grammatical differences.@@@@1@31@@danf@17-8-2009
10700860@unknown@formal@none@1@S@The [[Portuguese creole|Portuguese-based creole]]s spoken in various parts of Africa, Asia, and the Americas are independent languages which should not be confused with Portuguese itself.@@@@1@25@@danf@17-8-2009
10700870@unknown@formal@none@1@S@==History==@@@@1@1@@danf@17-8-2009
10700880@unknown@formal@none@1@S@Arriving in the Iberian Peninsula in 216 BC, the Romans brought with them the [[Latin language]], from which all Romance languages descend.@@@@1@22@@danf@17-8-2009
10700890@unknown@formal@none@1@S@The language was spread by arriving Roman soldiers, settlers and merchants, who built Roman cities mostly near the settlements of previous civilizations.@@@@1@22@@danf@17-8-2009
10700900@unknown@formal@none@1@S@Between AD 409 and 711, as the Roman Empire collapsed in Western Europe, the Iberian Peninsula was conquered by Germanic peoples ([[Migration Period]]).@@@@1@23@@danf@17-8-2009
10700910@unknown@formal@none@1@S@The occupiers, mainly [[Suebi]] and [[Visigoths]], quickly adopted late Roman culture and the [[Vulgar Latin]] dialects of the peninsula.@@@@1@19@@danf@17-8-2009
10700920@unknown@formal@none@1@S@After the [[Moors|Moorish]] invasion of 711, [[Arabic language|Arabic]] became the administrative language in the conquered regions, but most of the population continued to speak a form of [[Romance languages|Romance]] commonly known as [[Mozarabic]].@@@@1@33@@danf@17-8-2009
10700930@unknown@formal@none@1@S@The influence exerted by Arabic on the Romance dialects spoken in the Christian kingdoms of the north was small, affecting mainly their lexicon.@@@@1@23@@danf@17-8-2009
10700940@unknown@formal@none@1@S@The earliest surviving records of a distinctively Portuguese language are administrative documents of the 9th century, still interspersed with many Latin phrases.@@@@1@22@@danf@17-8-2009
10700950@unknown@formal@none@1@S@Today this phase is known as Proto-Portuguese (between the 9th and the 12th centuries).@@@@1@14@@danf@17-8-2009
10700960@unknown@formal@none@1@S@In the first period of Old Portuguese — [[Galician-Portuguese]] Period (from the 12th to the 14th century) — the language gradually came into general use.@@@@1@25@@danf@17-8-2009
10700970@unknown@formal@none@1@S@For some time, it was the language of preference for [[lyric poetry]] in Christian Hispania, much as [[Occitan]] was the language of the [[Occitan literature#Poetry_of_the_troubadours|poetry of the troubadours]].@@@@1@28@@danf@17-8-2009
10700980@unknown@formal@none@1@S@Portugal was formally recognized as an independent kingdom by the [[Kingdom of Leon]] in 1143, with [[Afonso I of Portugal|Afonso Henriques]] as king.@@@@1@23@@danf@17-8-2009
10700990@unknown@formal@none@1@S@In 1290, King [[Denis of Portugal|Dinis]] created the first Portuguese university in Lisbon (the ''Estudos Gerais'', later moved to [[Coimbra]]) and decreed that Portuguese, then simply called the "common language", should be known as the Portuguese language and used officially.@@@@1@40@@danf@17-8-2009
10701000@unknown@formal@none@1@S@In the second period of Old Portuguese, from the 14th to the 16th century, with the [[Age of discovery|Portuguese discoveries]], the language was taken to many regions of [[Asia]], [[Africa]] and the [[Americas]] (nowadays, the great majority of Portuguese speakers live in Brazil, in South America).@@@@1@46@@danf@17-8-2009
10701010@unknown@formal@none@1@S@By the 16th century it had become a ''[[lingua franca]]'' in Asia and Africa, used not only for colonial administration and trade but also for communication between local officials and Europeans of all nationalities.@@@@1@34@@danf@17-8-2009
10701020@unknown@formal@none@1@S@Its spread was helped by mixed marriages between Portuguese and local people, and by its association with [[Roman Catholic]] [[missionary]] efforts, which led to the formation of a [[creole language]] called [[Kristang language|Kristang]] in many parts of Asia (from the word ''cristão'', "Christian").@@@@1@43@@danf@17-8-2009
10701030@unknown@formal@none@1@S@The language continued to be popular in parts of Asia until the 19th century.@@@@1@14@@danf@17-8-2009
10701040@unknown@formal@none@1@S@Some Portuguese-speaking Christian communities in [[India]], [[Sri Lanka]], [[Malaysia]], and [[Indonesia]] preserved their language even after they were isolated from Portugal.@@@@1@21@@danf@17-8-2009
10701050@unknown@formal@none@1@S@The end of the Old Portuguese period was marked by the publication of the ''Cancioneiro Geral'' by [[Garcia de Resende]], in 1516.@@@@1@22@@danf@17-8-2009
10701060@unknown@formal@none@1@S@The early times of Modern Portuguese, which spans from the 16th century to the present day, were characterized by an increase in the number of learned words borrowed from Classical Latin and Classical Greek since the Renaissance, which greatly enriched the lexicon.@@@@1@42@@danf@17-8-2009
10701070@unknown@formal@none@1@S@===Characterization===@@@@1@1@@danf@17-8-2009
10701080@unknown@formal@none@1@S@A distinctive feature of Portuguese is that it preserved the stressed vowels of [[Vulgar Latin]], which became diphthongs in other Romance languages; cf. Fr. ''pierre'', Sp. ''piedra'', It. ''pietra'', Port. ''pedra'', from Lat. ''petra''; or Sp. ''fuego'', It. ''fuoco'', Port. ''fogo'', from Lat. ''focum''.@@@@1@44@@danf@17-8-2009
10701090@unknown@formal@none@1@S@Another characteristic of early Portuguese was the loss of [[:wiktionary:intervocalic|intervocalic]] ''l'' and ''n'', sometimes followed by the merger of the two surrounding vowels, or by the insertion of an [[epenthesis|epenthetic vowel]] between them: cf. Lat. ''salire'', ''tenere'', ''catena'', Sp. ''salir'', ''tener'', ''cadena'', Port. ''sair'', ''ter'', ''cadeia''.@@@@1@46@@danf@17-8-2009
10701100@unknown@formal@none@1@S@When the [[elision|elided]] consonant was ''n'', it often [[nasalization|nasalized]] the preceding vowel: cf. Lat. ''manum'', ''rana'', ''bonum'', Port. ''mão'', ''rãa'', ''bõo'' (now ''mão'', ''rã'', ''bom'').@@@@1@25@@danf@17-8-2009
10701110@unknown@formal@none@1@S@This process was the source of most of the nasal diphthongs which are typical of Portuguese.@@@@1@16@@danf@17-8-2009
10701120@unknown@formal@none@1@S@In particular, the Latin endings ''-anem'', ''-anum'' and ''-onem'' became ''-ão'' in most cases, cf. Lat. ''canem'', ''germanum'', ''rationem'' with Modern Port. ''cão'', ''irmão'', ''razão'', and their plurals ''-anes'', ''-anos'', ''-ones'' normally became ''-ães'', ''-ãos'', ''-ões'', cf. ''cães'', ''irmãos'', ''razões''.@@@@1@40@@danf@17-8-2009
10701130@unknown@formal@none@1@S@===Movement to make Portuguese an official language of the UN===@@@@1@10@@danf@17-8-2009
10701140@unknown@formal@none@1@S@There is a growing number of people in the Portuguese-speaking media and on the internet who are presenting the case to the CPLP and other organizations to hold a debate in the [[Lusophone]] community with the purpose of bringing forward a petition to make Portuguese an official language of the United Nations.@@@@1@52@@danf@17-8-2009
10701150@unknown@formal@none@1@S@In October 2005, during the international Convention of the [http://www.elosinternacional.com.br/index.htm Elos Club International] that took place in Tavira, Portugal, a petition was written and unanimously approved; its text can be found on the internet under the title ''Petição Para Tornar Oficial o Idioma Português na ONU''.@@@@1@46@@danf@17-8-2009
10701160@unknown@formal@none@1@S@Romulo Alexandre Soares, president of the Brazil-Portugal Chamber, highlights that the positioning of Brazil in the international arena as one of the emergent powers of the 21st century, the size of its population, and the presence of the language around the world provide legitimacy and justify a petition to the UN to make Portuguese an official language at the UN.@@@@1@60@@danf@17-8-2009
10701170@unknown@formal@none@1@S@==Vocabulary==@@@@1@1@@danf@17-8-2009
10701180@unknown@formal@none@1@S@Most of the lexicon of Portuguese is derived from Latin.@@@@1@10@@danf@17-8-2009
10701190@unknown@formal@none@1@S@Nevertheless, because of the [[Moors|Moorish]] occupation of the [[Iberian Peninsula]] during the Middle Ages, and the participation of Portugal in the [[Age of Discovery]], it has adopted loanwords from all over the world.@@@@1@33@@danf@17-8-2009
10701200@unknown@formal@none@1@S@Very few Portuguese words can be traced to the [[Pre-Roman peoples of the Iberian Peninsula|pre-Roman inhabitants of Portugal]], which included the [[Gallaeci]], [[Lusitanians]], [[Celtici]] and [[Cynetes]].@@@@1@26@@danf@17-8-2009
10701210@unknown@formal@none@1@S@The [[Phoenicians]] and [[Carthaginians]], briefly present, also left a few traces.@@@@1@11@@danf@17-8-2009
10701220@unknown@formal@none@1@S@Some notable examples are ''abóbora'' "pumpkin" and ''bezerro'' "year-old calf", from the nearby [[Celtiberian language]] (probably through the Celtici); ''cerveja'' "beer", from [[Celtic languages|Celtic]]; ''saco'' "bag", from [[Phoenician language|Phoenician]]; and ''cachorro'' "dog, puppy", from [[Basque language|Basque]].@@@@1@36@@danf@17-8-2009
10701230@unknown@formal@none@1@S@In the 5th century, the Iberian Peninsula (the [[Ancient Rome|Roman]] [[Hispania]]) was conquered by the [[Germanic peoples|Germanic]] [[Suevi]] and [[Visigoths]].@@@@1@20@@danf@17-8-2009
10701240@unknown@formal@none@1@S@As they adopted the Roman civilization and language, however, these people contributed only a few words to the lexicon, mostly related to warfare — such as ''espora'' "spur", ''estaca'' "stake", and ''guerra'' "war", from [[Gothic language|Gothic]] ''*spaúra'', ''*stakka'', and ''*wirro'', respectively.@@@@1@41@@danf@17-8-2009
10701250@unknown@formal@none@1@S@Between the 9th and 15th centuries Portuguese acquired about 1000 words from [[Arabic language|Arabic]] through the influence of [[al-Andalus|Moorish Iberia]].@@@@1@20@@danf@17-8-2009
10701260@unknown@formal@none@1@S@They are often recognizable by the initial Arabic article ''a''(''l'')''-'', and include many common words such as ''aldeia'' "village" from الضيعة ''aldaya'', ''alface'' "lettuce" from الخس ''alkhass'', ''armazém'' "warehouse" from المخزن ''almahazan'', and ''azeite'' "olive oil" from زيت ''azzait''.@@@@1@39@@danf@17-8-2009
10701270@unknown@formal@none@1@S@From Arabic came also the grammatically peculiar word [[Insha'Allah|''oxalá'']] "hopefully".@@@@1@10@@danf@17-8-2009
10701280@unknown@formal@none@1@S@The Mozambican currency name [[Mozambican Metical|''metical'']] was derived from the word مثقال ''miṯqāl'', a unit of weight.@@@@1@17@@danf@17-8-2009
10701290@unknown@formal@none@1@S@The word Mozambique itself is from the Arabic name of sultan Muça Alebique (Musa Alibiki).@@@@1@15@@danf@17-8-2009
10701300@unknown@formal@none@1@S@The name of the Portuguese town of [[Fátima, Portugal|Fátima]] comes from the name of one of the daughters of the prophet [[Muhammad]].@@@@1@22@@danf@17-8-2009
10701310@unknown@formal@none@1@S@Starting in the 15th century, the Portuguese maritime explorations led to the introduction of many loanwords from [[Asia]]n languages.@@@@1@19@@danf@17-8-2009
10701320@unknown@formal@none@1@S@For instance, ''catana'' "cutlass" from Japanese ''katana''; ''corja'' "rabble" from Malay ''kórchchu''; and ''chá'' "tea" from [[Chinese language|Chinese]] [[Tea#The word tea|''chá'']].@@@@1@21@@danf@17-8-2009
10701330@unknown@formal@none@1@S@From South America came ''batata'' "[[potato]]", from [[Taino]]; ''ananás'' and ''abacaxi'', from [[Tupi-Guarani]] ''naná'' and [[Tupi language|Tupi]] ''ibá cati'', respectively (two species of [[pineapple]]), and ''tucano'' "[[toucan]]" from [[Guarani language|Guarani]] ''tucan''.@@@@1@31@@danf@17-8-2009
10701340@unknown@formal@none@1@S@See [[List of Brazil state name etymologies]], for some more examples.@@@@1@11@@danf@17-8-2009
10701350@unknown@formal@none@1@S@From the 16th to the 19th century, because of the role of Portugal as intermediary in the [[Atlantic slave trade]] and the establishment of large Portuguese colonies in Angola, Mozambique, and Brazil, Portuguese acquired several words of African and [[indigenous peoples of Brazil|Amerind]] origin, especially names for most of the animals and plants found in those territories.@@@@1@57@@danf@17-8-2009
10701360@unknown@formal@none@1@S@While those terms are mostly used in the former colonies, many became current in European Portuguese as well.@@@@1@18@@danf@17-8-2009
10701370@unknown@formal@none@1@S@From [[Kimbundu language|Kimbundu]], for example, came ''kifumate'' → ''cafuné'' "head caress", ''kusula'' → ''caçula'' "youngest child", ''marimbondo'' "tropical wasp", and ''kubungula'' → ''bungular'' "to dance like a wizard".@@@@1@28@@danf@17-8-2009
10701380@unknown@formal@none@1@S@Finally, it has received a steady influx of loanwords from other European languages.@@@@1@13@@danf@17-8-2009
10701390@unknown@formal@none@1@S@For example, ''melena'' "hair lock", ''fiambre'' "wet-cured ham" (in contrast with ''presunto'' "dry-cured ham" from Latin ''prae-exsuctus'' "dehydrated"), and ''castelhano'' "Castilian", from Spanish; ''colchete''/''crochê'' "bracket"/"crochet", ''paletó'' "jacket", ''batom'' "lipstick", and ''filé''/''filete'' "steak"/"slice" respectively, from French ''crochet'', ''paletot'', ''bâton'', ''filet''; ''macarrão'' "pasta", ''piloto'' "pilot", ''carroça'' "carriage", and ''barraca'' "barrack", from Italian ''maccherone'', ''pilota'', ''carrozza'', ''baracca''; and ''bife'' "steak", ''futebol'', ''revólver'', ''estoque'', ''folclore'', from English ''beef'', ''football'', ''revolver'', ''stock'', ''folklore''.@@@@1@68@@danf@17-8-2009
10701400@unknown@formal@none@1@S@==Classification and related languages==@@@@1@4@@danf@17-8-2009
10701410@unknown@formal@none@1@S@Portuguese belongs to the [[West Iberian languages|West Iberian]] branch of the [[Romance language]]s, and it has special ties with the following members of this group:@@@@1@25@@danf@17-8-2009
10701420@unknown@formal@none@1@S@* [[Galician language|Galician]] and the [[Fala language|Fala]], its closest relatives.@@@@1@10@@danf@17-8-2009
10701430@unknown@formal@none@1@S@See below.@@@@1@2@@danf@17-8-2009
10701440@unknown@formal@none@1@S@* [[Spanish language|Spanish]], the major language closest to Portuguese.@@@@1@9@@danf@17-8-2009
10701450@unknown@formal@none@1@S@(See also [[Differences between Spanish and Portuguese]].)@@@@1@7@@danf@17-8-2009
10701460@unknown@formal@none@1@S@* [[Mirandese language|Mirandese]], another West Iberian language spoken in Portugal.@@@@1@10@@danf@17-8-2009
10701470@unknown@formal@none@1@S@* [[Judeo-Portuguese]] and [[Ladino language|Judeo-Spanish]], languages spoken by [[Sephardic Jew]]s, which remained close to Portuguese and Spanish.@@@@1@17@@danf@17-8-2009
10701480@unknown@formal@none@1@S@Despite the obvious lexical and grammatical similarities between Portuguese and other Romance languages, it is not [[mutually intelligible]] with most of them.@@@@1@22@@danf@17-8-2009
10701490@unknown@formal@none@1@S@Apart from Galician, Portuguese speakers will usually need some formal study of basic grammar and vocabulary, before attaining a reasonable level of comprehension of those languages, and vice-versa.@@@@1@28@@danf@17-8-2009
10701500@unknown@formal@none@1@S@===Galician and the Fala===@@@@1@4@@danf@17-8-2009
10701510@unknown@formal@none@1@S@The closest language to Portuguese is Galician, spoken in the autonomous community of Galicia (northwestern Spain).@@@@1@16@@danf@17-8-2009
10701520@unknown@formal@none@1@S@The two were at one time a single language, known today as [[Galician-Portuguese]], but since the political separation of Portugal from Galicia they have diverged somewhat, especially in pronunciation and vocabulary.@@@@1@31@@danf@17-8-2009
10701530@unknown@formal@none@1@S@Nevertheless, the core vocabulary and grammar of Galician are still noticeably closer to Portuguese than to Spanish.@@@@1@17@@danf@17-8-2009
10701540@unknown@formal@none@1@S@In particular, like Portuguese, it uses the future subjunctive, the personal infinitive, and the synthetic pluperfect (see the section on the grammar of Portuguese, below).@@@@1@25@@danf@17-8-2009
10701550@unknown@formal@none@1@S@Mutual intelligibility (estimated at 85% by R. A. Hall, Jr., 1989) is good between Galicians and northern Portuguese, but poorer between Galicians and speakers from central Portugal.@@@@1@27@@danf@17-8-2009
10701560@unknown@formal@none@1@S@The Fala language is another descendant of Galician-Portuguese, spoken by a small number of people in the Spanish towns of Valverdi du Fresnu, As Ellas and Sa Martín de Trebellu (autonomous community of [[Extremadura]], near the border with Portugal).@@@@1@39@@danf@17-8-2009
10701570@unknown@formal@none@1@S@===Influence on other languages===@@@@1@4@@danf@17-8-2009
10701580@unknown@formal@none@1@S@Many languages have [[loanword|borrowed words]] from Portuguese, such as [[Bahasa Indonesia|Indonesian]], [[Sri Lanka]]n [[Sri Lanka Tamils (native)|Tamil]] and [[Sinhalese language|Sinhalese]] (see [[Sri Lanka Indo-Portuguese language|Sri Lanka Indo-Portuguese]]), [[Malay language|Malay]], [[Bengali language|Bengali]], [[English (language)|English]], [[Hindi]], [[Konkani language|Konkani]], [[Marathi language|Marathi]], [[Tetum language|Tetum]], [[Tsonga language|Xitsonga]], [[Papiamentu]], [[Japanese language|Japanese]], [[Barbadian|Bajan Creole]] (spoken in Barbados), [[Lanc-Patuá]] (spoken in northern Brazil) and [[Sranan Tongo]] (spoken in Suriname).@@@@1@61@@danf@17-8-2009
10701590@unknown@formal@none@1@S@It left a strong influence on the ''[[Old Tupi|língua brasílica]]'', a [[Tupi-Guarani|Tupi-Guarani language]] which was the most widely spoken in [[Brazil]] until the 18th century, and on the language spoken around [[Sikka]] in [[Flores|Flores Island]], [[Indonesia]].@@@@1@36@@danf@17-8-2009
10701600@unknown@formal@none@1@S@In nearby [[Larantuka]], Portuguese is used for prayers in [[Holy Week]] rituals.@@@@1@12@@danf@17-8-2009
10701610@unknown@formal@none@1@S@The Japanese-Portuguese dictionary ''[[Nippo Jisho]]'' (1603) was the first dictionary of Japanese in a European language, a product of [[Society of Jesus|Jesuit]] missionary activity in [[Japan]].@@@@1@26@@danf@17-8-2009
10701620@unknown@formal@none@1@S@Building on the work of earlier Portuguese missionaries, the ''Dictionarium Anamiticum, Lusitanum et Latinum'' (Annamite-Portuguese-Latin dictionary) of [[Alexandre de Rhodes]] (1651) introduced the modern [[Vietnamese alphabet|orthography of Vietnamese]], which is based on the orthography of 17th-century Portuguese.@@@@1@37@@danf@17-8-2009
10701630@unknown@formal@none@1@S@The [[Romanization]] of [[Chinese language|Chinese]] was also influenced by the Portuguese language (among others), particularly regarding [[List of common Chinese surnames|Chinese surnames]]; one example is ''Mei''.@@@@1@26@@danf@17-8-2009
10701640@unknown@formal@none@1@S@See also [[List of English words of Portuguese origin]], [[Loan words in Indonesian]], [[Japanese words of Portuguese origin]], [[Malay_language#Borrowed_words|Borrowed words in Malay]], [[Sinhala words of Portuguese origin]], [[Loan words in Sri Lankan Tamil#Portuguese|Loan words from Portuguese in Sri Lankan Tamil]].@@@@1@40@@danf@17-8-2009
10701650@unknown@formal@none@1@S@===Derived languages===@@@@1@2@@danf@17-8-2009
10701660@unknown@formal@none@1@S@Beginning in the 16th century, the extensive contacts between Portuguese travelers and settlers, African slaves, and local populations led to the appearance of many [[pidgin]]s with varying amounts of Portuguese influence.@@@@1@31@@danf@17-8-2009
10701670@unknown@formal@none@1@S@As these pidgins became the mother tongue of succeeding generations, they evolved into fully fledged [[creole language]]s, which remained in use in many parts of Asia and Africa until the 18th century.@@@@1@32@@danf@17-8-2009
10701680@unknown@formal@none@1@S@Some Portuguese-based or Portuguese-influenced creoles are still spoken today, by over 3 million people worldwide, especially people of partial [[Portuguese people|Portuguese]] ancestry.@@@@1@22@@danf@17-8-2009
10701690@unknown@formal@none@1@S@== Phonology ==@@@@1@3@@danf@17-8-2009
10701700@unknown@formal@none@1@S@There is a maximum of 9 oral vowels and 19 consonants, though some varieties of the language have fewer phonemes (Brazilian Portuguese has only 8 oral vowel [[phone]]s).@@@@1@28@@danf@17-8-2009
10701710@unknown@formal@none@1@S@There are also five nasal vowels, which some linguists regard as allophones of the oral vowels, ten oral [[diphthong]]s, and five nasal diphthongs.@@@@1@23@@danf@17-8-2009
10701720@unknown@formal@none@1@S@===Vowels===@@@@1@1@@danf@17-8-2009
10701730@unknown@formal@none@1@S@To the seven vowels of [[Vulgar Latin]], European Portuguese has added two [[Mid-centralized vowel|near central vowels]], one of which tends to be [[elision|elided]] in [[relaxed pronunciation|rapid speech]], like the ''e caduc'' of [[French language|French]] (represented either as {{IPA|/ɯ̽/}}, or {{IPA|/ɨ/}}, or {{IPA|/ə/}}).@@@@1@42@@danf@17-8-2009
10701740@unknown@formal@none@1@S@The close-mid vowels {{IPA|/e o/}} and the open-mid vowels {{IPA|/ɛ ɔ/}} are four distinct phonemes, and they alternate in various forms of [[apophony]].@@@@1@23@@danf@17-8-2009
10701750@unknown@formal@none@1@S@Like [[Catalan language|Catalan]], Portuguese uses vowel quality to contrast stressed syllables with unstressed syllables: isolated vowels tend to be [[Vowel#Height|raised]], and in some cases centralized, when unstressed.@@@@1@27@@danf@17-8-2009
10701760@unknown@formal@none@1@S@Nasal diphthongs occur mostly at the end of words.@@@@1@9@@danf@17-8-2009
10701770@unknown@formal@none@1@S@===Consonants===@@@@1@1@@danf@17-8-2009
10701780@unknown@formal@none@1@S@The consonant inventory of Portuguese is fairly conservative.@@@@1@8@@danf@17-8-2009
10701790@unknown@formal@none@1@S@The medieval affricates {{IPA|/ts/}}, {{IPA|/dz/}}, {{IPA|/tʃ/}}, {{IPA|/dʒ/}} merged with the fricatives {{IPA|/s/}}, {{IPA|/z/}}, {{IPA|/ʃ/}}, {{IPA|/ʒ/}}, respectively, but not with each other, and there have been no other significant changes to the consonant phonemes since then.@@@@1@35@@danf@17-8-2009
10701800@unknown@formal@none@1@S@However, some notable dialectal variants and [[allophone]]s have appeared, including:@@@@1@10@@danf@17-8-2009
10701810@unknown@formal@none@1@S@*In many regions of Brazil, {{IPA|/t/}} and {{IPA|/d/}} have the affricate allophones {{IPA|[tʃ]}} and {{IPA|[dʒ]}}, respectively, before {{IPA|/i/}} and {{IPA|/ĩ/}}.@@@@1@20@@danf@17-8-2009
10701820@unknown@formal@none@1@S@([[Quebec French]] has a similar phenomenon, with alveolar affricates instead of postalveolars.@@@@1@12@@danf@17-8-2009
10701830@unknown@formal@none@1@S@[[Japanese language|Japanese]] is another example.)@@@@1@5@@danf@17-8-2009
10701840@unknown@formal@none@1@S@*At the end of a syllable, the phoneme {{IPA|/l/}} has the allophone {{IPA|[u̯]}} in Brazilian Portuguese (''[[L-vocalization#L-vocalization|L-vocalization]]'').@@@@1@17@@danf@17-8-2009
10701850@unknown@formal@none@1@S@*In many parts of Brazil and Angola, intervocalic {{IPA|/ɲ/}} is pronounced as a [[nasalization|nasalized]] [[palatal approximant]] {{IPA|[j̃]}} which nasalizes the preceding vowel, so that for instance {{IPA|/ˈniɲu/}} is pronounced {{IPA|[ˈnĩj̃u]}}.@@@@1@30@@danf@17-8-2009
10701860@unknown@formal@none@1@S@*In most of Brazil, the alveolar sibilants {{IPA|/s/}} and {{IPA|/z/}} occur in complementary distribution at the end of syllables, depending on whether the consonant that follows is voiceless or voiced, as in English.@@@@1@33@@danf@17-8-2009
10701870@unknown@formal@none@1@S@But in most of Portugal and parts of Brazil sibilants are postalveolar at the end of syllables, {{IPA|/ʃ/}} before voiceless consonants, and {{IPA|/ʒ/}} before voiced consonants (in [[Ladino language|Judeo-Spanish]], {{IPA|/s/}} is often replaced with {{IPA|/ʃ/}} at the end of syllables, too).@@@@1@41@@danf@17-8-2009
10701880@unknown@formal@none@1@S@*There is considerable dialectal variation in the value of the [[Rhotic consonant|rhotic]] phoneme {{IPA|/ʁ/}}.@@@@1@14@@danf@17-8-2009
10701890@unknown@formal@none@1@S@See [[Guttural R#Portuguese|Guttural R in Portuguese]] for details.@@@@1@8@@danf@17-8-2009
10701900@unknown@formal@none@1@S@==Grammar==@@@@1@1@@danf@17-8-2009
10701910@unknown@formal@none@1@S@A particularly interesting aspect of the grammar of Portuguese is the verb.@@@@1@12@@danf@17-8-2009
10701920@unknown@formal@none@1@S@Morphologically, more verbal inflections from Classical Latin have been preserved by Portuguese than by any other major Romance language.@@@@1@19@@danf@17-8-2009
10701930@unknown@formal@none@1@S@See [[Romance copula#Morphological comparison|Romance copula]] for a detailed comparison.@@@@1@9@@danf@17-8-2009
10701940@unknown@formal@none@1@S@It also has some innovations not found in other Romance languages (except Galician and Fala):@@@@1@15@@danf@17-8-2009
10701950@unknown@formal@none@1@S@* The [[present perfect tense]] has an iterative sense unique among the Romance languages.@@@@1@14@@danf@17-8-2009
10701960@unknown@formal@none@1@S@It denotes an action or a series of actions which began in the past and are expected to keep repeating in the future.@@@@1@23@@danf@17-8-2009
10701970@unknown@formal@none@1@S@For instance, the sentence ''Tenho tentado falar com ela'' would be translated as "I have been trying to talk to her", not "I have tried to talk to her".@@@@1@29@@danf@17-8-2009
10701980@unknown@formal@none@1@S@On the other hand, the correct translation of the question "Have you heard the latest news?" is not ''*Tem ouvido a última notícia?'', but ''Ouviu a última notícia?'', since no repetition is implied.@@@@1@33@@danf@17-8-2009
10701990@unknown@formal@none@1@S@* The future [[Subjunctive mood|subjunctive]] tense, which was developed by medieval [[West Iberian languages|West Iberian Romance]], but has now fallen into disuse in Spanish, is still used in [[vernacular]] Portuguese.@@@@1@30@@danf@17-8-2009
10702000@unknown@formal@none@1@S@It appears in dependent clauses that denote a condition which must be fulfilled in the future for the event in the independent clause to occur.@@@@1@25@@danf@17-8-2009
10702010@unknown@formal@none@1@S@Other languages normally employ the present tense under the same circumstances:@@@@1@11@@danf@17-8-2009
10702020@unknown@formal@none@1@S@:''Se ''for'' eleito presidente, mudarei a lei.''@@@@1@7@@danf@17-8-2009
10702030@unknown@formal@none@1@S@:If ''I am'' elected president, I will change the law.@@@@1@10@@danf@17-8-2009
10702040@unknown@formal@none@1@S@:''Quando ''fores'' mais velho, vais entender.''@@@@1@6@@danf@17-8-2009
10702050@unknown@formal@none@1@S@:When ''you are'' older, you will understand.@@@@1@7@@danf@17-8-2009
10702060@unknown@formal@none@1@S@* The personal [[infinitive]]: infinitives can [[inflection|inflect]] according to their subject in [[Grammatical person|person]] and [[Grammatical number|number]], often showing who is expected to perform a certain action; cf. ''É melhor voltares'' "It is better [for you] to go back," ''É melhor voltarmos'' "It is better [for us] to go back."@@@@1@50@@danf@17-8-2009
10702070@unknown@formal@none@1@S@Perhaps for this reason, infinitive clauses replace subjunctive clauses more often in Portuguese than in other Romance languages.@@@@1@18@@danf@17-8-2009
10702080@unknown@formal@none@1@S@==Writing system==@@@@1@2@@danf@17-8-2009
10702090@unknown@formal@none@1@S@Portuguese is written with the [[Latin alphabet]], making use of five [[diacritic]]s to denote stress, vowel height, contraction, nasalization, and other sound changes (acute accent, grave accent, circumflex accent, tilde, and cedilla).@@@@1@32@@danf@17-8-2009
10702100@unknown@formal@none@1@S@[[Brazilian Portuguese]] also uses the diaeresis mark.@@@@1@7@@danf@17-8-2009
10702110@unknown@formal@none@1@S@Accented characters and digraphs are not counted as separate letters for [[collation]] purposes.@@@@1@13@@danf@17-8-2009
10702120@unknown@formal@none@1@S@===Brazilian vs. European spelling===@@@@1@4@@danf@17-8-2009
10702130@unknown@formal@none@1@S@There are some minor differences between the orthographies of Brazil and other Portuguese-speaking countries.@@@@1@14@@danf@17-8-2009
10702140@unknown@formal@none@1@S@One of the most pervasive is the use of acute accents in the European/African/Asian orthography in many words such as ''sinónimo'', where the Brazilian orthography has a circumflex accent, ''sinônimo''.@@@@1@30@@danf@17-8-2009
10702150@unknown@formal@none@1@S@Another important difference is that Brazilian spelling often lacks ''c'' or ''p'' before ''c'', ''ç'', or ''t'', where the European orthography has them; for example, cf. Brazilian ''fato'' with European ''facto'', "fact", or Brazilian ''objeto'' with European ''objecto'', "object".@@@@1@39@@danf@17-8-2009
10702160@unknown@formal@none@1@S@Some of these spelling differences reflect differences in the pronunciation of the words, but others are merely graphic.@@@@1@18@@danf@17-8-2009
10702170@unknown@formal@none@1@S@==Examples==@@@@1@1@@danf@17-8-2009
10702180@unknown@formal@none@1@S@;Excerpt from the Portuguese [[national epic]] ''[[Os Lusíadas]]'', by author [[Luís de Camões]] (I, 33)@@@@1@15@@danf@17-8-2009