tech

  • Introducing the Document Liberation Project

    DLP logoThe Document Foundation blog today announced the birth of the Document Liberation Project, a home for the growing community of developers united to free users from vendor lock-in of contents. Together, these hackers will offer a solution to the routine problem faced by many computer users who have their personal digital contents stored in an old, outdated and inaccessible file format.

    A routine problem encountered by computer users today is the discovery of personal digital content created years ago and stored in old, outdated file formats. These old files often cannot be opened by any application on the user’s current operating system. The users are quite simply locked out of their own content. The most common reason for this inability to access old data is the use of proprietary file formats that result in vendor lock-in.

    The Document Liberation Project has been created in the hope that it would enable individuals, organisations and governments to recover their data from proprietary formats and provide a mechanism to convert that data into open file formats, returning effective control of the content from software companies to the actual authors.

    The Document Liberation Project believes:

    • documents and their content belong to their creators, not software vendors;
    • that access to content you own should not be hindered by the fact that the application that created it is not maintained any more or that the application does not work on the particular operating system that you use;
    • that use of truly open and free standards for encoding digital content is the only long-term guarantee that a user’s digital content will never be beholden to a single vendor;
    • that implementation of free and open source software that can read proprietary file formats is the best solution to escape vendor lock-in during the transition period to truly open and free standards.

    Its mission is as follows:

    • to try to understand the structure and details of undocumented proprietary file formats;
    • to use that understanding of the file formats to implement libraries that are able to parse such documents and extract as much information as possible from them;
    • to use our existing framework to encode this data in a truly free and open standard file format: Open Document Format.

    Developers have so far provided read support for proprietary file formats including MS Visio, CorelDraw, MS Publisher, Apple Keynote and a handful of different old Macintosh formats. In addition to LibreOffice, import libraries for these file formats are used by Abiword, Calligra, CorelDRAW File Viewer, Inkscape and Scribus.

  • Call for proposals to improve major LibreOffice/OpenOffice features

    The Office Interoperability Working Group of the Open Source Business Alliance has called for proposals to improve major features in LibreOffice and OpenOffice suites.

    OSB logoAs announced in September 2013, the OSB Alliance’s Office Interoperability Working Group has held a requirements engineering workshop in Stuttgart. OSB Alliance members Munich city council, Leipzig city council, Jena city council, the Swiss Federal Court, and the Swiss Federal IT Steering Unit FITSU defined commonly required improvements for the open source LibreOffice and OpenOffice suites. Other public authorities will providing supporting funding.

    A new specification entitled “Major Feature Improvements for LibreOffice/Apache OpenOffice” (PDF) has been developed recently on the basis of the Stuttgart workshop. It proposes six major improvements, as follows:

    1. Improve mail merge in Writer;
    2. Improve paragraph handling in Writer;
    3. Implement styles in all content elements of Writer;
    4. Add chart styles in Calc;
    5. Make more functions available in shared spreadsheets in Calc;
    6. Develop a change tracking specification for the ODF standard.

    ODF file iconOne important feature of today’s office suites is change tracking within documents: both LibreOffice and OpenOffice offer change tracking in ODF files. However, Microsoft Office has not implemented change tracking for ODF, stating that the current change tracking specification within the ODF standard is insufficiently defined. The last of the above requirements therefore covers the exact specification of change tracking within the ODF standard to enable Microsoft to implement it in future versions of Microsoft Office.

    Open source providers are now invited to submit offers to cover one or more of these requirements. The detailed requirements, general conditions and tendering procedure are described in the specification document. All proposals need to be submitted until 30 April 2014 to the working group’s spokesman, Dr. Matthias Stürmer (email: stuermer (at) osb-alliance.com).

  • A lost Bristol street re-emerges

    Avon Archaeology is currently conducting a dig on a site at the junction of Wade Street and Little Ann Street in St Judes that is going to be redeveloped for housing; it was most recently used as a secure car park.

    Yesterday I managed to get a couple of pictures through the fencing around the site.

    image of archaelogical dig in St Judes

    The red brickwork in the centre foreground is the remains of a collapsed vault, suggesting there was a cellar beneath the building.

    image of archaeological dig in St Judes

    The cobbled and accompanying paved footways are one of Bristol’s lost streets seeing daylight again.

    The street itself was known as Pratten’s Court and can be seen on the following screenshot from the excellent Know Your Place website (posts passim) showing the 1880 Ordnance Survey map layer.

    Screenshot showing Pratten's Court on 1880s OS map

    The housing around Pratten’s Court was originally developed in the 18th century and demolished some time in the first half of the 20th century. It does not show up on the 1946 aerial photographs layer on Know Your Place.

    Avon Archaeological Unit carried out an assessment of the Wade Street area in 2000 which concluded as follows:

    An archaeological desk-based assessment of sites on the north and south sides of the junction of Wade Street and Little Ann Street was carried out by Andrew Smith for the Avon Archaeological Unit in April 2000. The likely survival of palaeo-environmental evidence for the formation of the floodplain of the river Frome, for Romano-British activity and for the development of the area as artisanal housing in the early-eighteenth century was noted.

    Further down Wade Street crosses the River Frome. Somewhere in this area a Roman Road, the Via Julia, which went from London to South Wales via Portus Abonae (now better known as Sea Mills. Ed.) crossed the Frome. In 1865 2 Roman lead pigs were discovered near the river. This find was reported in Part 23 of the Archaeological Journal in 1866. Know Your Place records this find as follows:

    In 1865, during commercial excavations in Wade Street possibly associated with the construction of a stone revetment wall for the river Frome, two lead ingots of Roman date (one weighing 76 pounds and the other 89 pounds) were found. Both carried inscriptions with identical damage, which was taken to suggest that the ingots had been cast from the same mould. The inscription read “IMP’ CAES’ A[NTON]INI’ AUG’ PII P’ P”. One (89 pounds) passed into the possession of a Mr. Edkins and the other was taken to Sheldon, Bush shot works on Redcliff Hill. Mr. Arthur Bush subsequently donated this ingot to the British Museum (Anon. 1866). Elkington (in Branigan & Fowler, 1976 195) implies that the ingots were almost certainly produced by the Mendip lead-mining industry and points out that flaws on the Wade Street ingots establish that they were cast in the same mould as two of the four ingots found at Rookery Farm, Green Ore, near Wells in 1956 (Anon. 1957, 230-231). However, sampling of the ingot held by Bristol Museum and Art Gallery in 2001 by Vincent Gardiner as part of his postgraduate research into the technology and distribution of Romano-British lead pigs found that the isotopes present suggested an origin in the Bristol/Frome/Weston-super-Mare area.

  • Today is DFD 2014

    logo for Document Freedom Day 2014Today is Document Freedom Day (DFD) 2014. DFD is an annual celebration of and opportunity to promote the use of open formats and standards for digital documents and takes place on the last Wednesday in March each year.

    Document freedom means documents that are free can be used in any way that the author intends. They can be read, transmitted, edited, and transformed using a variety of tools.

    Open standards are formats which everybody can use free of charge and restriction. They come with compatibility “built-in” – the way they work is shared publicly and any organisation or person can use them in their products and services without asking for permission. Open Standards are the foundation of co-operation and modern society.

    However, don’t just take my word for it.

    Below are some testimonials for open standards and document freedom from people with a bit more influence than your ‘umble scribe.

    Neelie Kroes, Vice-President, European Commission

    I know a smart business decision when I see one – choosing open standards is a very smart business decision indeed.

    Stephen Fry, actor, screenwriter, author, playwright, journalist, poet, comedian, television presenter and film director

    Open standards make sense. What makes no sense is that large companies in the field still do not understand this. It is time once and for all to end the pointless nonsense of one document sent on one platform being incomprehensible to the user of another.

    Chris DiBona, Open Source Manager, Google

    Over time, files that have been saved in closed formats tend to be less and less accessible to their creators. We prefer people to use modern and truly open formats like ODF whenever possible to ensure that they can continue to access and enjoy their work today and into the far future.

    Happy DFD 2014!

  • Canary Islands government to adopt OpenOffice

    The autonomous government of Spain’s Canary Islands has announced in a press release (Spanish) that the Directorate General of Telecommunications and New Technologies has proposed that the free and open source OpenOffice 4.0 office suite be adopted by the government of the islands as its corporate office productivity software.

    screenshot of OpenOffice splash screen

    At the same time it also announced a standard for web site content management systems to be preferred by all Canary Islands government departments. It decided on “Portal web Tipo”, a package built in-house as part of the islands’ Platino e-government services platform. Platino and its components are being made available as open source to other Spanish public sector organisations via the CTT (Centro de Transferencia de Tecnología – Technology Transfer Centre) software repository.

  • Debian Installer Jessie Alpha 1 release

    Debian logoThe first alpha of the installation media for Debian 8 (codenamed Jessie) GNU/Linux comes with the lightweight Xfce desktop as standard. The reloading of firmware is not working in this initial version.

    The Debian installer team has released an initial alpha for Debian 8 (Jessie). The standard images for testing the Jessie installation are supplied with the Xfce desktop as standard. However, it is currently uncertain whether this will be retain for the actual Debian 8 release as the developers want to discuss the standard desktop once more in August. If necessary, the decision taken then will be considered once again, which is possible since the main development phase doesn’t end until 5th November; this “freeze” is typically followed by a stabilisation phase lasting several months before the distribution is finally released.

    However, some of the features introduced with the alpha might not be altered any further. Thus there is no alpha version for Itanium (IA-64) processors because the Debian Project will not be supporting this processor architecture in future. In addition, the IBM S390 architecture has been replaced with the S390x architecture.

    The AMD64 edition of the first alpha of Jessie takes up three DVDs and uses a kernel which is based on Linux 3.13. Amongst this releases known problems is a bug that missing firmware files cannot be reloaded.

    I’m already running Jessie on one of my machines, but did an upgrade on an existing machine, rather than a fresh install, and am finding it very reliable and stable. Read about my experience.

  • South Tyrol region to save €1 mn. with free software

    Südtirol coat of armsThe government of Italy’s Autonomous Province of South Tyrol wants to save &euro 1 mn. per year by using free software, according to a press release issued last week.

    Public sector organisations took the first step towards the use of free software nearly one year ago with the change to the LibreOffice productivity suite (news passim). On 11th March 2014 the regional government decided to continue to pursue this route and to resort to the use of open source where possible. “We are expecting savings of one million Euro per year through free software,” declares governor Arno Kompatscher.

    “The use of proprietary or free software has in the meantime degenerated into almost a religious war, not only in the public sector, but also in private businesses,” Kompatscher continued, speaking after a regional government meeting.

    During the preceding legislature period the region and regional government had already made a decision in principle to opt for the use of free software. In June last year the first major step towards free software was made with the change from MS Office to LibreOffice. The regional government alone migrated 7,000 to the open source office suite.

    Governor Kompatscher stressed that it was not a matter of using free software exclusively, but to find the best solution as regards citizens: “We’re standing by using free software. However, it’s not a matter of deciding between free and proprietary software, but between requirements.” Free software, according to Kompatscher, is not always suitable, but: “Citizens must always have access to public sector documents without having to resort to paid-for software as well. That is the key issue”.

    The city of Munich is acting as an example for the use of free software in government. “For example, Munich’s city council is using free software; in spite of this ten per cent of its computers are still running proprietary programs. We’re aligning ourselves with this. There will be no either or; the principal objective is friendliness towards citizens,” Kompatscher emphasises.

    The governor also refers to the potential savings arising from free software: “A very, very large amount of money is involved. The target is savings of one million euro per year.” Just from its first major step, switching to LibreOffice should save the regional government paying Microsoft some €600,000 in licence fees in the next few years.

  • Bloggers under attack as pingback abused

    WordPress logoWordPress’ pingback function can be abused to mount a denial of service (DoS) attack on blogs without their owners noticing, Germany’s Heise IT website reports. It is unlikely that the problem will be remedied with an update.

    Security company Sucuri reports on an attack on one WordPress installation in which more than 162,000 other WordPress sites were misused as a DDoS platform. In this instance the attacker used the software’s pingback function in order to cripple the target website. With a pingback, one WordPress site can notify another that it has quoted its blog post.

    The attack works in such a manner that an attacker searches for a legitimate blog which has pingback activated (currently the default configuration for new WordPress installations) and then simulate a pingback from the victim’s site. The victim’s blog then queries the victim for the post which was quoted in the faked pingback. If the attacker does this with many sites, the flood of traffic is difficult for the target site to black as the queries look completely legitimate and originate from trustworthy sources.

    In the attack observed by Sucuri randomly generated URLs were quoted in the fake pingbacks for the victim’s ostensible posts. This results in WordPress’ caching mechanism does not take effect and the web server is so overloaded since the database must attempt to deliver the supposed posts for each request. Of course, in reality the server just serves up 404 error pages, since there are no pages for the random URLs. Nevertheless, if there are many such requests, this is sufficient to cripple the WordPress installation’s database. On Unix and Linux systems such an attack can be launched very simply by using the curl command on the command line.

    As the attack is misusing the ordinary working of the pingback function, it cannot be assumed that WordPress developers are going to do something about the problem. Site owners can prevent their blog being miused in this way by deactivating their installation’s pingback functions. Sucuri itself is proposing source code for a WordPress plug-in which should block the attacks, as follows:

    add_filter( ‘xmlrpc_methods’, function( $methods ) {
    unset( $methods[‘pingback.ping’] );
    return $methods;
    } );

    Use Sicuri’s WordPress DDoS Scanner to check if your site is being used for launching such attacks on other websites.

  • OmegaT, the basics

    OmegaT is a free and open source translation memory application written in Java. It’s a tool intended for professional translators.

    OmegaT has the following features:

    • Fuzzy matching
    • Match propagation
    • Simultaneous processing of multiple-file projects
    • Simultaneous use of multiple translation memories
    • User glossaries with recognition of inflected forms
    • Document file formats include:
      Open Document Format (the native format of the LibreOffice, OpenOffice and Calligra office suites)
      Microsoft Word, Excel, Powerpoint (.docx, .xlsx, .pptx)
      XHTML and HTML
      MediaWiki (Wikipedia)
      Plain text
      …plus about 30 other file formats
    • Unicode (UTF-8) support: can be used with non-Latin alphabets
    • Support for right-to-left languages
    • Integral spelling checker
    • Compatible with other translation memory applications (TMX, TTX, TXML, XLIFF, SDLXLIFF)

    OmegaT is cross-platform: it will run on any system on which the JRE (Java Runtime Environment) has been or can be installed.

    Over on YouTube, user weasel75 has produced a short (10 minutes) tutorial on the basics of OmegaT. Hopefully you’ll find it as useful as I did.

Posts navigation