tech

  • LibreOffice 7.0 beginner’s guide launched

    Cover of LibreOffice 7.0 Getting Started GuideThe Document Foundation’s blog announced last week that the LibreOffice Documentation Team had released its LibreOffice 7.0 Getting Started Guide. The Guide, which was previously issued for LibreOffice version 6.4, has been updated to include all the new and improved features of LibreOffice 7.0, the latest version of LibreOffice, the free and open source alternative to proprietary office suites.

    The guide has been drafted especially for those wanting to get up to speed quickly with LibreOffice, whether they are new users of office productivity software or already have some familiarity with other office suites, such as Microsoft’s ubiquitous and expensive offering.

    The guide provides an introduction the LibreOffice’s 6 major components, i.e.:

    • Writer (word processing)
    • Calc (spreadsheets)
    • Impress (presentations)
    • Draw (vector graphics)
    • Base (database)
    • Math (equation editor)

    Furthermore, it also covers some of the features common to all those components – set-up and customisation, styles and templates, macro recording, digital signing and printing.

    The guide can be downloaded (PDF format) from LibreOffice’s English Documentation site., which also includes links to documentation in other languages, as well as user guides for earlier LibreOffice releases.

  • The S key, German special characters and a Linux keyboard

    Occasionally in recent weeks, this blog has provided information on keyboard shortcuts for unusual characters (unusual for English that is. Ed.) on a Linux keyboard.

    The last of these took the umlaut (diaresis) as its subject (posts passim).

    German road sign for Schloßstraße in Erfurt
    German road sign for Schloßstraße in Erfurt

    Today, attention turns once again to German and the s key, which can produce two characters, depending upon the combination of keystrokes.

    Depressing the AltGr key and s produces “ß“, the German sharp s or esszett, usually transcribed in English as ss.

    The other character that can be produced is “§“, which can be produced with the AltGr, Shift and s keys.

    Known as the Section sign, it is believed to originate from the Latin signum sectionis, meaning section sign and usually turns up in with reference to legal documents.

    Where more than one section of a legal text is involved, the sign is repeated, i.e §§.

  • Focus on OCR

    The way a completed translation has been produced has changed markedly over the decades since my first days as a translator for Imperial Tobacco in Bedminster, Bristol.

    In those days I’d write out the translation in longhand from printed source material and take my manuscript to the typing pool where it would be transformed into typescript.

    The next big change came with my learning how to touch-type. By this time I was a freelance with no more access to a typing pool.

    In my early freelance days, it was rare to get editable copy that one could overkey with one’s usual word processor, spreadsheet or presentation package. The standard way of working was still from hard copy propped up in a copyholder alongside one’s keyboard.

    Then there came a large surge in the use of formats such as PDF – Portable Document Format. This format enables documents, including text formatting and images, to be presented in a manner independent of application software, hardware and operating systems.

    If the PDF was text-based, one could simply export the text as plain ASCII text or copy and paste it into a word processor.

    However, if I had an image-based PDF to work with, my usual answer was to print it out as hard copy to be propped up in a copyholder alongside my keyboard. This was very expensive in terms of paper and other consumables for the printer, even with a machine as parsimonious as my trusty mono laser printer, whose cartridge was good for printing 3,000 or so pages of copy.

    In addition to the expense of printing, there was a far greater drawback to bear in mind, i.e. one could easily miss a sentence or paragraph from the original text when keying in the translated from a hard copy original, with the consequent implications for the quality of the finished work and the client’s satisfaction with it.

    Then I discovered OCR – Optical Character Recognition – the mechanical or electronic conversion of images of typed, handwritten or printed text into machine-encoded text.

    Here’s a short video explaining the basics of OCR.

    My preferred OCR package is gImageReader and – as with the software I recommend for use by translators – is open source and available for both Linux and Windows.

    Gimagereader in action on Italian language PDF
    gImageReader in action on Italian language PDF

    gImageReader provides a simple graphical front-end to the tesseract OCR engine. The features of gImageReader include:

    • Importing PDF documents and images from disk, scanning devices, clipboard and screenshots;
    • Process multiple images and documents in one go;
    • Manual or automatic recognition area definition;
    • Recognising to plain text or to hOCR documents;
    • Recognized text displayed directly next to the image;
    • Post-processing of the recognised text, including spellchecking;
    • Generating PDF documents from hOCR documents.

    I generally just stick scanning the input file to plain text, which can then be fed into a regular office suite for translation. If your office suite can handle HTML that’s the format gImageReader outputs as its hOCR output.

    The tesseract OCR engine mentioned above can also be enhanced with language packs for post-recognition spellchecking, as mentioned in the features above. At present, tesseract can recognise over 100 different languages.

    In addition to GUI-based OCR, there are also Linux packages available which can perform OCR via the command line interface.

    My tool of choice here is OCRmyPDF.

    ocrmypdf in action in KDE Konsole terminal
    ocrmypdf being used in KDE’s Konsole terminal to add OCR layer to Spanish language PDF

    OCRmyPDF is a package written in Python 3 that adds OCR layers to PDFs and, like gImageReader, also uses the tesseract OCR engine.

    Using OCRmyPDF on the command line is simplicity itself (as shown in the screenshot above:

    ocrmypdf -l [language option] inputfile.pdf outputfile.pdf

    More complicated command options are possible, but after using that simple string above, you’ll be able to extract the text from your formerly image-based PDF ready for translation.

    By way of conclusion depending on the software itself, OCR packages can also extract text from images such as .jpg files.

  • Easy umlauts on a Linux keyboard

    Some weeks ago, I blogged about the keyboard shortcut for guillemets – French quotation marks – on a Linux keyboard (posts passim).

    My attention in this post is on the German umlaut, also known as diaresis (or in French as a trema. Ed.) the two dots placed over a vowel modifying its pronunciation.

    Once again, one could always use the character map to insert a specific vowel with an umlaut.

    KCharselect with an upper case A umlaut selected
    KCharselect with an upper case A umlaut selected

    However, the keyboard shortcut is much quicker.

    To produce the letter a with an umlaut – “ä“, follow these steps.

    Depress AltGr key and the left-hand square bracket “[” followed by “a“.

    The AltGr and left-hand bracket symbol plus the vowel of your choice will give you that character plus an umlaut.

    For the upper case version, I find the easiest way to avoid knotting your fingers is to turn on the CapsLock key before the AltGr key and the left-hand square bracket “[” plus vowel sequence.

  • LibreOffice 6.2 released

    Yesterday The Document Foundation (TDF) announced the release of LibreOffice 6.2, a significant major release of the free and open source office suite which features a radical new approach to the user interface – based on the MUFFIN concept – and provides user experience options to meet all users’ preferences.

    LibreOffice 6.2 banner

    The NotebookBar is available in Tabbed, Grouped and Contextual versions. Each one has a different approach to the menu layout and complements the traditional Toolbars and Sidebar. The Tabbed variant aims to provide a familiar interface for users coming from suites such as MS Office and is supposed to be used primarily without the sidebar, while the Grouped one allows to access “first-level” functions with one click and “second-level” functions with a maximum of two clicks.

    The design community has also made substantial changes and improvements to icon themes, in particular Elementary and Karasa Jaga.

    LibreOffice 6.2 new and improved features

    • The help system offers faster filtering of index keywords, highlighting search terms as they are typed and displaying results based on the selected module.
    • Context menus have been tidied up, to be more consistent across the different components in the suite.
    • Change tracking performances have been dramatically improved, especially in large documents.
    • In Writer, it is now possible to copy spreadsheet data into tables instead of just inserting them as objects.
    • In Calc it is now possible to do multivariate regression analysis using the regression tool. In addition, many more statistical measures are now available in the analysis output and the new REGEX function has been added, to match text against a regular expression and optionally replace it.
    • In Impress and Draw the motion path of animations can now be modified by dragging its control points. In addition, a couple of text-related drawing styles have been added, as well as a Format Table submenu in Draw.
    • LibreOffice Online, the cloud-based version of the suite, includes many improvements too. On mobile devices, the user interface has been simplified, with better responsiveness and updates to the on-screen keyboard.

    As with every major and minor release of LibreOffice, interoperability with proprietary file formats has also been improved for better compatibility with Office documents, including old versions which have been dropped by Microsoft. The focus has been on charts, animations and document security features. To assist with interoperability, LibreOffice 6.2 is built with document conversion libraries from the Document Liberation Project.

    LibreOffice 6.2’s new features have been developed by a large community of contributors: 74% of commits are from developers employed by companies on the TDF’s the Advisory Board, such as Collabora, Red Hat and CIB and by other contributors such as the City of Munich. Individual volunteers account for 26% of commits.

    In addition, there is a global community of individual volunteers taking care of quality assurance, software localization, user interface design and user experience, editing the help pages and documentation.

    LibreOffice 6.1.5 for commercial deployments

    The Document Foundation has also released LibreOffice 6.1.5, a more mature version which includes some months of back-ported fixes and is better suited for commercial deployments, where features are less important as individual productivity is the main objective.

    Companies wishing to deploy LibreOffice are advised to seek assistance for such matters as software support, migrations and training from qualified professionals.

    Download LibreOffice 6.2 or LibreOffice 6.1.5

    LibreOffice 6.2 and LibreOffice 6.1.5 are now available for immediate download. Builds of the latest LibreOffice Online source code are also available, released as Docker images.

    LibreOffice Online is fundamentally a server service and should be installed and configured by adding cloud storage and an SSL certificate. It might be considered an enabling technology for the cloud services offered by ISPs or the private cloud of enterprises and large organisations.

    LibreOffice users, free software advocates and community members are encouraged to support The Document Foundation with a donation.

  • Guillemets (French quotation marks) on a Linux keyboard

    My first experiences of computing took place before the widespread use of graphical user interfaces (GUIs).

    Consequently, I use a lot of keyboard* shortcuts.

    These can also be used to create individual characters and, if known, represent an alternative such as using a visual character map, such as KCharSelect, the character map on the KDE desktop environment on my Linux machines.

    KCharselect with the left-hand guillemet selected
    KCharselect with the left-hand guillemet selected

    So what’s the keyboard shortcut alternative for French quotation marks?

    On Linux, most special characters can be inserted into a text editor or office package using the AltGr key plus one or two other keystrokes. If you have the patience to learn them, they can save a lot of time.

    For the left guillemet, AltGr+z produces «.

    For the right guillemet, AltGr+x produces ».

    As you can see, it’s a lot quicker than using a GUI-based alternative.

    * = I’ve always used a standard EN-GB keyboard layout.

  • Public money, public code – new FSFE brochure

    Let’s start with a trio of questions.

    1. Why should governments develop free software*?
    2. Where is free software already generating benefits in the public sector?
    3. What are free software business models?

    Answers to the above questions and practical guidelines are given in the new expert policy brochure published today by the Free Software Foundation Europe.

    Entitled “Public Money Public Code – Modernising Public Infrastructure with Free Software“, the brochure aims to answer decision-takers’ questions about the benefits of using and developing free software for public sector organisations.

    To help understand the important role that public sector procurement plays in this, the brochure presents an overview of EU free software projects and policies, uncovering legislation on software procurement.

    The FSFE will use this brochure in the forthcoming European Parliament elections to inform potential MEPs how to speed up the distribution and development of free software in the public sector and putting appropriate legislation in place.

    Download the brochure (PDF).

    Front cover of FSFE brochure

    The brochure evaluates the modernisation of public infrastructure by using free software from the perspectives of academia, law, business and government. Expert articles, reports and interviews help readers to understand the opportunities for free software in the public sector. Practical guidance is provided for decision-makers to move forward and start modernising public infrastructure with free software.

    FSFE President Matthias Kirschner states: “Free software licences have proven to generate tremendous benefits for the public sector. This is not a trend that will pass, but rather a long-term development that is based on very positive experiences and strategic considerations resulting from serious vendor lock-in cases in the past. In a few years, free software licences could become the default setting for publicly-financed IT projects. The Free Software Foundation Europe watches these developments very carefully and we want to contribute our knowledge to support the public sector in this transition.”

    Initial steps for making free software licenses the default in publicly-financed IT projects are outlined in the brochure. Other topics include competition and potential vendor lock-in, security, democracy, “smart cities” and other important contemporary topics. The language and examples used have been specifically chosen for readers interested in politics and public administrations.

    The brochure features leading experts from various ICT areas. Amongst others, these include Francesca Bria, Chief of Technology and Digital Innovation Officer (CTIO) for the Barcelona City Council, Prof. Dr. Simon Schlauri, author of a detailed legal analysis on the benefits of free software for the Swiss canton of Bern, Cedric Thomas, CEO of OW2, Matthias Stürmer, head of the Research Center for Digital Sustainability at the University of Bern and Basanta Thapa from the Competence Center for Public IT (ÖFIT) within the Fraunhofer Institute for Open Communication Systems.

    The brochure is available in both digital and printed formats, and is published under a CC BY-SA 4.0 licence and I’ll be sending a copy of the PDF to my MP for her information.

    * = In this context the definition of free software is free as in freedom, not beer.

  • French IT website praises LibreOffice 6

    Yesterday French IT website Le Monde Informatique posted a review of the free and open source LibreOffice 6 office suite.

    LibreOffice 6 splash screen

    Author Michael Ansaldo speaks warmly of the office suite your ‘umble scribe has been using since its inception in 2010, following the mass departure of OpenOffice.org developers from Sun Microsystems following its takeover by Oracle.

    Translated into English, Ansaldo’s final paragraph reads as follows:

    In summary, amongst the notable features of LibreOffice 6, we note its excellent compatibility with the [Microsoft] Office formats, as well as an interface that will not disorientate the aficionados of Microsoft’s office suite. Nevertheless, some features are lacking, such as integrated cloud storage or even joint real-time editing. Anyway, LibreOffice 6 is still the best choice for open source fans and all those wanting compatibility with Office without buying Microsoft Office. Its availability for multiple platforms and its frequent updates also make it a clear choice for individuals and businesses.

Posts navigation