Tech

  • KDE Plasma 5.21 preview

    Ever since I first started using Linux seriously 16 years ago, KDE has always been my preferred desktop environment.

    I’ve dabbled with GNOME and other desktops, but always come back to KDE for usability, customisability and good looks.

    Now known as Plasma, the KDE desktop will soon be reaching the release of its 5.21 version.

    Below is a sneak preview of what can be expected when the new version is released, together with a tribute at the end to Gustavo Carneiro, a KDE developer who unfortunately passed away from Covid-19 in January and whose work concentrated on KDE’s Konsole terminal emulator.

    This 5.21 release of Plasma is dedicated to Gustavo’s memory.

  • TDF is 9 years old

    Today The Document Foundation (TDF), the organisation behind the free and open source LibreOffice productivity suite, celebrates its 9th birthday.

    On 17th February 2012 The Document Foundation was registered in Berlin as a German charitable foundation (Stiftung).

    TDF had been created by members of the OpenOffice.org community to manage and develop LibreOffice, partially out of fears that Oracle Corporation would cease development of OpenOffice.org after its takeover of Sun Microsystems, the custodians of the OpenOffice.org project. The original OpenOffice.org project is now curated by the Apache Software Foundation.

    The TDF’s goal is to produce a vendor-independent office suite with ODF support in a development environment free from control by an individual company.

    This goal has been achieved too. LibreOffice is now on release version 7.1, is included as the standard office suite in many GNU/Linux distributions and has been downloaded millions of times.

    Many happy returns, TDF!

  • How to make a speech bubble in GIMP

    GIMP, also known as the GNU Image Manipulation Program, is a great free and open source graphics creation and editing suite, which comes as part of the standard software installation package for many GNU/Linux distributions, although it is also available for other operating systems.

    My main use for GIMP is for dealing with graphics containing text in files submitted for translation, as well as tweaking digital photographs.

    Nevertheless, every now and again the urge strikes me to learn a bit more to make the most of the software installed on my systems.

    If, like me, you have ever wondered how to add a speech bubble to an image using GIMP, help is at hand in the form of the handy video tutorial below.

    Now you too know how to put words into other people’s mouths. 😀

  • The S key, German special characters and a Linux keyboard

    Occasionally in recent weeks, this blog has provided information on keyboard shortcuts for unusual characters (unusual for English that is. Ed.) on a Linux keyboard.

    The last of these took the umlaut (diaeresis) as its subject (posts passim).

    German road sign for Schloßstraße in Erfurt

    Today, attention turns once again to German and the s key, which can produce two characters, depending upon the combination of keystrokes.

    Depressing the AltGr key and s produces “ß”, the German sharp s or Eszett, usually transcribed in English as ss.

    The other character is “§”, which is produced by pressing the AltGr, Shift and s keys together.

    Known as the section sign, it is believed to originate from the Latin signum sectionis, meaning section sign, and usually turns up in connection with legal documents.

    Where more than one section of a legal text is involved, the sign is repeated, i.e. §§.
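
    For anyone who wants to check which characters have actually landed in a file, the two can be inspected with a few lines of Python 3. This is only a minimal sketch using the standard library; the constant and function names are my own, and nothing in it depends on the keyboard layout.

```python
import unicodedata

# The two characters reached via the s key, as described above.
SHARP_S = "ß"       # AltGr + s
SECTION_SIGN = "§"  # AltGr + Shift + s


def describe(char: str) -> str:
    """Return the code point and official Unicode name of one character."""
    return f"U+{ord(char):04X} {unicodedata.name(char)}"


if __name__ == "__main__":
    print(describe(SHARP_S))
    print(describe(SECTION_SIGN))
    # Python's upper-casing follows the usual "ss" transcription of the sharp s.
    print("Schloßstraße".upper())
```

    The last line is a handy reminder that upper-casing a word containing a sharp s makes it one character longer for each ß it contains.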

  • Focus on OCR

    The way a completed translation has been produced has changed markedly over the decades since my first days as a translator for Imperial Tobacco in Bedminster, Bristol.

    In those days I’d write out the translation in longhand from printed source material and take my manuscript to the typing pool where it would be transformed into typescript.

    The next big change came with my learning how to touch-type. By this time I was a freelance with no more access to a typing pool.

    In my early freelance days, it was rare to get editable copy that one could overkey with one’s usual word processor, spreadsheet or presentation package. The standard way of working was still from hard copy propped up in a copyholder alongside one’s keyboard.

    Then there came a large surge in the use of formats such as PDF – Portable Document Format. This format enables documents, including text formatting and images, to be presented in a manner independent of application software, hardware and operating systems.

    If the PDF was text-based, one could simply export the text as plain ASCII text or copy and paste it into a word processor.

    However, if I had an image-based PDF to work with, my usual answer was to print it out as hard copy to be propped up in a copyholder alongside my keyboard. This was very expensive in terms of paper and other consumables for the printer, even with a machine as parsimonious as my trusty mono laser printer, whose cartridge was good for printing 3,000 or so pages of copy.

    In addition to the expense of printing, there was a far greater drawback to bear in mind, i.e. one could easily miss a sentence or paragraph from the original text when keying in the translation from a hard copy original, with the consequent implications for the quality of the finished work and the client’s satisfaction with it.

    Then I discovered OCR – Optical Character Recognition – the mechanical or electronic conversion of images of typed, handwritten or printed text into machine-encoded text.

    Here’s a short video explaining the basics of OCR.

    My preferred OCR package is gImageReader which, as with the software I recommend for use by translators, is open source and available for both Linux and Windows.

    gImageReader in action on Italian language PDF

    gImageReader provides a simple graphical front-end to the tesseract OCR engine. The features of gImageReader include:

    • Importing PDF documents and images from disk, scanning devices, clipboard and screenshots;
    • Processing multiple images and documents in one go;
    • Manual or automatic recognition area definition;
    • Recognising to plain text or to hOCR documents;
    • Recognised text displayed directly next to the image;
    • Post-processing of the recognised text, including spellchecking;
    • Generating PDF documents from hOCR documents.

    I generally just stick to scanning the input file to plain text, which can then be fed into a regular office suite for translation. If your office suite can handle HTML, the hOCR output is an alternative, since that is the format gImageReader uses for it.
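
    Should you ever end up with an hOCR file and only want the words, they can be pulled out with Python 3’s standard library. The following is a rough sketch, not a description of how gImageReader works internally. It assumes that each recognised word sits in a span element with the class ocrx_word and each line in an element with the class ocr_line, which is the usual shape of tesseract’s hOCR output; the Italian sample is made up, and real files may use other line-level classes.

```python
from html.parser import HTMLParser


class HocrTextExtractor(HTMLParser):
    """Collect the words of an hOCR document, one text line per ocr_line."""

    def __init__(self):
        super().__init__()
        self.lines = []        # completed lines of text
        self._words = []       # words of the line currently being read
        self._in_word = False  # True while inside an ocrx_word element

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if "ocr_line" in classes:
            self.flush_line()  # a new line starts, so close the previous one
        elif "ocrx_word" in classes:
            self._in_word = True
            self._words.append("")

    def handle_endtag(self, tag):
        # Assumption: word elements are spans, so a closing span ends the word.
        if tag == "span":
            self._in_word = False

    def handle_data(self, data):
        if self._in_word:
            self._words[-1] += data

    def flush_line(self):
        """Move the words gathered so far into the list of finished lines."""
        if self._words:
            self.lines.append(" ".join(self._words))
            self._words = []


def hocr_to_text(hocr: str) -> str:
    """Return the recognised text of an hOCR string as plain text."""
    parser = HocrTextExtractor()
    parser.feed(hocr)
    parser.close()
    parser.flush_line()  # the last line has no following ocr_line to close it
    return "\n".join(parser.lines)


# Made-up sample in the general shape of hOCR output.
SAMPLE_HOCR = """
<div class='ocr_page'>
 <span class='ocr_line'>
  <span class='ocrx_word'>Buongiorno</span>
  <span class='ocrx_word'>a</span>
  <span class='ocrx_word'>tutti</span>
 </span>
 <span class='ocr_line'>
  <span class='ocrx_word'>Grazie</span>
 </span>
</div>
"""

if __name__ == "__main__":
    print(hocr_to_text(SAMPLE_HOCR))
```

    For everyday work the plain text export remains the simpler route; a script like this only earns its keep when the hOCR file is all you have been sent.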

    The tesseract OCR engine mentioned above can also be enhanced with language packs for post-recognition spellchecking, as mentioned in the features above. At present, tesseract can recognise over 100 different languages.

    In addition to GUI-based OCR, there are also Linux packages available which can perform OCR via the command line interface.

    My tool of choice here is OCRmyPDF.

    ocrmypdf being used in KDE’s Konsole terminal to add OCR layer to Spanish language PDF

    OCRmyPDF is a package written in Python 3 that adds OCR layers to PDFs and, like gImageReader, also uses the tesseract OCR engine.

    Using OCRmyPDF on the command line is simplicity itself (as shown in the screenshot above):

    ocrmypdf -l [language option] inputfile.pdf outputfile.pdf

    More complicated command options are possible, but after using that simple string above, you’ll be able to extract the text from your formerly image-based PDF ready for translation.
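
    When a whole batch of PDFs needs the same treatment, the command can also be driven from a short Python 3 script. What follows is a minimal sketch under a few assumptions: ocrmypdf is installed and on the PATH, the function names are my own, and the language code passed in (for example spa for Spanish) is one that the installed tesseract actually supports.

```python
import subprocess


def build_ocrmypdf_command(language, input_pdf, output_pdf):
    """Build the argument list for: ocrmypdf -l LANGUAGE INPUT OUTPUT."""
    return ["ocrmypdf", "-l", language, input_pdf, output_pdf]


def add_ocr_layer(language, input_pdf, output_pdf):
    """Run OCRmyPDF on one file.

    Raises subprocess.CalledProcessError if ocrmypdf exits with an error,
    and FileNotFoundError if ocrmypdf is not installed.
    """
    command = build_ocrmypdf_command(language, input_pdf, output_pdf)
    subprocess.run(command, check=True)
```

    Calling add_ocr_layer("spa", "inputfile.pdf", "outputfile.pdf") does the same job as typing the command by hand. If in doubt about the language code, running tesseract --list-langs in a terminal lists the ones installed on your machine.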

    By way of conclusion, depending on the software, OCR packages can also extract text from images such as .jpg files.

  • Easy umlauts on a Linux keyboard

    Some weeks ago, I blogged about the keyboard shortcut for guillemets – French quotation marks – on a Linux keyboard (posts passim).

    My attention in this post is on the German umlaut, also known as a diaeresis (or in French as a tréma. Ed.): the two dots placed over a vowel to modify its pronunciation.

    Once again, one could always use the character map to insert a specific vowel with an umlaut.

    KCharselect with an upper case A umlaut selected

    However, the keyboard shortcut is much quicker.

    To produce the letter a with an umlaut – “ä” – follow these steps.

    Depress the AltGr key and the left-hand square bracket “[” together, followed by “a”.

    The AltGr and left-hand bracket symbol plus the vowel of your choice will give you that character plus an umlaut.

    For the upper case version, I find the easiest way to avoid knotting your fingers is to turn on the CapsLock key before the AltGr key and the left-hand square bracket “[” plus vowel sequence.
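
    The “dots first, vowel second” sequence has a loose parallel in Unicode, where a base letter followed by the combining diaeresis (U+0308) can be composed into a single character. The Python 3 sketch below illustrates that idea with the standard library only. It shows how the characters are encoded, not how the keyboard driver itself works, and the function name is my own.

```python
import unicodedata

COMBINING_DIAERESIS = "\u0308"  # the two dots as a separate combining mark


def with_umlaut(vowel: str) -> str:
    """Return the vowel with an umlaut as one precomposed character."""
    return unicodedata.normalize("NFC", vowel + COMBINING_DIAERESIS)


if __name__ == "__main__":
    # Lower and upper case versions of the three German umlauts.
    print(" ".join(with_umlaut(v) for v in "aouAOU"))
```

    This can be useful when text arrives with the dots stored separately from the vowels, which occasionally happens with text copied out of PDFs.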

  • LibreOffice 6.2 released

    Yesterday The Document Foundation (TDF) announced the release of LibreOffice 6.2, a significant major release of the free and open source office suite which features a radical new approach to the user interface – based on the MUFFIN concept – and provides user experience options to meet all users’ preferences.

    LibreOffice 6.2 banner

    The NotebookBar is available in Tabbed, Grouped and Contextual versions. Each one has a different approach to the menu layout and complements the traditional Toolbars and Sidebar. The Tabbed variant aims to provide a familiar interface for users coming from suites such as MS Office and is supposed to be used primarily without the sidebar, while the Grouped one allows access to “first-level” functions with one click and “second-level” functions with a maximum of two clicks.

    The design community has also made substantial changes and improvements to icon themes, in particular Elementary and Karasa Jaga.

    LibreOffice 6.2 new and improved features

    • The help system offers faster filtering of index keywords, highlighting search terms as they are typed and displaying results based on the selected module.
    • Context menus have been tidied up, to be more consistent across the different components in the suite.
    • Change tracking performance has been dramatically improved, especially in large documents.
    • In Writer, it is now possible to copy spreadsheet data into tables instead of just inserting them as objects.
    • In Calc it is now possible to do multivariate regression analysis using the regression tool. In addition, many more statistical measures are now available in the analysis output and the new REGEX function has been added, to match text against a regular expression and optionally replace it.
    • In Impress and Draw the motion path of animations can now be modified by dragging its control points. In addition, a couple of text-related drawing styles have been added, as well as a Format Table submenu in Draw.
    • LibreOffice Online, the cloud-based version of the suite, includes many improvements too. On mobile devices, the user interface has been simplified, with better responsiveness and updates to the on-screen keyboard.

    As with every major and minor release of LibreOffice, interoperability with proprietary file formats has also been improved for better compatibility with Office documents, including old versions which have been dropped by Microsoft. The focus has been on charts, animations and document security features. To assist with interoperability, LibreOffice 6.2 is built with document conversion libraries from the Document Liberation Project.

    LibreOffice 6.2’s new features have been developed by a large community of contributors: 74% of commits are from developers employed by companies on TDF’s Advisory Board, such as Collabora, Red Hat and CIB, and by other contributors such as the City of Munich. Individual volunteers account for 26% of commits.

    In addition, there is a global community of individual volunteers taking care of quality assurance, software localization, user interface design and user experience, editing the help pages and documentation.

    LibreOffice 6.1.5 for commercial deployments

    The Document Foundation has also released LibreOffice 6.1.5, a more mature version which includes some months of back-ported fixes and is better suited for commercial deployments, where features are less important as individual productivity is the main objective.

    Companies wishing to deploy LibreOffice are advised to seek assistance for such matters as software support, migrations and training from qualified professionals.

    Download LibreOffice 6.2 or LibreOffice 6.1.5

    LibreOffice 6.2 and LibreOffice 6.1.5 are now available for immediate download. Builds of the latest LibreOffice Online source code are also available, released as Docker images.

    LibreOffice Online is fundamentally a server service and should be installed and configured by adding cloud storage and an SSL certificate. It might be considered an enabling technology for the cloud services offered by ISPs or the private cloud of enterprises and large organisations.

    LibreOffice users, free software advocates and community members are encouraged to support The Document Foundation with a donation.

  • Guillemets (French quotation marks) on a Linux keyboard

    My first experiences of computing took place before the widespread use of graphical user interfaces (GUIs).

    Consequently, I use a lot of keyboard* shortcuts.

    These can also be used to create individual characters and, if known, represent an alternative to using a visual character map such as KCharSelect, the character map on the KDE desktop environment on my Linux machines.

    KCharselect with the left-hand guillemet selected

    So what’s the keyboard shortcut alternative for French quotation marks?

    On Linux, most special characters can be inserted into a text editor or office package using the AltGr key plus one or two other keystrokes. If you have the patience to learn them, they can save a lot of time.

    For the left guillemet, AltGr+z produces «.

    For the right guillemet, AltGr+x produces ».

    As you can see, it’s a lot quicker than using a GUI-based alternative.

    * = I’ve always used a standard EN-GB keyboard layout.
