Install catdoc by entering the following commands in the terminal:
sudo apt update
sudo apt install catdoc
Description:
text extractor for MS-Office files
The catdoc program reads one or more Microsoft Word files and outputs their contents to standard output as text.

It is accompanied by xls2csv, a program which converts Excel spreadsheets into comma-separated-values format, and catppt, a utility to extract textual information from PowerPoint files.

It doesn't try to preserve Word formatting; its goal is to extract plain text and allow you to read it (and, probably, reformat it with TeX).

This package suggests tk because it also includes wordview, an optional Tk-based GUI for catdoc. The MIME config provided in this package will use wordview if X is running, or catdoc directly if it is not.
Homepage: http://www.wagner.pp.ru/~vitus/software/catdoc/
Version: 1:0.95-4.1
Section: universe/text
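
Because catdoc writes the extracted text to standard output, saving it to a file is a matter of shell redirection. The following is a minimal sketch rather than anything shipped with the package: the helper name doc_to_text and the default output naming (same base name with a .txt extension) are assumptions made for illustration.

```shell
#!/bin/sh
# doc_to_text is a hypothetical helper, not part of the catdoc package.
# Usage: doc_to_text INPUT.doc [OUTPUT.txt]
doc_to_text() {
    input=$1
    # Assumed default: replace the input's extension with .txt
    output=${2:-${input%.*}.txt}

    if [ ! -f "$input" ]; then
        echo "no such file: $input" >&2
        return 1
    fi
    if ! command -v catdoc >/dev/null 2>&1; then
        echo "catdoc is not installed" >&2
        return 127
    fi

    # catdoc prints plain text on standard output; redirect it to the file
    catdoc "$input" > "$output"
}
```

With the function loaded in the current shell, `doc_to_text report.doc` would write the extracted text to report.txt (report.doc is a placeholder file name). The companion tools xls2csv and catppt also print to standard output, so the same redirection pattern should apply to them.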