Apple iWork support in search in Office documents

Posted: 2013-08-30, 15:53 UTC
by Lefteous
There is a new search option for finding text in Office documents. It would be great if Apple iWork documents could be searched as well.

Technical information:
Typical filename extensions: .key, .numbers, .pages
Compression: Standard ZIP
XML document: A single file called "index.apxl" in the ZIP file root folder.
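
To illustrate the layout described above, here is a minimal Python sketch that pulls the XML out of such a container. The function name `read_iwork_xml`, the default entry name parameter, and the UTF-8 decoding are assumptions made for illustration; this is not how TC reads the files.

```python
import io
import zipfile


def read_iwork_xml(source, entry_name="index.apxl"):
    """Return the XML text stored in the root of an iWork ZIP container.

    `source` may be a path or a file-like object. The entry name defaults
    to the one mentioned in this thread; UTF-8 is an assumption.
    """
    with zipfile.ZipFile(source) as archive:
        raw = archive.read(entry_name)  # raises KeyError if the entry is missing
    return raw.decode("utf-8")


def make_sample_container(xml_text, entry_name="index.apxl"):
    """Build a tiny in-memory ZIP that mimics the layout, for testing only."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        archive.writestr(entry_name, xml_text.encode("utf-8"))
    buffer.seek(0)
    return buffer


sample = make_sample_container("<doc><p>Grüße aus Köln</p></doc>")
xml_text = read_iwork_xml(sample)
```

Passing the entry name as a parameter keeps the sketch usable if a given file stores its XML under a different name.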

Posted: 2013-08-30, 15:55 UTC
by ghisler(Author)
Sounds like ordinary ZIP-packed XML, so it shouldn't be hard to add. Can you please send a sample of each (.key, .numbers, .pages) containing a short German text with umlauts to support at ghisler dot com?

Posted: 2013-09-03, 15:44 UTC
by Lefteous
Thanks for adding this feature in beta 2 ;-)

I found out that TC seems to handle the XML file as plain text. The XML structure isn't considered. Is this a bug or a known limitation?

Example: There are three files, only one of which contains the word 'Bratwurst'.
When I search for 'Brat', all three files are found. Why? All of these XML files contain the attribute value 'sfa:calibrated-rgb-color-type', and 'calibrated' contains the letters 'brat'.

In iWork files, the document text is always element content (text inside elements), never an attribute value.
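
The difference can be sketched in Python. The sample markup below is made up for illustration (only the attribute value comes from the real files, and the namespace URI is a placeholder), and the case-insensitive comparison is an assumption about the search settings:

```python
import xml.etree.ElementTree as ET

# Hypothetical document: no 'Bratwurst' anywhere in the visible text,
# but the attribute value contains 'calibrated'.
SAMPLE = (
    '<doc xmlns:sfa="http://example.invalid/sfa">'
    '<color sfa:type="sfa:calibrated-rgb-color-type"/>'
    '<p>Kartoffelsalat</p>'
    '</doc>'
)


def raw_match(xml_text, needle):
    """Plain-text search over the whole XML, markup and attributes included."""
    return needle.lower() in xml_text.lower()


def content_match(xml_text, needle):
    """Search only the text inside elements, ignoring tags and attributes."""
    root = ET.fromstring(xml_text)
    text = "".join(root.itertext())
    return needle.lower() in text.lower()


print(raw_match(SAMPLE, "Brat"))      # True: hit inside 'calibrated'
print(content_match(SAMPLE, "Brat"))  # False: no such text in the content
```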

Posted: 2013-09-03, 20:13 UTC
by ghisler(Author)
Unfortunately, parsing the XML is currently far beyond what the function can do. I rely on the fact that in normal texts, text attributes like bold do not change in the middle of a word. You will need to use a special "text search" plugin to handle such cases.
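
A small Python sketch of the case this assumption excludes: if formatting did change in the middle of a word, a tag would split the word in the raw XML. The markup is hypothetical and not taken from an actual iWork file.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup: bold ends in the middle of the word.
SPLIT_WORD = "<p><b>Brat</b>wurst</p>"

# Plain-text search over the raw XML misses the word because of the tag.
raw_hit = "Bratwurst" in SPLIT_WORD

# Joining the element text first puts the word back together.
content_hit = "Bratwurst" in "".join(ET.fromstring(SPLIT_WORD).itertext())

print(raw_hit)      # False
print(content_hit)  # True
```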