Professional Documents
Culture Documents
Abstract—Sharing documents across different platforms is highly 2. Uploads the file and the metadata to the server
necessary nowadays. However, many devices, as embedded ones,
cannot deal with formats like PDF or PowerPoint, despite their 3. Performs all necessary tasks to convert the
widespread use. In this paper we propose an automated format document to the desired format
conversion system to allow the visualization of formats that are 4. Carries out database updating to allow users to
not supported in simple multimedia embedded devices. access the document in the output format
Format conversion, multimedia, embedded devices.
A. PDF conversion
I. INTRODUCTION We use the ImageMagick tool [4]. The -density parameter
defines the resolution at which the document is rendered, with
Some documents we handle in everyday life can only be a default value of 72 dpi (100%). At this density, a typical A4
viewed in advanced electronic devices such as desktop page maps to a 595×842 pixel image.
computers or laptops. This is the case of PDF documents or
slideshows. In some scenarios, the possibility of playing those Note that -density is not equivalent to a simple bitmap
contents on a set-top-box or any other screen-enabled rescaling (-geometry), since it exploits the vector properties of
embedded device would be of great interest. PDF documents.
We present an automated conversion system from PDF and Although 72 dpi may be enough for some documents, in
PowerPoint to portable graphics, to allow the visualization of other cases a higher resolution must be chosen, for instance if
those commonly used formats on simple embedded multimedia the document includes tiny fonts that would be otherwise
devices like video players, digital photo frames or IP set-top illegible. The resulting image usually exceeds screen
boxes. These devices usually include a plain image viewer or a dimensions, and scrolling is necessary. This is irrelevant, since
very limited web browser. Our conversion system is also of the orientation of PDF pages is usually vertical.
interest for institutional and educational IP TV environments We automatically obtain an image for every page in the
with centralized multimedia servers. document. For example, the ImageMagick command line
Regarding the final image format, JPEG is suitable when convert -density 144 foundations.pdf foun-%d.png
the original document includes true color images or generates images foun-0.png, foun-1.png... To extract a
photographs. However, we recommend PNG when possible, single specific page from the document, it is possible to run
convert -density 144 foundations.pdf [0] cover.png.
since it does not incur compression losses.
There exist other tools, mainly desktop applications, to B. PowerPoint conversion
convert PDF and PPT files to images [1-3,7-9]. But they have For this MS Office proprietary format we need a reliable
not been conceived as part of an automated framework, and free rendering tool, which should be adequate for an automated
they not follow a submit-and-forget remote server-based framework. A specially designed OpenOffice [5] macro
approach. We do not impose any computational requirements performs image conversion from PPT and PPS documents.
for document conversion or playback at the client side.
Since the process that invokes the macro is a PHP server
II. SYSTEM DESCRIPTION daemon, it is necessary to create a virtual video buffer before
launching OpenOffice. A bash script deals with this
Our document conversion system is a Linux-based tool that intermediate step and then calls OpenOffice with proper
is part of a video on-demand educational platform. To upload a arguments. The basic script is the following:
document to the system there is a PHP web script that performs
the following operations: #!/bin/sh
pid=`ps -eaf | grep Xvfb | grep :20`
1. Presents a template to enter all the document- if [ ! $pid ]; then
related info xauth add :20 . `/usr/bin/mcookie` > dev/null
III. CONCLUSIONS
We present an automated conversion system to adapt
document formats to simple embedded devices. Unlike
previous approaches, it does not impose any requirements for
content conversion or playback at the client side. As future
improvements, we plan to support new special formats, such
as Macromedia Flash. Another interesting feature would be
inserting delays between slides, and even exporting some
audiovisual effects to enrich slideshows.