Re: what is the best way of storing text+image documents in postgresql - Mailing list pgsql-general

From John R Pierce
Subject Re: what is the best way of storing text+image documents in postgresql
Date
Msg-id 4DEFCF8F.1030900@hogranch.com
Whole thread Raw
In response to Re: what is the best way of storing text+image documents in postgresql  (Craig Ringer <craig@postnewspapers.com.au>)
Responses Re: what is the best way of storing text+image documents in postgresql
List pgsql-general
On 06/08/11 6:06 AM, Craig Ringer wrote:
>> 1. save .doc documents in bytea columns. and show them with a word
>> reader in web page (disadvantage: it needs a proper .doc reader
>> installed on user computer)
>
> 1a: Convert the .doc files to a standard format like PDF that most
> browsers can display. That's what I'd do.

thats harder to integrate with a website in the sense that the PDF
documents are hard page formatted, and can at best be displayed in an
<iframe> within your site, and half the time, only displayed in an
external PDF viewer since browser-pdf integration remains flakey and
bugridden after all these years.   PDF text won't flow to fit your page
layout, etc etc.

one approach to conversion might be to save the documents as an RTF type
format, and run that through a preprocessor that reencodes them as a
clean HTML or similar metalanguage that you can deal with intelligently.
    MS Word's own HTML converter creates wretched HTML with tons of
extra bizarro-world tags which likely will trip up your page formatting
if you display these in context in your pages.

as Craig suggested, images can be stored as bytea objects, and the image
links converted to point to a CGI that fetches them from the database
and presents them to the client browser.

--
john r pierce                            N 37, W 122
santa cruz ca                         mid-left coast


pgsql-general by date:

Previous
From: Mike Christensen
Date:
Subject: Converting uuid primary key column to serial int
Next
From: "David Johnston"
Date:
Subject: Re: Converting uuid primary key column to serial int