Thread: BLOBs

BLOBs

From
Thomas Swan
Date:
I know that BLOBs are on the TODO list, but I had an idea.

I think the storage of a BLOB outside of the table is an elegant 
solution and keeps table sizes down without the bloat of the stored 
object.   Granted, if you are searching with a regular expression or 
using like or ilike clauses, you're likely to be a little slower but it 
shouldn't be by much.  More than likely, you won't be searching for 
patterns in the BLOB but rather the fields in the table associated with 
the BLOB.

Wouldn't it be wonderful if you used the methods you had already 
implemented and instead create a behavoir similar to the following.

on an insert   take the data that was to be the blob...   create your externally "to be referenced" file   save the
datato the file   store the reference to that file
 

on an update   take the data that was to be the blob...   create your externally "to be referenced" file   save the
datato the file   store the reference to that file   delete the old referenced file
 

on a delete   delete the reference to your file   delete the external file

I was thinking that the BLOB column type might be a trigger for a macro 
that could handle the lo_import, lo_export juggling...

I know it seems overly simplified, but having fought with MySQL and then 
trying to wrestle with postgresql and importing,exporting BLOBs, it 
seemed there might be a little more room for discussion, although I 
doubt this may have added anything to it...

I'd love to see something done with BLOB support during 7.2.x *hint* :)

Besides, if someone could give me some pointers as to where I might be 
able to start, I might try to contribute something myself.

Thomas




Re: BLOBs

From
Tom Lane
Date:
Thomas Swan <tswan@olemiss.edu> writes:
> I know that BLOBs are on the TODO list, but I had an idea.

I think you just rediscovered TOAST.
        regards, tom lane


Re: BLOBs

From
Bruce Momjian
Date:
> Thomas Swan <tswan@olemiss.edu> writes:
> > I know that BLOBs are on the TODO list, but I had an idea.
> 
> I think you just rediscovered TOAST.

We have TOAST and people want to keep large objects for performance.  I
think we could us an API that allows TOAST binary access and large
object access using the same API, and hopefully an improved one.

--  Bruce Momjian                        |  http://candle.pha.pa.us pgman@candle.pha.pa.us               |  (610)
853-3000+  If your life is a hard drive,     |  830 Blythe Avenue +  Christ can be your backup.        |  Drexel Hill,
Pennsylvania19026
 


Re: BLOBs

From
Thomas Swan
Date:
Bruce Momjian wrote:<br /><blockquote cite="mid:200106110315.f5B3FgT24983@candle.pha.pa.us" type="cite"><blockquote
type="cite"><prewrap="">Thomas Swan <a class="moz-txt-link-rfc2396E"
href="mailto:tswan@olemiss.edu"><tswan@olemiss.edu></a>writes:<br /></pre><blockquote type="cite"><pre wrap="">I
knowthat BLOBs are on the TODO list, but I had an idea.<br /></pre></blockquote><pre wrap="">I think you just
rediscoveredTOAST.<br /></pre></blockquote><pre wrap=""><br />We have TOAST and people want to keep large objects for
performance. I<br />think we could us an API that allows TOAST binary access and large<br />object access using the
sameAPI, and hopefully an improved one.</pre></blockquote> I think I missed what I was trying to say in my original
statement. I think there's a way to use the existing API with performance benefits left intact.<br /><br /> Take for
examplethe table :<br /><tt>create table foo {</tt><tt><br /></tt><tt>    foo_id serial,</tt><tt><br /></tt><tt>   
foo_namevarchar(32),</tt><tt><br /></tt><tt>    foo_object BLOB,</tt><tt><br /></tt><tt>);</tt><tt><br /></tt><br /> On
theinsert statement <tt>"insert into foo (foo_name,foo_object) values ('My Object','{some escaped arbitrary string of
binarydata}');"</tt>, flush the {some escaped arbitrary string of binary data} to disk as a temporary file.  Then do
thelo_import operation transparent to the user.<br /><br /> On a select, do the same thing (transparently) and return
thedata back to user.<br /><br /> Personally, I like LO's being stored separately from the actual table.<br /><br /><br
/><br/> 

Re: BLOBs

From
Tom Lane
Date:
Thomas Swan <tswan@olemiss.edu> writes:
> I think I missed what I was trying to say in my original statement.  I 
> think there's a way to use the existing API with performance benefits 
> left intact.

> Take for example the table :
> create table foo {
>     foo_id serial,
>     foo_name varchar(32),
>     foo_object BLOB,
> );

> On the insert statement "insert into foo (foo_name,foo_object) values 
> ('My Object','{some escaped arbitrary string of binary data}');", flush 
> the {some escaped arbitrary string of binary data} to disk as a 
> temporary file.  Then do the lo_import operation transparent to the user.

> On a select, do the same thing (transparently) and return the data back 
> to user.

> Personally, I like LO's being stored separately from the actual table.

I still think you've rediscovered TOAST.  How is this better than (or
even significantly different from) foo_object being a toastable bytea
column?
        regards, tom lane