RE: Does anyone have experience using inso filters for all kind of documents for Oracle text

  • From: Feighery Raymond <Raymond.Feighery@xxxxxxxxxxxxx>
  • To: "'oracle-l@xxxxxxxxxxxxx'" <oracle-l@xxxxxxxxxxxxx>
  • Date: Wed, 19 May 2004 15:20:57 +0100

Not quite sure what you are asking, but the INSO filter can index documents
such as Word and HTML files.

Ray

Simple example (Oracle 9.2.0.4 on Solaris 8; woods.doc is a Word document
containing a poem in which the word 'woods' appears but 'digital' does not.)
------------------

drop index hdocsx;
drop table hdocs;

create table hdocs (
     id number primary key,
     fmt varchar2(10),
     text varchar2(80)
);
insert into hdocs values (1, 'binary', './woods.doc');
insert into hdocs values (2, 'text', './index.html');

create index hdocsx on hdocs(text) indextype is ctxsys.context
  parameters ('datastore ctxsys.file_datastore
  filter ctxsys.inso_filter
  format column fmt');

------------------
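
With a file datastore the files are read on the database server, so a wrong
path or an unreadable file does not make the create index statement fail; the
affected row is simply left out of the index. A minimal sketch for checking
this, assuming the index was created in the current schema under the name
HDOCSX as above:

```sql
-- List documents the datastore or filter could not process.
-- err_textkey holds the rowid of the hdocs row that failed.
select err_index_name, err_timestamp, err_textkey, err_text
from   ctx_user_index_errors
where  err_index_name = 'HDOCSX'
order  by err_timestamp;
```

If this returns rows for the './...' paths, one thing worth trying is absolute
paths in the text column, since how a relative path resolves depends on the
server environment.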

SQL> select     id,
  2  NVL(SCORE(1),0) scoring
  3  from       hdocs
  4  where      contains(text,'digital',1)>0
  5  /

no rows selected

SQL> select     id,
  2  NVL(SCORE(1),0) scoring
  3  from       hdocs
  4* where      contains(text,'woods',1)>0;

        ID    SCORING
---------- ----------
         1         20
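
A context index is not maintained automatically when rows are inserted later,
so new documents only become searchable after the index is synchronized. The
following is a rough sketch of adding a PDF and ranking the results; the file
name './manual.pdf' and the id 3 are made up for illustration, and it assumes
the user has execute rights on ctx_ddl and that the INSO filter on your
platform handles the PDF in question.

```sql
-- Hypothetical third document; './manual.pdf' is an assumed file name.
insert into hdocs values (3, 'binary', './manual.pdf');
commit;

-- Index the pending row (SQL*Plus syntax).
exec ctx_ddl.sync_index('hdocsx')

-- Return matches ordered by relevance, best first.
select id, score(1) scoring
from   hdocs
where  contains(text, 'woods', 1) > 0
order  by score(1) desc;
```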

-----Original Message-----
From: Juan Cachito Reyes Pacheco [mailto:jreyes@xxxxxxxxxxxxxxxx]
Sent: Wednesday, May 19, 2004 2:13 PM
To: oracle-l@xxxxxxxxxxxxx
Subject: Does anyone have experience using inso filters for all kind of
documents for Oracle text 


Hi, does anyone have experience using the INSO filter for Oracle Text?
I would like to get a Google-like search over PDF, DOC, etc. documents using
Oracle Text.

I found something at
http://otn.oracle.com/products/text/htdocs/FilterServer.htm
http://otn.oracle.com/products/text/htdocs/altfilters.htm

But some of them are only for specific products.
I would like to know if someone has or knows of a Perl script or
something like that,
working and tested for all kinds of text documents.

Thanks.

Juan Carlos Reyes Pacheco
OCP
Database 9.2 Standard Edition

----------------------------------------------------------------
Please see the official ORACLE-L FAQ: http://www.orafaq.com
----------------------------------------------------------------
To unsubscribe send email to:  oracle-l-request@xxxxxxxxxxxxx
put 'unsubscribe' in the subject line.
--
Archives are at //www.freelists.org/archives/oracle-l/
FAQ is at //www.freelists.org/help/fom-serve/cache/1.html
-----------------------------------------------------------------
