Opened 4 years ago
#62314 new request
Apache Tika
Reported by: | workflowsguy | Owned by: | |
---|---|---|---|
Priority: | Normal | Milestone: | |
Component: | ports | Version: | 2.6.4 |
Keywords: | | Cc: | |
Port: | | | |
Description
The Apache Tika™ (https://tika.apache.org/) toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). It would be a valuable addition to Apache Solr, for which a port is already available. Homebrew has a formula for Tika.