GROBID

From BITPlan Wiki
Jump to navigation Jump to search

GROBID en

GlossaryEntry
responsible  
state  
since  2009
description  GROBID (or Grobid, but not GroBid nor GroBiD) means GeneRation Of BIbliographic Data.
references  
lang  en
master  GROBID

Links


Installation

git clone https://github.com/kermitt2/grobid.git
./gradlew clean install test

Running service

./gradlew run

Apache Configuration

a2ensite grobid
systemctl reload apache2

With domain Names

see https://grobid.bitplan.com

#
# grobid.bitplan.com Server configuration
#  2020-07-04 see https://grobid.readthedocs.io/en/latest/Grobid-service/ 
#  
# see  http://stackoverflow.com/a/13089668/1497139
<VirtualHost *:80> 
  ServerAlias www.grobid.bitplan.com
  ProxyPreserveHost On
  ProxyRequests Off
  ServerName grobid.bitplan.com 
  ProxyPass / http://localhost:8070/
  ProxyPassReverse / http://localhost:8070/
  Header add "Access-Control-Allow-Origin" "*"
</VirtualHost>
<VirtualHost *:443> 
  ServerAlias secure.grobid.bitplan.com
  ProxyPreserveHost On
  ProxyRequests Off
  ServerName grobid.bitplan.com 
  ProxyPass / http://localhost:8070/
  ProxyPassReverse / http://localhost:8070/
  Header add "Access-Control-Allow-Origin" "*"
  Include ssl.conf
</VirtualHost>


Install as service with Ansible playbook

https://github.com/TIBHannover/confiDent-Ansible-VMs/tree/master/playbooks/grobid


Service file

/etc/systemd/system/grobid.service

[Unit]
Description=Grobid
After=network.target
Wants=network.target

[Service]
WorkingDirectory=/usr/local/bin/
ExecStart=/usr/local/bin/grobid-installation/grobid-service/bin/grobid-service server /usr/local/bin/grobid-installation/grobid-service/config/config.yaml
Restart=on-abort


[Install]
WantedBy=multi-user.target

apache2 proxy pass

  Redirect /r/grobid /r/grobid/
  ProxyPass /r/grobid/ http://localhost:8070/
  ProxyPassReverse /r/grobid/ http://localhost:8070/

References

  1. ^  Patrice Lopez. (2009) "GROBID: Combining Automatic Bibliographic Data Recognition and Term Extraction for Scholarship Publications" ECDL 2009: Research and Advanced Technology for Digital Libraries pp 473-474 . doi: 10.1007/978-3-642-04346-8_62url: https://core.ac.uk/download/pdf/38300913.pdfat: ECDL 2009