mirror of
https://github.com/iiab/iiab.git
synced 2025-03-09 15:40:17 +00:00
roles/internetarchive/README.md more approachable?
This commit is contained in:
parent
bb296eed6d
commit
15b5b7fc18
1 changed files with 42 additions and 25 deletions
|
@ -1,7 +1,12 @@
|
||||||
# Internet Archive - Universal Library project README
|
# Internet Archive Universal Library / Decentralized Web README
|
||||||
|
|
||||||
|
The Internet Archive (http://archive.org) is famous for their WayBack Machine
|
||||||
|
that has saved 362+ Billion web pages, and more recently their Decentralized
|
||||||
|
Web project.
|
||||||
|
|
||||||
This Ansible role installs the Internet Archive's dweb-mirror project on
|
This Ansible role installs the Internet Archive's dweb-mirror project on
|
||||||
Internet-in-a-Box (IIAB).
|
Internet-in-a-Box (IIAB). Use this to build up a dynamic offline library
|
||||||
|
arising from the materials you can explore at http://dweb.archive.org
|
||||||
|
|
||||||
The project is a local server that allows users to browse resources from the
|
The project is a local server that allows users to browse resources from the
|
||||||
Internet Archive stored on local drives - including USB drives.
|
Internet Archive stored on local drives - including USB drives.
|
||||||
|
@ -24,16 +29,24 @@ content types; new platforms; and new decentralized transports.
|
||||||
|
|
||||||
### Starting server
|
### Starting server
|
||||||
|
|
||||||
The server is started and restarted automatically.
|
The server is started and restarted automatically. It can be turned on or off
|
||||||
It can be turned on or off at a terminal window with `service internetarchive
|
at a terminal window with `service internetarchive start` or `service
|
||||||
start` or `service internetarchive stop`
|
internetarchive stop`
|
||||||
|
|
||||||
### Browsing
|
### Browsing
|
||||||
|
|
||||||
The server can be accessed at [http://box.lan:4244] (try
|
The server can be accessed at [http://box:4244](http://box:4244) or
|
||||||
[http://box.local:4244] if that doesn't work)
|
[http://box.lan:4244](http://box.lan:4244) (try
|
||||||
|
[http://box.local:4244](http://box.local:4244) via mDNS over a local network,
|
||||||
|
if you don't have name resolution set up to reach your Internet-in-a-Box).
|
||||||
|
|
||||||
If you don’t get a Archive UI then look at the server log (in browser console)
|
_If future, we also hope to get [http://box/archive](http://box/archive) and
|
||||||
|
[http://box.lan/archive](http://box.lan/archive) working (as of 2019-05-25 the
|
||||||
|
error "Cannot GET /archive" appears — if you can help us fix
|
||||||
|
[/etc/apache2/sites-available/internetarchive.conf](https://github.com/iiab/iiab/blob/master/roles/internetarchive/templates/internetarchive.conf)
|
||||||
|
that would be incredible!)_
|
||||||
|
|
||||||
|
If you don’t get an Archive UI then look at the server log (in browser console)
|
||||||
to see for any “FAILING” log lines which indicate a problem.
|
to see for any “FAILING” log lines which indicate a problem.
|
||||||
|
|
||||||
Expect to see errors in the Browser log for
|
Expect to see errors in the Browser log for
|
||||||
|
@ -47,12 +60,12 @@ first time, refresh after a little while and most should appear.
|
||||||
|
|
||||||
Administration is carried out through the same User Interface as browsing.
|
Administration is carried out through the same User Interface as browsing.
|
||||||
|
|
||||||
Access [http://localhost:4244/local] to see a display of local content, this
|
Access [http://localhost:4244/local](http://localhost:4244/local) to see a
|
||||||
interface is under development and various admin tools will be added here.
|
display of local content, this interface is under development and various admin
|
||||||
*At some point this will become the default page*.
|
tools will be added here. *At some point this will become the default page*.
|
||||||
|
|
||||||
Access [http://localhost:4244] to get the Internet Archive main interface if
|
Access [http://localhost:4244](http://localhost:4244) to get the Internet
|
||||||
connected to the net.
|
Archive main interface if connected to the net.
|
||||||
|
|
||||||
While viewing an item or collection, the "Crawl" button in the top bar
|
While viewing an item or collection, the "Crawl" button in the top bar
|
||||||
indicates whether the item is being crawled or not. Clicking it will cycle
|
indicates whether the item is being crawled or not. Clicking it will cycle
|
||||||
|
@ -67,9 +80,9 @@ through three levels:
|
||||||
|
|
||||||
### Disks
|
### Disks
|
||||||
|
|
||||||
The server checks for caches of content in directories called `archiveorg` in all the likely places,
|
The server checks for caches of content in directories called `archiveorg` in
|
||||||
in particular it looks in `/media/pi/*archiveorg` for any inserted USB drives,
|
all the likely places, in particular it looks in `/media/pi/*archiveorg` for
|
||||||
and if none are found, it uses `/library/archiveorg`.
|
any inserted USB drives, and if none are found, it uses `/library/archiveorg`.
|
||||||
|
|
||||||
The list of places it checks, in an unmodified installation can be seen at
|
The list of places it checks, in an unmodified installation can be seen at
|
||||||
`https://github.com/internetarchive/dweb-mirror/blob/master/configDefaults.yaml#L7`.
|
`https://github.com/internetarchive/dweb-mirror/blob/master/configDefaults.yaml#L7`.
|
||||||
|
@ -78,15 +91,16 @@ You can override this in `dweb-mirror.config.yaml` in the home directory of the
|
||||||
user that runs the server, this is currently `/root/dweb-mirror.config.yaml`
|
user that runs the server, this is currently `/root/dweb-mirror.config.yaml`
|
||||||
(see 'Advanced' below)
|
(see 'Advanced' below)
|
||||||
|
|
||||||
Archive's `Items` are stored in subdirectories of the first of these directories found, but
|
Archive's `Items` are stored in subdirectories of the first of these
|
||||||
are read from any of the locations.
|
directories found, but are read from any of the locations.
|
||||||
|
|
||||||
If you disk space is getting full, its perfectly safe to delete any subdirectories, or to move them
|
If you disk space is getting full, its perfectly safe to delete any
|
||||||
to an attached USB. Its also safe to move attached USB's from one device to another.
|
subdirectories, or to move them to an attached USB. Its also safe to move
|
||||||
|
attached USB's from one device to another.
|
||||||
|
|
||||||
The one directory you should not move or delete is `archiveorg/.hashstore` in any of these locations,
|
The one directory you should not move or delete is `archiveorg/.hashstore` in
|
||||||
the server will refetch anything else it needs if you browse to the item again when connected to
|
any of these locations, the server will refetch anything else it needs if you
|
||||||
the internet.
|
browse to the item again when connected to the internet.
|
||||||
|
|
||||||
### Maintenance
|
### Maintenance
|
||||||
|
|
||||||
|
@ -107,8 +121,10 @@ cached, just to rebuild a table of checksums.
|
||||||
Most functionality of the tool is controlled by two YAML files, the second of
|
Most functionality of the tool is controlled by two YAML files, the second of
|
||||||
which you can edit if you have access to the shell.
|
which you can edit if you have access to the shell.
|
||||||
|
|
||||||
You can view the current configuration by going to [http://box.lan:4244/info]
|
You can view the current configuration by going to
|
||||||
or [http://localhost:4244/info] depending on how you are connected.
|
[http://box.lan:4244/info](http://box.lan:4244/info) or
|
||||||
|
[http://localhost:4244/info](http://localhost:4244/info) depending on how you
|
||||||
|
are connected.
|
||||||
|
|
||||||
The default, and user configurations are displayed as the `0` and `1` item in
|
The default, and user configurations are displayed as the `0` and `1` item in
|
||||||
the `/info` call.
|
the `/info` call.
|
||||||
|
@ -172,6 +188,7 @@ cd /opt/iiab/internetarchive/node_modules/@internetarchive/dweb-mirror
|
||||||
./internetarchive --copydirectory /media/pi/xyz/archiveorg --crawl --rows 10 --level details prelinger
|
./internetarchive --copydirectory /media/pi/xyz/archiveorg --crawl --rows 10 --level details prelinger
|
||||||
```
|
```
|
||||||
## Troubleshooting
|
## Troubleshooting
|
||||||
|
|
||||||
There are two logs of relevance, the browser and the server.
|
There are two logs of relevance, the browser and the server.
|
||||||
|
|
||||||
**Browser**: If using Chrome then this is at View / Developer Tools /
|
**Browser**: If using Chrome then this is at View / Developer Tools /
|
||||||
|
|
Loading…
Add table
Add a link
Reference in a new issue