1
0
Fork 0
mirror of https://github.com/iiab/iiab.git synced 2025-02-13 03:32:12 +00:00

internetarchive/README.md punctuation/layout/capitalization tweaks

This commit is contained in:
A Holt 2019-05-05 22:56:23 -04:00 committed by GitHub
parent 2ad9b29dcc
commit 3164a55fae
No known key found for this signature in database
GPG key ID: 4AEE18F83AFDEB23

View file

@ -1,68 +1,94 @@
# Internet Archive - Universal Library project README
This Ansible role installs the Internet Archive's dweb-mirror project on Internet-in-a-Box (IIAB).
This Ansible role installs the Internet Archive's dweb-mirror project on
Internet-in-a-Box (IIAB).
The project is a local server that allows users to browse resources from the Internet Archive stored
on local drives - including USB drives.
The project is a local server that allows users to browse resources from the
Internet Archive stored on local drives - including USB drives.
It includes a crawler that can regularly synchronize local collections,
against a list of Internet Archive items and collections, and those collections can be moved between installations.
It includes a crawler that can regularly synchronize local collections, against
a list of Internet Archive items and collections, and those collections can be
moved between installations.
When connected to the internet, the server works as a Proxy, i.e. it will store Internet Archive content the user views for later off-line viewing.
When connected to the internet, the server works as a Proxy, i.e. it will store
Internet Archive (IA) content the user views for later off-line viewing.
There are components to integrate the IA server with decentralized tools including IPFS, WebTorrent, GUN, WOLK,
both for fetching content and for serving it back to the net or locally.
There are components to integrate the IA server with decentralized tools
including IPFS, WebTorrent, GUN, WOLK, both for fetching content and for
serving it back to the net or locally.
This is an ongoing project, continually adding support for new Internet Archive content types; new platforms; and new decentralized transports.
This is an ongoing project, continually adding support for new Internet Archive
content types; new platforms; and new decentralized transports.
## Using it
### Starting server
The server is started and restarted automatically.
It can be turned on or off at a terminal window with `service internetarchive start` or `service internetarchive stop`
It can be turned on or off at a terminal window with `service internetarchive
start` or `service internetarchive stop`
### Browsing
The server can be accessed at [http://box.lan:4244] (try [http://box.local:4244] if that doesn't work)
The server can be accessed at [http://box.lan:4244] (try
[http://box.local:4244] if that doesn't work)
If you dont get a Archive UI then look at the server log (in browser console) to see for any “FAILING” log lines which indicate a problem.
If you dont get a Archive UI then look at the server log (in browser console)
to see for any “FAILING” log lines which indicate a problem.
Expect to see errors in the Browser log for `http://localhost:5001/api/v0/version?stream-channels=true` which is checking for a local IPFS server which is not started here.
Expect to see errors in the Browser log for
`http://localhost:5001/api/v0/version?stream-channels=true` which is checking
for a local IPFS server which is not started here.
Expect, on slower machines or slower network connections, to see no images the first time, refresh after a little while and most should appear.
Expect, on slower machines or slower network connections, to see no images the
first time, refresh after a little while and most should appear.
## Administration
Administration is carried out through the same User Interface as browsing.
Access [http://localhost:4244/local] to see a display of local content, this interface is under development and various admin tools will be added here. *at some point this will become the default page*.
Access [http://localhost:4244/local] to see a display of local content, this
interface is under development and various admin tools will be added here.
*At some point this will become the default page*.
Access [http://localhost:4244] to get the Internet Archive main interface if connected to the net.
Access [http://localhost:4244] to get the Internet Archive main interface if
connected to the net.
While viewing an item or collection, the "Crawl" button in the top bar
indicates whether the item is being crawled or not. Clicking it will cycle
through three levels:
While viewing an item or collection,
the "Crawl" button in the top bar indicates whether the item is being crawled or not.
Clicking it will cycle through three levels:
* No crawling
* Details - sufficient information will be crawled to display the page,
for a collection this also means getting the thumbnails and metadata for the top items.
* Full - crawls everything on the item, this can be a LOT of data, including full size videos etc, so use with care if bandwidth/disk is limited.
* Details - sufficient information will be crawled to display the page, for a
collection this also means getting the thumbnails and metadata for the top
items.
* Full - crawls everything on the item, this can be a LOT of data, including
full size videos etc, so use with care if bandwidth/disk is limited.
### Disks
The server checks for disks in all the likely places, the list of places it checks, in an unmodified installation can be seen at
The server checks for disks in all the likely places, the list of places it
checks, in an unmodified installation can be seen at
`https://github.com/internetarchive/dweb-mirror/blob/master/configDefaults.yaml#L7`
This includes top level directories on usb drives at `/media/pi/*/archiveorg`
You can override this in `dweb-mirror.config.yaml` in the home directory of the user that runs the server, this is currently `/root/dweb-mirror.config.yaml` (see 'Advanced' below )
You can override this in `dweb-mirror.config.yaml` in the home directory of the
user that runs the server, this is currently `/root/dweb-mirror.config.yaml`
(see 'Advanced' below)
Items are stored in subdirectories of the first of these directories found, but read from any of the locations.
Items are stored in subdirectories of the first of these directories found, but
read from any of the locations.
If you disk space is getting full, its perfectly safe to delete any subdirectories,
except the `.hashstore` at the top level of each,
the server will refetch what it needs if you browse to the item again when connected to the internet.
If you disk space is getting full, its perfectly safe to delete any
subdirectories, except the `.hashstore` at the top level of each, the server
will refetch what it needs if you browse to the item again when connected to
the internet.
### Maintenance
If you are worried about corruption, or after for example hand-editing or moving cached items around.
If you are worried about corruption, or after for example hand-editing or
moving cached items around.
```
# Run everything as root
sudo sh
@ -70,31 +96,40 @@ sudo sh
cd /opt/iiab/internetarchive/node_modules/@internetarchive/dweb-mirror
./internetarchive -m
```
This will usually take about 5-10 minutes depending on the amount of material cached,
just to rebuild a table of checksums.
This will usually take about 5-10 minutes depending on the amount of material
cached, just to rebuild a table of checksums.
### Advanced
Most functionality of the tool is controlled by two YAML files,
the second of which you can edit if you have access to the shell.
You can view the current configuration by going to [http://box.lan:4244/info] or [http://localhost:4244/info] depending on how you are connected.
Most functionality of the tool is controlled by two YAML files, the second of
which you can edit if you have access to the shell.
The default, and user configurations are displayed as the `0` and `1` item in the `/info` call.
You can view the current configuration by going to [http://box.lan:4244/info]
or [http://localhost:4244/info] depending on how you are connected.
In the Repo is a [default YAML file](https://github.com/internetarchive/dweb-mirror/blob/master/configDefaults.yaml) which is commented.
It would be a bad idea to edit this, so I'm not going to tell you where it is on your installation!
But anything from this file can be overridden by lines in `/root/dweb-mirror.config.yaml`.
Make sure you understand how yaml works before editing this file,
if you break it, you can copy a new default from [dweb-mirror.config.yaml on the repo](https://github.com/internetarchive/dweb-mirror/blob/master/configDefaults.yaml#L7)
The default, and user configurations are displayed as the `0` and `1` item in
the `/info` call.
In the Repo is a
[default YAML file](https://github.com/internetarchive/dweb-mirror/blob/master/configDefaults.yaml)
which is commented. It would be a bad idea to edit this, so I'm not going to
tell you where it is on your installation! But anything from this file can be
overridden by lines in `/root/dweb-mirror.config.yaml`. Make sure you
understand how yaml works before editing this file, if you break it, you can
copy a new default from
[dweb-mirror.config.yaml on the repo](https://github.com/internetarchive/dweb-mirror/blob/master/configDefaults.yaml#L7)
TODO Note this file will probably move location.
Note that this file is also edited automatically when the Crawl button described above is clicked.
Note that this file is also edited automatically when the Crawl button
described above is clicked.
As the project develops, this file will be editable via a UI.
## Update
Dweb-mirror is under rapid development, as is the Javascript UI. Its recommended to update frequently.
Dweb-mirror is under rapid development, as is the JavaScript UI. It's
recommended to update frequently.
From a Terminal window
```
@ -104,9 +139,12 @@ yarn upgrade # Currently this can take up to about 20 minutes to run, we hope t
```
## Crawling
The Crawler will be built into the UI fairly soon, for now it has to be run in a terminal window.
Its highly configurable either through the YAML file described above, or from the command line.
The Crawler will be built into the UI fairly soon, for now it has to be run in
a terminal window.
Its highly configurable either through the YAML file described above, or from
the command line.
In a shell
```
@ -133,7 +171,8 @@ cd /opt/iiab/internetarchive/node_modules/@internetarchive/dweb-mirror
## Troubleshooting
There are two logs of relevance, the browser and the server.
**Browser**: If using Chrome then this is at View / Developer Tools / Javascript Console or something similar.
**Browser**: If using Chrome then this is at View / Developer Tools /
JavaScript Console or something similar.
**Server**:
From a Terminal window.
@ -142,20 +181,23 @@ journalctl -u internetarchive
```
## Known Issues
See [github dweb-mirror issues](https://github.com/internetarchive/dweb-mirror/issues); and [github dweb-archive issues](https://github.com/internetarchive/dweb-archive/issues);
See
[github dweb-mirror issues](https://github.com/internetarchive/dweb-mirror/issues);
and
[github dweb-archive issues](https://github.com/internetarchive/dweb-archive/issues);
## More info
Dweb-Mirror Lives on github at:
[dweb-mirror](https://github.com/internetarchive/dweb-mirror);
[source](https://github.com/internetarchive/dweb-mirror);
[issues](https://github.com/internetarchive/dweb-mirror/issues);
[API.md](./API.md) API documentation for dweb-mirror
This project is part of our larger Dweb project, see also:
[dweb-universal](https://github.com/internetarchive/dweb-universal) info about others distributing the web;
[dweb-transport](https://github.com/internetarchive/dweb-transport) miscellaneous inc GUN gateway and webtorrent;
[dweb-objects](https://github.com/internetarchive/dweb-objects) library of dweb objects;
[dweb-archive](https://github.com/internetarchive/dweb-archive) archive UI in Javascript;
[dweb-archivecontroller](https://github.com/internetarchive/dweb-archive) Knows about the structure of archive objects;
Dweb-Mirror lives on GitHub at:
* [dweb-mirror](https://github.com/internetarchive/dweb-mirror)
* [source](https://github.com/internetarchive/dweb-mirror)
* [issues](https://github.com/internetarchive/dweb-mirror/issues)
* [API.md](./API.md) API documentation for dweb-mirror
This project is part of the Internet Archive's larger Dweb project, see also:
* [dweb-universal](https://github.com/internetarchive/dweb-universal) info about others distributing the web
* [dweb-transport](https://github.com/internetarchive/dweb-transport) miscellaneous incl GUN gateway and WebTorrent
* [dweb-objects](https://github.com/internetarchive/dweb-objects) library of dweb objects
* [dweb-archive](https://github.com/internetarchive/dweb-archive) archive UI in JavaScript
* [dweb-archivecontroller](https://github.com/internetarchive/dweb-archive) Knows about the structure of archive objects