You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -56,9 +49,27 @@ virutalenv before calling `dsub`, `dstat`, and `ddel`. They are in the
56
49
use these scripts if you don't want to activate the virtualenv explicitly in
57
50
your shell.
58
51
52
+
#### Install the Google Cloud SDK
53
+
54
+
While not used directly by `dsub` for the `google-v2` or `google-cls-v2` providers, you are likely to want to install the command line tools found in the [Google
55
+
Cloud SDK](https://cloud.google.com/sdk/).
56
+
57
+
If you will be using the `local` provider for faster job development,
58
+
you *will* need to install the Google Cloud SDK, which uses `gsutil` to ensure
59
+
file operation semantics consistent with the Google `dsub` providers.
60
+
61
+
1.[Install the Google Cloud SDK](https://cloud.google.com/sdk/)
62
+
2. Run
63
+
64
+
gcloud init
65
+
66
+
67
+
`gcloud` will prompt you to set your default project and to grant
68
+
credentials to the Google Cloud SDK.
69
+
59
70
### Install `dsub`
60
71
61
-
Choose one of the following:
72
+
Choose **one** of the following:
62
73
63
74
#### Install from PyPI
64
75
@@ -167,12 +178,7 @@ The steps for getting started differ slightly as indicated in the steps below:
167
178
168
179
[Enable the Cloud Life Sciences, Storage, and Compute APIs](https://console.cloud.google.com/flows/enableapi?apiid=lifesciences.googleapis.com,storage_component,compute_component&redirect=https://console.cloud.google.com)
169
180
170
-
1.[Install the Google Cloud SDK](https://cloud.google.com/sdk/) and run
171
-
172
-
gcloud init
173
-
174
-
This will set up your default project and grant credentials to the Google
175
-
Cloud SDK. Now provide [credentials](https://developers.google.com/identity/protocols/application-default-credentials)
181
+
1. Provide [credentials](https://developers.google.com/identity/protocols/application-default-credentials)
176
182
so `dsub` can call Google APIs:
177
183
178
184
gcloud auth application-default login
@@ -423,57 +429,88 @@ specified and they can be specified in any order.
423
429
424
430
#### Mounting "resource data"
425
431
426
-
If you have one of the following:
432
+
While explicitly specifying inputs improves tracking provenance of your data,
433
+
there are cases where you might not want to expliclty localize all inputs
434
+
from Cloud Storage to your job VM.
435
+
436
+
For example, if you have:
437
+
438
+
- a large set of resource files
439
+
- your code only reads a subset of those files
440
+
- runtime decisions of which files to read
441
+
442
+
OR
443
+
444
+
- a large input file over which your code makes a single read pass
445
+
446
+
OR
447
+
448
+
- a large input file that your code does not read in its entirety
427
449
428
-
1. A large set of resource files, your code only reads a subset of those files,
429
-
and the decision of which files to read is determined at runtime, or
430
-
2. A large input file over which your code makes a single read pass or only
431
-
needs to read a small range of bytes,
450
+
then you may find it more efficient or convenient to access this data by
451
+
mounting read-only:
432
452
433
-
then you may find it more efficient at runtime to access this resource data via
434
-
mounting a Google Cloud Storage bucket read-only or mounting a persistent disk
0 commit comments