Commit Graph

105 Commits

Author SHA1 Message Date
romainx
25d4876efe Regular update 2021-05-04 18:36:03 +02:00
Ayaz Salikhov
b5d6f04e43 Delete unused hadolint ignore 2021-05-02 23:28:53 +03:00
Ayaz Salikhov
0fff54f3f2 Install spark from archive.apache.org to be able to use old versions 2021-05-02 23:28:32 +03:00
Antony Neu
5b0b25d158 Fix config formatting 2021-03-20 14:19:55 +01:00
Bjørn-Andre Skaar
0192175d05 Spark 3.1.1 upgrade 2021-03-04 08:27:14 +01:00
romainx
753abff645 Regular update 2021-02-28 06:56:36 +01:00
Aaron Cody
ffea0d5c94 bumped spark version to 3.0.2 as 3.0.1 is no longer available for download 2021-02-23 16:27:49 -08:00
romainx
1dd95bad2c Fix spark installation for Java 11 and Arrow 2020-12-13 11:58:46 +01:00
romainx
1dca39182b Improve spark installation
Spark installation is improved by sourcing `spark-config.sh` in the `before-notebook.d` hook that is run by `start.sh`. This automatically adds the right Py4J dependency version to `PYTHONPATH`, so the variable no longer needs to be set at build time.

The documentation describing the installation of a custom Spark version was modified to remove this step and updated to install the latest `2.x` Spark version.

`test_pyspark` fixed (previously it always passed).
2020-11-24 20:40:06 +01:00
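
The idea in the commit above (picking up whichever Py4J archive ships with the installed Spark instead of hard-coding its version) can be sketched as follows. This is only an illustration in Python, not the hook script from the commit; the function names are hypothetical, and it assumes the usual Spark layout in which Py4J is bundled as `python/lib/py4j-<version>-src.zip` under `SPARK_HOME`.

```python
import glob
import os


def py4j_pythonpath_entries(spark_home):
    """Return PYTHONPATH entries for PySpark and its bundled Py4J archive.

    Assumes Py4J lives at <spark_home>/python/lib/py4j-<version>-src.zip.
    """
    lib_dir = os.path.join(spark_home, "python", "lib")
    # Globbing avoids hard-coding the Py4J version at build time.
    py4j_zips = sorted(glob.glob(os.path.join(lib_dir, "py4j-*-src.zip")))
    entries = [os.path.join(spark_home, "python")]
    entries.extend(py4j_zips)
    return entries


def extend_pythonpath(spark_home, current=""):
    """Prepend the Spark entries to an existing PYTHONPATH value."""
    entries = py4j_pythonpath_entries(spark_home)
    if current:
        entries.append(current)
    return os.pathsep.join(entries)
```

A startup hook could then export the result of `extend_pythonpath(os.environ["SPARK_HOME"], os.environ.get("PYTHONPATH", ""))` before the notebook server starts, so that a different Spark version needs no change to the path logic.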
romainx
0f2a7473c4 Regular update 2020-11-05 20:55:18 +01:00
romainx
4271b09b33 rollback pyarrow update 2020-10-20 12:10:36 +02:00
romainx
32090f6011 Update docker stack 2020-11-20
The following changes have been made.

- `base-notebook`
  - `Changed`: Bump Ubuntu
  - `Changed`: Bump conda
  - `Fixed`: Add missing `apt-get clean`
- `minimal-notebook`
  - `Removed`: `jed` editor see #1174 (partial)
  - `Deprecated`: `emacs` editor (in the documentation and in the image) see #1174 (partial)
- `scipy-notebook`
  - `Fixed`: Add missing `apt-get clean`
  - `Changed`: Bump `dask`
  - `Changed`: Bump `protobuf`
- `r-notebook`
  - `Changed`: Bump `r-base`
  - `Changed`: Bump `r-rmarkdown`
  - `Removed`: The description of `tidyverse` packages because it's not the place to do that and it will always be obsolete.
- `datascience-notebook`
  - `Changed`: Bump `r-base` to `4.0.x` see #1102 (partial)
  - `Removed`: `plyr` package because it's retired fixes #1103
  - `Removed`: `r-reshape2` package because it's retired fixes #1103
  - `Changed`: Bump `r-rmarkdown`
  - `Changed`: Bump Julia
- `tensorflow-notebook`
  - `Changed`: Bump `tensorflow`
- `pyspark-notebook`
  - `Fixed`: Add missing `apt-get clean`
  - `Changed`: Bump `pyarrow`
- `all-spark-notebook`
  - `Fixed`: Add missing `apt-get clean`
  - `Changed`: Bump `r-base` to `4.0.x` see #1102 (partial)
  - `Changed`: Bump `r-sparklyr`
2020-10-20 11:36:34 +02:00
romainx
325dd5b3e2 Fix build (hope)
- miniconda versions and arguments
- remove useless hdf5 install (dependency of h5py)
- pin the version of pyarrow
2020-09-19 17:27:47 +02:00
romainx
384acda330 Spark 3.0.1 -> fixes #1156 2020-09-10 15:25:00 +02:00
romainx
c288e77acb Fix debug line commented 2020-08-16 17:03:10 +02:00
romainx
8669d6e79b Resolves #1131: Allow alternative Spark version
Allow building the `pyspark-notebook` image with an alternative Spark version.

- Define arguments for Spark installation
- Add a note in "Image Specifics" explaining how to build an image with an alternative Spark version
- Remove Toree documentation from "Image Specifics" since its support has been dropped in #1115
2020-08-15 20:19:35 +02:00
Darek
568708d279 Upgrading Spark to 3.0, removing Toree 2020-06-19 00:52:51 +00:00
Romain
5e6645d137 Ignore DL3006 and DL3008 by default 2020-06-01 06:23:44 +02:00
Romain
2ce0b49fb5 Final review 2020-05-30 05:44:53 +02:00
Romain
593698985f Fixes 2020-05-29 21:27:18 +02:00
Romain
7b48e43b74 Fix hadolint deviations 2020-05-29 19:33:24 +02:00
Peter Parente
bbbabd22a3 Update comments to remove mesos references 2020-05-25 12:25:47 -04:00
Travis CI
44a7d70805 Remove mesos lib from pyspark-notebook 2020-05-24 14:18:47 -04:00
Peter Parente
18bb3a4b8b Merge branch 'master' into fix-pyspark 2020-02-15 20:18:46 -05:00
Peter Parente
4a8b58a41b Test pyspark import 2020-02-15 19:08:40 -05:00
Peter Parente
3aa61f94c2 Split SPARK_HOME definition from other env vars 2020-02-15 18:53:51 -05:00
romainx
4333c7cc14 fix as_json param 2020-02-13 13:02:37 +01:00
romainx
7f7be5707c spark mirror improvement 2020-02-13 12:00:41 +01:00
romainx
8b3ce5cfa6 Change spark mirror 2020-02-13 11:21:54 +01:00
romainx
45d51e3b42 Bump to spark 2.4.5 + minor improvements 2020-02-11 21:30:47 +01:00
Manny Cato
1089ae349c Address updating spark version to fix wget error 2019-09-10 16:25:27 -07:00
Peter Parente
411ec857bb Update to Spark 2.4.3 2019-05-11 19:27:53 -04:00
echowhisky
167c686011 Merge branch 'master' into issue-861
Resolved conflicts between local branch and updates to the upstream
master.
2019-05-06 18:46:51 +00:00
echowhisky
40c5c07b0a added -f flag to all conda clean commands
This commit adds the `-f` (force) flag to all uses of `conda clean --all`
throughout the repo. Image size should be smaller, but still testing
whether anything breaks. See issue #861.
2019-05-04 19:11:32 +00:00
echowhisky
1f8311a7aa changed -tipsy to --all -y across all files
The last commit was only for the base-notebook's Dockerfile. For this,
all the files in the repo were grepped through and changed.
2019-05-04 18:53:49 +00:00
Peter Parente
825166612c Update to Spark 2.4.2 2019-04-25 10:06:50 -07:00
Peter Parente
32240517eb Update checksum for spark 2.4.1
https://www.apache.org/dist/spark/spark-2.4.1/spark-2.4.1-bin-hadoop2.7.tgz.sha512
2019-04-10 09:30:59 -04:00
Peter Parente
8cea451451 Update to Spark 2.4.1
The mirror we prefer no longer has 2.4.0
2019-04-10 00:05:55 -04:00
Tim Ryan
3fc28b67e0 Remove apt-get clean command. Unnecessary on official Ubuntu images. 2018-12-17 10:40:58 -05:00
borregore
f4f24a3e63 Update Dockerfile 2018-11-14 11:06:44 -07:00
borregore
46dbe75904 Update Dockerfile 2018-11-14 10:26:12 -07:00
Peter Parente
313453504d Keep py4j-0.10.7-src.zip 2018-11-09 23:43:03 -05:00
Javier Ivan Venegas Carrillo
777eec10c3 Update for test with py4j update for test 2018-11-08 17:29:03 -07:00
Javier Ivan Venegas Carrillo
dd3a198922 Update for test with py4j update 2018-11-08 17:04:10 -07:00
Javier Ivan Venegas Carrillo
5162984d02 Update for test 2018-11-08 16:41:04 -07:00
Javier Ivan Venegas Carrillo
c9bd0ff317 Bumped Spark to 2.4.0 and py4j to 0.10.8.1 2018-11-08 16:29:41 -07:00
graemes
1d763bbac5 Convert FROM specification to ARG (#727) 2018-10-19 18:52:27 +01:00
Stefan Proell
1584606a40 Adds config file initialization
Adds the initial creation of the configuration file
jupyter_notebook_config.json to the image. This file
was missing as reported in issue #639 and caused an
error when trying to set a new password via the Web
interface.

Bumps the version of Jupyter Notebook to 5.6

Updates Apache Mirror URL because the current one causes build timeouts

The mirror used now (mirrors.ukfast.co.uk) is fast enough and does not
point to an ad site and is already used in a different Dockerfile within
this repository.
2018-07-19 10:17:50 +02:00
Darek
9f2072b8e6 Bumped Spark to 2.3.1 2018-06-13 14:39:25 +00:00
Braden Pellett
9a48f4cf65 Add Mesos GPG key
Add Mesos GPG key since accessing the keyserver seems to be flaky:
https://github.com/docker-library/official-images/issues/4252#issuecomment-381783035
2018-06-01 13:03:12 -07:00