romainx
9529e3dffe
Turn off ipython low-level output capture and forward
2022-01-07 17:53:27 +01:00
Darek
df72530d28
Upgrading Spark to 3.2.0
2021-10-20 13:42:25 +00:00
Ayaz Salikhov
a6d0ed456e
Merge branch 'master' into asalikhov/automatic_conda_versioning
2021-07-09 16:52:31 +03:00
Ayaz Salikhov
3c9e62efce
Introduce owner to Dockerfiles to make it easy to test locally
2021-07-08 17:26:56 +03:00
Ayaz Salikhov
bfb8cc7d50
Allow conda to automatically deduce package versions
2021-06-20 22:13:15 +03:00
Ayaz Salikhov
4befdd5948
Merge branch 'master' into asalikhov/use_mamba
2021-06-07 23:05:26 +03:00
Ayaz Salikhov
396024a4dd
Merge pull request #1335 from mathbunnyru/asalikhov/unify_bash_variables
...
Unify bash variables usage and add quotes where needed
2021-06-07 23:00:11 +03:00
Darek
3b5b735bb6
Upgrading spark_version->3.1.2
2021-06-04 15:02:52 +00:00
Ayaz Salikhov
6ab40f0002
Use mamba instead of conda in spark images
2021-06-01 20:30:30 +03:00
Ayaz Salikhov
0999f1a36f
Unify bash variables usage and add quotes where needed
2021-05-26 13:06:10 +03:00
Ayaz Salikhov
fbceaa9892
Fixes
2021-05-22 14:01:01 +03:00
romainx
25d4876efe
Regular update
2021-05-04 18:36:03 +02:00
Ayaz Salikhov
b5d6f04e43
Delete unused hadolint ignore
2021-05-02 23:28:53 +03:00
Ayaz Salikhov
0fff54f3f2
Install spark from archive.apache.org to be able to use old versions
2021-05-02 23:28:32 +03:00
Antony Neu
5b0b25d158
Fix config formatting
2021-03-20 14:19:55 +01:00
Bjørn-Andre Skaar
0192175d05
Spark 3.1.1 upgrade
2021-03-04 08:27:14 +01:00
romainx
753abff645
Regular update
2021-02-28 06:56:36 +01:00
Aaron Cody
ffea0d5c94
bumped spark version to 3.0.2 as 3.0.1 is no longer available for download
2021-02-23 16:27:49 -08:00
romainx
1dd95bad2c
Fix spark installation for Java 11 and Arrow
2020-12-13 11:58:46 +01:00
romainx
1dca39182b
Improve spark installation
...
Improve the Spark installation by sourcing `spark-config.sh` in the `before-notebook.d` hook that is run by `start.sh`. This automatically adds the right Py4J dependency version to `PYTHONPATH`, so the variable no longer needs to be set at build time.
The documentation describing the installation of a custom Spark version has been modified to remove this step, and updated to install the latest `2.x` Spark version.
Fix `test_pyspark`, which previously always passed.
2020-11-24 20:40:06 +01:00
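The Py4J lookup described in the entry above can be illustrated with a minimal sketch. It is not the actual hook (which sources `spark-config.sh` from a shell script); it only shows the idea of discovering the bundled Py4J archive at start-up instead of hard-coding its version at build time. The function name `py4j_pythonpath` is hypothetical, and the sketch assumes the usual Spark distribution layout where Py4J ships as `python/lib/py4j-<version>-src.zip` under `SPARK_HOME`.

```python
import glob
import os


def py4j_pythonpath(spark_home, existing=""):
    """Build a PYTHONPATH value containing Spark's Python sources and its bundled Py4J zip.

    Hypothetical helper for illustration; assumes Py4J is shipped as
    <spark_home>/python/lib/py4j-<version>-src.zip.
    """
    lib_dir = os.path.join(spark_home, "python", "lib")
    # Discover the archive by pattern so the Py4J version never has to be hard-coded.
    matches = sorted(glob.glob(os.path.join(lib_dir, "py4j-*-src.zip")))
    if not matches:
        raise FileNotFoundError(f"no Py4J archive found under {lib_dir}")
    parts = [os.path.join(spark_home, "python"), matches[-1]]
    if existing:
        # Keep whatever was already on PYTHONPATH, after the Spark entries.
        parts.append(existing)
    return os.pathsep.join(parts)
```

Because the version is resolved when the container starts, swapping in a different Spark build should not require touching the environment variable by hand.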
romainx
0f2a7473c4
Regular update
2020-11-05 20:55:18 +01:00
romainx
4271b09b33
rollback pyarrow update
2020-10-20 12:10:36 +02:00
romainx
32090f6011
Update docker stack 2020-11-20
...
The following changes have been made.
- `base-notebook`
  - `Changed`: Bump Ubuntu
  - `Changed`: Bump conda
  - `Fixed`: Add missing `apt-get clean`
- `minimal-notebook`
  - `Removed`: `jed` editor see #1174 (partial)
  - `Deprecated`: `emacs` editor (in the documentation and in the image) see #1174 (partial)
- `scipy-notebook`
  - `Fixed`: Add missing `apt-get clean`
  - `Changed`: Bump `dask`
  - `Changed`: Bump `protobuf`
- `r-notebook`
  - `Changed`: Bump `r-base`
  - `Changed`: Bump `r-rmarkdown`
  - `Removed`: The description of `tidyverse` packages because it's not the place to do that and it will always be obsolete.
- `datascience-notebook`
  - `Changed`: Bump `r-base` to `4.0.x` see #1102 (partial)
  - `Removed`: `plyr` package because it's retired fixes #1103
  - `Removed`: `r-reshape2` package because it's retired fixes #1103
  - `Changed`: Bump `r-rmarkdown`
  - `Changed`: Bump Julia
- `tensorflow-notebook`
  - `Changed`: Bump `tensorflow`
- `pyspark-notebook`
  - `Fixed`: Add missing `apt-get clean`
  - `Changed`: Bump `pyarrow`
- `all-spark-notebook`
  - `Fixed`: Add missing `apt-get clean`
  - `Changed`: Bump `r-base` to `4.0.x` see #1102 (partial)
  - `Changed`: Bump `r-sparklyr`
2020-10-20 11:36:34 +02:00
romainx
325dd5b3e2
Fix build (hope)
...
- miniconda versions and arguments
- remove useless hdf5 install (dependency of h5py)
- pin the version of pyarrow
2020-09-19 17:27:47 +02:00
romainx
384acda330
Spark 3.0.1 -> fixes #1156
2020-09-10 15:25:00 +02:00
romainx
c288e77acb
Fix debug line commented
2020-08-16 17:03:10 +02:00
romainx
8669d6e79b
Resolves #1131 : Allow alternative Spark version
...
Allow building the `pyspark-notebook` image with an alternative Spark version.
- Define arguments for Spark installation
- Add a note in "Image Specifics" explaining how to build an image with an alternative Spark version
- Remove Toree documentation from "Image Specifics" since its support was dropped in #1115
2020-08-15 20:19:35 +02:00
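To make the idea of installation arguments in the entry above concrete, here is a small sketch of how a download URL could be derived from a Spark version and a Hadoop version. The parameter names `spark_version` and `hadoop_version` and the helper name `spark_archive_url` are assumptions for illustration, not the Dockerfile's confirmed argument names. The file naming follows the `spark-2.4.1-bin-hadoop2.7.tgz` pattern that appears in this log, and the default host is the `archive.apache.org` location mentioned in a later entry.

```python
def spark_archive_url(spark_version, hadoop_version,
                      base="https://archive.apache.org/dist/spark"):
    """Return the tarball URL for a given Spark/Hadoop combination.

    Hypothetical helper; the naming scheme is inferred from
    spark-<version>-bin-hadoop<version>.tgz as seen in this log.
    """
    name = f"spark-{spark_version}-bin-hadoop{hadoop_version}"
    return f"{base}/spark-{spark_version}/{name}.tgz"


if __name__ == "__main__":
    print(spark_archive_url("2.4.1", "2.7"))
```

Keeping the versions as parameters means a different release can be selected without editing the URL itself.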
Darek
568708d279
Upgrading Spark to 3.0, removing Toree
2020-06-19 00:52:51 +00:00
Romain
5e6645d137
Ignore DL3006 and DL3008 by default
2020-06-01 06:23:44 +02:00
Romain
2ce0b49fb5
Final review
2020-05-30 05:44:53 +02:00
Romain
593698985f
Fixes
2020-05-29 21:27:18 +02:00
Romain
7b48e43b74
Fix hadolint deviations
2020-05-29 19:33:24 +02:00
Peter Parente
bbbabd22a3
Update comments to remove mesos references
2020-05-25 12:25:47 -04:00
Travis CI
44a7d70805
Remove mesos lib from pyspark-notebook
2020-05-24 14:18:47 -04:00
Peter Parente
18bb3a4b8b
Merge branch 'master' into fix-pyspark
2020-02-15 20:18:46 -05:00
Peter Parente
4a8b58a41b
Test pyspark import
2020-02-15 19:08:40 -05:00
Peter Parente
3aa61f94c2
Split SPARK_HOME definition from other env vars
2020-02-15 18:53:51 -05:00
romainx
4333c7cc14
fix as_json param
2020-02-13 13:02:37 +01:00
romainx
7f7be5707c
spark mirror improvement
2020-02-13 12:00:41 +01:00
romainx
8b3ce5cfa6
Change spark mirror
2020-02-13 11:21:54 +01:00
romainx
45d51e3b42
Bump to spark 2.4.5 + minor improvements
2020-02-11 21:30:47 +01:00
Manny Cato
1089ae349c
Address updating spark version to fix wget error
2019-09-10 16:25:27 -07:00
Peter Parente
411ec857bb
Update to Spark 2.4.3
2019-05-11 19:27:53 -04:00
echowhisky
167c686011
Merge branch 'master' into issue-861
...
Resolved conflicts between local branch and updates to the upstream
master.
2019-05-06 18:46:51 +00:00
echowhisky
40c5c07b0a
added -f flag to all conda clean commands
...
This commit adds the additional `-f` force flag to all uses of `conda
clean --all` throughout the repo. Image size should be smaller, but I am
still testing whether anything breaks. See issue #861.
2019-05-04 19:11:32 +00:00
echowhisky
1f8311a7aa
changed -tipsy to --all -y across all files
...
The last commit was only for the base-notebook's Dockerfile. For this,
all the files in the repo were grepped through and changed.
2019-05-04 18:53:49 +00:00
Peter Parente
825166612c
Update to Spark 2.4.2
2019-04-25 10:06:50 -07:00
Peter Parente
32240517eb
Update checksum for spark 2.4.1
...
https://www.apache.org/dist/spark/spark-2.4.1/spark-2.4.1-bin-hadoop2.7.tgz.sha512
2019-04-10 09:30:59 -04:00
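The checksum update in the entry above implies that the downloaded tarball is verified against a SHA-512 digest. A minimal sketch of such a check follows. The helper name `sha512_matches` is hypothetical, and the sketch assumes the caller has already extracted the bare hex digest from the published `.sha512` file, whose exact layout may vary between releases.

```python
import hashlib


def sha512_matches(path, expected_hex, chunk_size=1 << 20):
    """Return True if the file at `path` has the given SHA-512 digest.

    Hypothetical helper; `expected_hex` is assumed to contain only the hex
    digest, possibly upper-case or split into whitespace-separated groups.
    """
    digest = hashlib.sha512()
    with open(path, "rb") as fh:
        # Read in chunks so large tarballs are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    # Strip whitespace and lower-case the expected value before comparing.
    expected = "".join(expected_hex.split()).lower()
    return digest.hexdigest() == expected
```

A build would typically abort when this returns `False`, which is also why the pinned checksum has to change whenever the Spark version does.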
Peter Parente
8cea451451
Update to Spark 2.4.1
...
The mirror we prefer no longer has 2.4.0
2019-04-10 00:05:55 -04:00
Tim Ryan
3fc28b67e0
Remove apt-get clean command. Unnecessary on official Ubuntu images.
2018-12-17 10:40:58 -05:00