Reproducible Data Science at Scale!

Overview


Pachyderm: The Data Foundation for Machine Learning

Pachyderm provides the data layer that allows machine learning teams to productionize and scale their machine learning lifecycle. With Pachyderm’s industry-leading data versioning, pipelines, and lineage, teams gain data-driven automation, petabyte scalability, and end-to-end reproducibility. Teams using Pachyderm get their ML projects to market faster, lower data processing and storage costs, and more easily meet regulatory compliance requirements.

Features

  • Automated Data Versioning: Pachyderm’s Data Versioning gives teams an automated and performant way to keep track of all data changes.
  • Data-Driven Pipelines: Pachyderm’s Containerized Pipelines speed data processing while lowering compute costs.
  • Immutable Data Lineage: Pachyderm’s data lineage provides an immutable record for all activities and assets in the ML lifecycle.
  • Console: The Pachyderm Console provides an intuitive visualization of your DAG (directed acyclic graph), and aids in reproducibility.
  • Notebooks: Pachyderm Notebooks provide an easy way to interact with Pachyderm data versioning and pipelines via Jupyter notebooks.

Getting Started

To start deploying your end-to-end version-controlled data pipelines, try us for free on Hub with little to no setup, or run Pachyderm locally. You can also deploy on AWS/GCE/Azure in about 5 minutes.

You can also refer to our complete documentation to see tutorials, check out example projects, learn about core use cases, and explore advanced features of Pachyderm.

Documentation

Official Documentation

Community

Keep up to date and get Pachyderm support via:

  • Follow us on Twitter.
  • Join our community Slack channel to get help from the Pachyderm team and other users.

Contributing

To get started, sign the Contributor License Agreement.

You should also check out our contributing guide.

Send us PRs; we would love to see what you do! You can also check our GitHub issues for things labeled "help-wanted" as a good place to start. We're sometimes bad about keeping that label up to date, so if you don't see any, just let us know.

Join Us

WE'RE HIRING! Love Docker, Go, and distributed systems? Learn more about our open positions.

Usage Metrics

Pachyderm automatically reports anonymized usage metrics. These metrics help us understand how people are using Pachyderm so we can make it better. They can be disabled by setting the environment variable METRICS to false in the pachd container.

License Information

Pachyderm has moved some components of Pachyderm Platform to a source-available limited license.

We remain committed to the culture of open source, developing our product transparently and collaboratively with our community, and giving our community and customers source code access and the ability to study and change the software to suit their needs.

Under the Pachyderm Community License, you can access the source code and modify or redistribute it; there is only one thing you cannot do, and that is use it to make a competing offering.

Check out our License FAQ Page for more information.

Comments
  • pachd fails: panic: failed to initialize pach client: context deadline exceeded

    What happened?:

    Ran pachctl deploy to create an on-premises pachyderm cluster:

    pachctl deploy custom --object-store s3 any-string 10 <bucket> <accesskey> <secretkey> rook-ceph-rgw-my-store.rook-ceph:80 --etcd-storage-class nfs-client --image-pull-secret boss-6000 --namespace pachyderm --dynamic-etcd-nodes 1
    

    pachd is failing to start up and is reporting the following in the logs:

    2019-12-16T18:46:13Z INFO no Jaeger collector found (JAEGER_COLLECTOR_SERVICE_HOST not set) 
    2019-12-16T18:46:19Z WARNING TLS disabled: could not stat public cert at /pachd-tls-cert/tls.crt: stat /pachd-tls-cert/tls.crt: no such file or directory 
    2019-12-16T18:46:19Z WARNING s3gateway TLS disabled: could not stat public cert at /pachd-tls-cert/tls.crt: stat /pachd-tls-cert/tls.crt: no such file or directory 
    2019-12-16T18:46:20Z INFO validating kubernetes access returned no errors 
    2019-12-16T18:46:49Z INFO error starting githook server context deadline exceeded 
     
    panic: failed to initialize pach client: context deadline exceeded 
     
    goroutine 492 [running]: 
    github.com/pachyderm/pachyderm/src/server/pkg/serviceenv.(*ServiceEnv).GetPachClient(0xc00021f450, 0x2ad81a0, 0xc00053a2c0, 0xc00053a2c0) 
    	src/github.com/pachyderm/pachyderm/src/server/pkg/serviceenv/service_env.go:171 +0x11a 
    github.com/pachyderm/pachyderm/src/server/pps/server.(*apiServer).master.func1(0x0, 0x0) 
    	src/github.com/pachyderm/pachyderm/src/server/pps/server/master.go:58 +0xe5 
    github.com/pachyderm/pachyderm/src/server/pkg/backoff.RetryNotify(0xc00088c220, 0x2a99520, 0xc00061d6e0, 0xc0009fbfb8, 0x2a, 0xc00113f4c0) 
    	src/github.com/pachyderm/pachyderm/src/server/pkg/backoff/retry.go:35 +0x4a 
    github.com/pachyderm/pachyderm/src/server/pps/server.(*apiServer).master(0xc0002bcfc0) 
    	src/github.com/pachyderm/pachyderm/src/server/pps/server/master.go:52 +0x20a 
    created by github.com/pachyderm/pachyderm/src/server/pps/server.NewAPIServer 
    	src/github.com/pachyderm/pachyderm/src/server/pps/server/server.go:67 +0x3d4 
    panic: failed to initialize pach client: context deadline exceeded 
     
    goroutine 513 [running]: 
    github.com/pachyderm/pachyderm/src/server/pkg/serviceenv.(*ServiceEnv).GetPachClient(0xc00021f450, 0x2ad81e0, 0xc0000560d0, 0x7f01cf21a008) 
    	src/github.com/pachyderm/pachyderm/src/server/pkg/serviceenv/service_env.go:171 +0x11a 
    github.com/pachyderm/pachyderm/src/server/transaction/server.newAPIServer.func1(0xc001107260) 
    	src/github.com/pachyderm/pachyderm/src/server/transaction/server/api_server.go:43 +0x48 
    created by github.com/pachyderm/pachyderm/src/server/transaction/server.newAPIServer 
    	src/github.com/pachyderm/pachyderm/src/server/transaction/server/api_server.go:43 +0x103 
    
    

    What you expected to happen?:

    pachd should load successfully

    How to reproduce it (as minimally and precisely as possible)?:

    Anything else we need to know?:

    Environment?:

    • Kubernetes version (use kubectl version):
    Client Version: version.Info{Major:"1", Minor:"16", GitVersion:"v1.16.3", GitCommit:"b3cbbae08ec52a7fc73d334838e18d17e8512749", GitTreeState:"clean", BuildDate:"2019-11-13T11:23:11Z", GoVersion:"go1.12.12", Compiler:"gc", Platform:"linux/amd64"}
    Server Version: version.Info{Major:"1", Minor:"16", GitVersion:"v1.16.3", GitCommit:"b3cbbae08ec52a7fc73d334838e18d17e8512749", GitTreeState:"clean", BuildDate:"2019-11-13T11:13:49Z", GoVersion:"go1.12.12", Compiler:"gc", Platform:"linux/amd64"}
    
    • Pachyderm CLI and pachd server version (use pachctl version):
    COMPONENT           VERSION
    pachctl             1.9.9
    pachd               1.9.9
    
    • Cloud provider (e.g. aws, azure, gke) or local deployment (e.g. minikube vs dockerized k8s): on-premises Rancher 2.3.3 with 7 nodes
    • OS (e.g. from /etc/os-release): Ubuntu 18.04
    • Others:
    opened by benwbooth 27
  • Stuck in "Pulling State" after delete-all

    Hi. I have been working with Pachyderm for several days. Since there are many repos, I used delete-all to delete all of the files and start new experiments. Then I ran into a new problem.

    I used the 'fruit-stand' example to check whether things still work, doing the same operations as I did several days before, but the jobs have been stuck in the pulling state, as shown below, for several hours.

    pachctl list-job
    ID                                 OUTPUT                                    STARTED              DURATION            STATE
    48521d8b6ff23c6ba66665ef6807d0e0   filter/665d9d51d3114a1e98c5bb7f432f8374   About a minute ago   -                   pulling
    649ddf50f993eafc56c5d205b7dd153f   filter/fbc0accb6d4f48cc9caec8de282d8b5a   About a minute ago   -                   pulling
    

    I searched previous issues and found that this happens when Pachyderm cannot find the required image. So I speculate that when I ran `delete-all`, the Docker image was also deleted.

    opened by ShengjieLuo 27
  • Explore micro k8s as minikube alternative

    Mostly a suggestion for documentation purposes. A user reported an easier installation path and MUCH better performance in dev cluster mode.

    Not sure if there are other "gotchas" or limitations, but we've seen a few users asking about it so probably worth documenting.

    feature request docs 
    opened by JoeyZwicker 25
  • start-kube-docker not working in Vagrant image

    Trying to run Pachyderm in Vagrant using the Vagrantfile/init.sh in the GitHub documentation QUICKSTART.md. The gcr.io/google_containers/hyperkube:v1.1.2 container does not start.

    Steps to reproduce:

    vagrant destroy # or download per README.md
    vagrant up
    vagrant ssh
    
    go get github.com/pachyderm/pachyderm/...
    cd ~/go/src/github.com/pachyderm/pachyderm
    etc/kube/start-kube-docker.sh
    
    ~/pachyderm_vagrant$ vagrant version
    Installed Version: 1.7.4
    Latest Version: 1.7.4
    
    You're running an up-to-date version of Vagrant!
    

    Console log: kubeNotStarting.txt

    opened by brinman2002 23
  • Error 'the server has asked for the client to provide credentials'

    Hi. I deployed Pachyderm on another server. The installation was successful:

    COMPONENT           VERSION
    pachctl             1.1.0
    pachd               1.1.0
    

    However, when I run the fruit stand pipeline, every job fails:

    ID                                 OUTPUT                                     STARTED             DURATION             STATE
    d6ddca2dfd72e6a0da7053ba5151b4cb   filter3/1dd4428dcc4d40359e7bcc3cdb594f3b   8 seconds ago       Less than a second   failure
    2610bae2936923f0ce850c04f2cedad3   filter2/d58ac2a1231d4db4bb2487554bf36273   25 minutes ago      Less than a second   failure
    dfe6acbbcd241d55a394a95077df5d1e   filter/a4934ebe280c4e2cae2e6cfb4b1c4c04    2 hours ago         Less than a second   failure
    e9a26e0594f6bd00bacefa33c1b9850a   filter/78a4cea8361c41f3a9a2c8e2f9679bb0    2 hours ago         Less than a second   failure
    

    I tried it several times, but they all fail.

    Here is the pipeline information:

    NAME                INPUT               OUTPUT              STATE
    filter              data                filter              running
    sum3                filter2             sum3                running
    filter2             data                filter2             running
    sum2                filter2             sum2                running
    filter3             data                filter3             running
    sum                 filter              sum                 running
    

    I checked the logs for the problem:

    pachctl get-logs d6ddca2dfd72e6a0da7053ba5151b4cb
    the server has asked for the client to provide credentials (get pods)
    

    I can delete the repo used by this pipeline, but I can't delete the pipeline itself:

    pachctl delete-pipeline filter3
    error from DeletePipeline: the server has asked for the client to provide credentials (get pods)
    

    I searched for the message in the source code:

    src/server/vendor/k8s.io/kubernetes/pkg/api/errors/errors.go 
    case http.StatusUnauthorized:
            reason = unversioned.StatusReasonUnauthorized
            message = "the server has asked for the client to provide credentials"
    

    Unfortunately, I have hit so many problems these days... I have to ask questions every day...

    opened by ShengjieLuo 21
  • Get Pachyderm Working with OpenShift

    Per our discussions, it looks like it may be a privilege issue or something similar. I have attached the steps to install the OpenShift Vagrant image and then deploy Pachyderm. You can troubleshoot through normal Kube commands as well. Everything is created and running, except that the pachd pods do not start.

    https://gist.github.com/munchee13/8cf64f2c1797d1d60891b28a193767f6

    opened by munchee13 21
  • Improve Custom/On-Premise Docs

    There's been a lot of interest in on-prem clusters lately, and our docs on the subject aren't very good. The main issues that people have been hitting are:

    • When to use the custom deploy
    • Confusion around how to use the custom deploy for cloud vs. on prem deploys
    • How to modify the manifest for on prem deploys
    • What needs to be changed in the deploy process for OpenShift, OpenStack, and other systems.
    • NotExist error with CEPH S3 interface deploy

    I think the following will probably make the process better for on prem users:

    • Use the Helm chart as default for on prem deploys. Our users seem to indicate in Slack that this is much easier.
    • Update the custom/on-prem docs to emphasize custom object stores in the cloud vs. completely on-prem solutions.
    • Test the custom deploy commands and the Helm chart to see if updates are needed for the latest Pachyderm versions.
    docs openshift size: XL priority: high 
    opened by dwhitena 20
  • Add support for minio deploy

    This adds support for Minio and all other S3-compatible servers. This patch also uses minio-go. This has an added benefit: it can be used with S3 transparently as well.

    opened by harshavardhana 20
  • pipelines stalling

    I have a pachyderm repo with a few commits in it (each a separate branch). Each commit is about 100MB in size. The processing step begins and starts outputting data into the next pipeline, but stops working after about 1MB of data output. If I force-finish the commit and inspect the output, I see that only some of the expected output was generated, and some of the files sometimes list the Unix epoch as their creation date. Moreover, the pipeline takes an exceptionally long time to run compared to running it outside of Pachyderm.

    opened by JonathanFraser 20
  • Contexts

    This introduces a new, backwards-incompatible version of configs, a migration to update old configs, and the related behavioral changes to pachctl from the new config.

    Closes #3774 Fixes #3538 Closes #3036 Fixes #3419

    Contexts

    The largest addition is contexts, which are akin to kubectl contexts. Instead of building off of config V1 and having a different config file for each context (as proposed in #3774), this stores all contexts in a single config file. The reasons for this:

    1. The hope with multiple files was that we could save effort by just building off of config V1. I no longer think that is the case, because bolting contexts on top of that design appears to be just as much work, if not more.
    2. This design avoids ambiguity when a user sets PACH_CONFIG that the multiple config file design has.
    3. It more closely follows k8s' approach.

    Contexts have the same fields as config V1, plus a context source field, which specifies where contexts came from.

    Active context

    Config V2 contains a reference to the currently active context. The active context can be overridden via the env var PACH_CONTEXT.
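
    For illustration, here is a rough Go sketch of the config shape and the PACH_CONTEXT override described above. The type, field, and function names are hypothetical stand-ins, not the actual types in Pachyderm's config package:

    package config

    import "os"

    // ConfigV2 is a hypothetical sketch of the V2 config: all contexts live in
    // one file, keyed by name, plus a reference to the currently active one.
    type ConfigV2 struct {
        ActiveContext string             // name of the currently active context
        Contexts      map[string]Context // all known contexts, keyed by name
    }

    // Context carries the same connection fields as config V1, plus a source
    // field recording where the context came from (migration, deployment, etc.).
    type Context struct {
        PachdAddress string
        Source       string
    }

    // ActiveContextName honors the PACH_CONTEXT environment variable override;
    // otherwise it falls back to the active context recorded in the config.
    func ActiveContextName(cfg ConfigV2) string {
        if name := os.Getenv("PACH_CONTEXT"); name != "" {
            return name
        }
        return cfg.ActiveContext
    }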

    Metrics

    Rather than having a global flag --no-metrics that needs to be passed in for each call to pachctl, the ability to disable metrics is now specified in the config.

    Config implementation changes

    Configs are now read only once per run of pachctl. Before, the config was read multiple times, which allowed for subtle bugs (e.g., if the user ID wasn't yet set, it would be reset multiple times). I did not add any further locking to ensure changes can't overwrite each other. Given the current uses of configs (which are predominantly read-only), I think this is safe enough for now, but it is certainly not bulletproof!
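
    A minimal sketch of the read-once behavior using sync.Once, continuing the hypothetical ConfigV2 sketch above (the real loader parses the JSON config file and runs the migration):

    package config

    import "sync"

    var (
        readOnce     sync.Once
        cachedConfig *ConfigV2
        readErr      error
    )

    // Read loads the config at most once per process and returns the cached
    // value on every later call, so a single pachctl run can't re-read (and
    // accidentally re-initialize) the config.
    func Read() (*ConfigV2, error) {
        readOnce.Do(func() {
            cachedConfig, readErr = loadFromDisk()
        })
        return cachedConfig, readErr
    }

    // loadFromDisk is a placeholder for the real file parsing and migration.
    func loadFromDisk() (*ConfigV2, error) {
        return &ConfigV2{Contexts: map[string]Context{}}, nil
    }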

    Migrations

    A migration is run the first time the config is read. The V1 config is turned into a context.

    Deployments

    When a (non-dry-run) deployment succeeds, a new pach context is created, and the user is automatically switched to the new context.

    New commands

    • pachctl config get metrics - gets whether metrics are enabled
    • pachctl config set metrics (true|false) - sets whether metrics are enabled
    • pachctl config get active-context - gets the active context
    • pachctl config set active-context [name] - sets the active context
    • pachctl config get context [name] - gets a context config JSON by name
    • pachctl config set context [name] [--overwrite] - sets a context config from JSON stdin
    • pachctl config update context [name] --pachd-address=[address] - updates the pachd address of an existing context (this is the only field that is updatable without completely overwriting a context, at the moment.)
    • pachctl config delete context [name] - removes a context
    • pachctl config list context - lists all contexts

    Removals

    The global --no-metrics and --no-port-forwarding flags were removed, in favor of config values.

    opened by ysimonson 19
  • Better documentation for cluster API ingress

    It seems a lot of user questions center around how API ingress works. I didn't find docs about this, so I just wrote this up for a customer. It should probably be migrated into our docs if we don't have something like this already.

    To clarify about ingress/NodePorts: Pachyderm doesn't really care how a user gets access to it, so long as their local pachctl client can talk to the pachd pod. pachctl has built-in support for two different methods: setting PACHD_ADDRESS, and using pachctl port-forward.

    1. Setting the PACHD_ADDRESS env var to point at a host:port that directs traffic to the pachd pod tells pachctl to talk to that endpoint directly. This is the flow NodePort supports -- it makes the internal pachd pod's API port accessible on the cluster's external address, so that users can set PACHD_ADDRESS=cluster-address:30650 (30650 is the default) and the k8s/OC cluster will send that traffic to the pachd pod. Because this port is a global resource at the k8s/OC cluster level, it needs to be unique per Pachyderm cluster, but it can be changed to whatever you want for a given Pachyderm deployment and shouldn't affect Pachyderm's internal operation (so long as the user of pachctl has the right value in their PACHD_ADDRESS variable).

    2. ~~pachctl port-forward piggy-backs on kubectl to fetch the name of the pachd pod within the k8s cluster/namespace the user is currently connected to, and then runs kubectl port-forward to direct traffic from the user's local machine to pachd via the k8s API. In this case, setting the PACHD_ADDRESS variable isn't needed, but the user needs to have k8s access set up, pointing at the namespace for their pachyderm cluster~~

    Edited, based on @ysimonson's suggestion:

    1. pachctl port-forward piggy-backs off kubectl's config file and client API -- it reads kubectl's config file to fetch the name of the pachd pod within the k8s cluster/namespace the user is currently connected to, then uses the kubernetes API to effectively run kubectl port-forward. This directs traffic from the user's local machine to pachd via the k8s API. If using pachctl port-forward, then setting the PACHD_ADDRESS variable isn't needed. Instead, the user needs to have k8s access set up, pointing at the namespace for their pachyderm cluster

    As of 1.8.3, pachctl port-forward happens automatically when running any pachctl command that tries to access pachd, but in order to open a persistent tunnel to a number of other ports Pachyderm uses (the dashboard, git and auth hooks, the built-in HTTP file API, etc.), users will still need to run pachctl port-forward explicitly.

    Also, it seems pachctl port-forward isn't working with OpenShift, but the following oc port-forward command does effectively the same thing:

    PACHD_POD_NAME=`oc get pod --output=json | jq -r '.items[] | select(.metadata.name|startswith("pachd")).metadata.name'`  #  -r flag is needed to not get quotes in the output
    
    oc port-forward pod/$PACHD_POD_NAME 30650:1650
    
    docs openshift size: L priority: high solutions-architecture 
    opened by gabrielgrant 19
  • Can't run pachctl on WSL2

    What happened?:

    Following the local pachyderm instructions (running on WSL2 / 20.04):

    • Install homebrew and run the Next steps
      • tested everything works via brew install hello
    • install pachctl via brew tap pachyderm/tap && brew install pachyderm/tap/pachctl@2.4

    Trying to run pachctl gives the following message:

    pachctl
    zsh: permission denied: pachctl
    
    # same when running it via bash:
    /bin/bash pachctl
    /home/linuxbrew/.linuxbrew/bin/pachctl: /home/linuxbrew/.linuxbrew/bin/pachctl: cannot execute binary file
    

    What you expected to happen?:

    Not get the permission denied: pachctl message.

    How to reproduce it (as minimally and precisely as possible)?:

    # run the next steps as recommended from the following command too
    /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
    
    brew tap pachyderm/tap && brew install pachyderm/tap/pachctl@2.4
    pachctl
    

    Anything else we need to know?: Installing other packages via brew seems to work, so I don't think it's a homebrew issue. (E.g., I can finish the local deploy guide, including the helm install via brew.)

    Environment?:

    • Kubernetes version (use kubectl version):
    • Pachyderm CLI and pachd server version (use pachctl version):
    • Cloud provider (e.g. aws, azure, gke) or local deployment (e.g. minikube vs dockerized k8s):
    • If you deployed with helm, the values you used (helm get values pachyderm):
    • OS (e.g. from /etc/os-release):
    • Others:

    This is on WSL2 .

    cat /etc/os-release
    NAME="Ubuntu"
    VERSION="20.04.5 LTS (Focal Fossa)"
    ID=ubuntu
    ID_LIKE=debian
    PRETTY_NAME="Ubuntu 20.04.5 LTS"
    VERSION_ID="20.04"
    HOME_URL="https://www.ubuntu.com/"
    SUPPORT_URL="https://help.ubuntu.com/"
    BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
    PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
    VERSION_CODENAME=focal
    UBUNTU_CODENAME=focal
    

    The permissions for linuxbrew look alright:

    ls -lhA /home/linuxbrew/.linuxbrew/bin/pachctl
    lrwxrwxrwx 1 mheiser mheiser 40 Dec 27 09:14 /home/linuxbrew/.linuxbrew/bin/pachctl -> ../Cellar/pachctl@2.4/v2.4.2/bin/pachctl
    
    bug 
    opened by Persedes 6
  • [2.4.x backport][Jupyter] Fix datums-related error message when notebooks starts up

    Make the datums request at Notebooks startup only when connected to a cluster and, if auth is enabled, logged in (i.e., when the mount server response is 200).

    Ticket: https://linear.app/pachyderm/issue/INT-760/error-message-when-starting-up-notebooks

    JIRA: INT-782

    opened by smalyala 1
  • Warn about outdated pachctl

    This implements some compatibility checking between pachctl (actually any Go client) and pachd.

    Compatibility is defined as:

    • Either the client or the server is 0.0.0, as developer builds are.
    • If either the client or the server is a pre-release (nightly/alpha/beta/rc), then the server and client versions have to be an exact match.
    • Otherwise, the major and minor numbers have to be the same.

    See the test cases in version_test.go.
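
    For illustration, here is a standalone sketch of that policy using only the standard library (this is not the code behind version_test.go, just the rules above restated in Go):

    package main

    import (
        "fmt"
        "strings"
    )

    // compatible applies the rules above: dev builds (0.0.0) always pass,
    // pre-releases require an exact match, and releases only need to agree
    // on major.minor.
    func compatible(client, server string) bool {
        if release(client) == "0.0.0" || release(server) == "0.0.0" {
            return true
        }
        if isPreRelease(client) || isPreRelease(server) {
            return client == server
        }
        return majorMinor(client) == majorMinor(server)
    }

    // release strips any pre-release or build suffix, e.g. "2.5.0-rc.1" -> "2.5.0".
    func release(v string) string {
        if i := strings.IndexAny(v, "-+"); i >= 0 {
            return v[:i]
        }
        return v
    }

    func isPreRelease(v string) bool { return strings.Contains(v, "-") }

    // majorMinor returns the "major.minor" prefix, e.g. "2.4.2" -> "2.4".
    func majorMinor(v string) string {
        parts := strings.SplitN(release(v), ".", 3)
        if len(parts) < 2 {
            return release(v)
        }
        return parts[0] + "." + parts[1]
    }

    func main() {
        fmt.Println(compatible("2.4.2", "2.4.5"))           // true: same major.minor
        fmt.Println(compatible("2.4.2", "2.5.0"))           // false: minor differs
        fmt.Println(compatible("2.5.0-rc.1", "2.5.0-rc.1")) // true: exact pre-release match
        fmt.Println(compatible("0.0.0", "2.5.0"))           // true: dev build
    }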

    The client always calls InspectCluster before connecting, so this PR modifies InspectCluster to take the client's version as a parameter. The server then checks that version against its own version, and returns warnings in the response. There is also a version_warnings_ok flag. Old servers won't set that (since it's not in the message version they have), so the client can detect a way-too-old server. If there are any warnings set, the client will log them at level error. Technically this would be intrusive to users of the Go client, but since the pctx.TODO() logger points to a no-op logger until someone calls InitPachctlLogger() and that's a symbol they can't import, only pachctl users will ever see this.

    It is a little weird to change InspectCluster from taking Empty to taking a message type, but it seems perfectly safe to me. The Go client API doesn't change; only people that directly generate stubs and call methods on it (like some of our tests) are affected. Users of the Go client that want to send a version have the option to do so with a new function in the client, InspectClusterWithVersion.

    The server logs at INFO level whenever an incompatible client is detected, so even if users miss the warnings, administrators can know.

    Here's an example of what it looks like in pachctl (runs for every command, can't be turned off). In this particular case, a "released" client is talking to a nightly build, which requires an exact version match between client and server:

    [screenshot of the pachctl warning output]

    And the server logs: [screenshot]

    The server log is the same for every case (modulo the error field), but the client message varies based on the constants in admin/api_server.go. Feel free to wordsmith them.

    Annoyingly, we don't seem to send an auth token with InspectCluster, so the server can't report the user name that is using an out of date client. We should probably do something about that.

    opened by jrockway 1
  • Increase reliability of debug dumps

    Fixes CORE-1193 and CORE-1294.

    This PR does a bunch of stuff to make debug dumps more reliable, at least without burning the whole thing down and starting over.

    pachctl debug dump can now specify a timeout; it defaults to 30m.

    The timeout is adjusted down on the server side to about 90% of the client timeout. That means the debug dumper has some time to handle context deadline exceeded and start producing output before the RPC is totally aborted. I've had good results with timeouts as low as 100ms; you don't get everything, but you get some files. At 30m it should be Really Good (tm).
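
    A rough sketch of that server-side adjustment, assuming the handler derives its working deadline from the incoming RPC context (names here are illustrative, not the actual debug-server code):

    package debugserver

    import (
        "context"
        "time"
    )

    // shrinkDeadline returns a child context whose deadline is roughly 90% of
    // whatever time remains on the incoming RPC context, leaving the dumper a
    // window to catch "context deadline exceeded" and still flush partial output.
    func shrinkDeadline(ctx context.Context) (context.Context, context.CancelFunc) {
        deadline, ok := ctx.Deadline()
        if !ok {
            // No client timeout; just make the context cancellable.
            return context.WithCancel(ctx)
        }
        remaining := time.Until(deadline)
        return context.WithTimeout(ctx, remaining*9/10)
    }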

    Every multi-step operation that the dumper does now continues in the face of errors, if the error doesn't affect the next thing. Every for loop or function that does two+ things now uses multierr.Append to collect all the errors. That means if we hit an issue where we try to do something silly like InspectPipeline an input repo, we just continue doing the whole debug dump anyway. At the very end, an error will be returned, but we can still write all the other files.
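
    A small sketch of that error-accumulation pattern with go.uber.org/multierr; the collector functions stand in for the individual dump steps:

    package debugserver

    import "go.uber.org/multierr"

    // runAll executes every collector even if earlier ones fail, gathering all
    // errors into one. The caller still gets a non-nil error at the end, but
    // every file that could be written has been written.
    func runAll(collectors ...func() error) error {
        var errs error
        for _, collect := range collectors {
            // multierr.Append ignores nil errors, so successful collectors add nothing.
            errs = multierr.Append(errs, collect())
        }
        return errs
    }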

    I fixed the thing where we did InspectPipeline on an input repo; there was a missing continue statement. I also tried to fix PPS's error message for a pipeline not being found, but it's actually not relevant to this PR. (I don't think the code can ever hit the case I "fixed", but in case it does, hey now the error type is correct. We still don't return grpc.status = NotFound from PPS under any of these circumstances though.)

    I added some arbitrary timeouts around things I don't think will be too slow, like we did for Loki.

    I noticed that the Pod Describer from the k8s library can't take a context. That means it could run forever, so I put it in a background goroutine; the foreground goroutine tries to get its output until the context expires, and then it just abandons it and moves on. This will leak memory if it runs forever, but hey, after we review the debug dump we'll probably tell you to restart pachd anyway. In the future we'll have to just collect pod YAMLs instead of "describe" output. Or fork k8s.io/client-go to make the silly thing take a context.
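
    A sketch of that background-goroutine pattern for a call that can't take a context (the describe argument stands in for the k8s pod describer):

    package debugserver

    import "context"

    // callWithContext runs a blocking, non-cancellable call in its own goroutine
    // and waits for it only until ctx expires; after that, the goroutine (and any
    // memory it holds) is abandoned rather than blocking the dump forever.
    func callWithContext(ctx context.Context, describe func() (string, error)) (string, error) {
        type result struct {
            out string
            err error
        }
        ch := make(chan result, 1) // buffered so an abandoned goroutine can still exit
        go func() {
            out, err := describe()
            ch <- result{out, err}
        }()
        select {
        case r := <-ch:
            return r.out, r.err
        case <-ctx.Done():
            return "", ctx.Err()
        }
    }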

    As an example, here's what a run with an aggressive timeout looks like now:

    $ rm dump.tgz; /usr/bin/time pachctl debug dump dump.tgz  --timeout=1s ; tar tzvf dump.tgz; du -h dump.tgz
    rpc error: code = Unknown desc = listPipelines: context deadline exceeded; appLogs: context deadline exceeded; collectDatabaseDump: collectDatabaseTables: list tables: context deadline exceeded
    Command exited with non-zero status 1
    0.09user 0.02system 0:01.04elapsed 11%CPU (0avgtext+0avgdata 66060maxresident)k
    0inputs+2176outputs (0major+2005minor)pagefaults 0swaps
    -rwxrwxrwx 0/0            6214 1969-12-31 19:00 source-repos/default/benchmark-upload/commits.json
    -rwxrwxrwx 0/0           52020 1969-12-31 19:00 source-repos/default/benchmark-upload/commits-chart.png
    -rwxrwxrwx 0/0            8083 1969-12-31 19:00 source-repos/default/images/commits.json
    -rwxrwxrwx 0/0           45961 1969-12-31 19:00 source-repos/default/images/commits-chart.png
    -rwxrwxrwx 0/0              17 1969-12-31 19:00 pachd/pachd-84f6794987-74hf2/pachd/version.txt
    -rwxrwxrwx 0/0            7612 1969-12-31 19:00 pachd/pachd-84f6794987-74hf2/pachd/describe.txt
    -rwxrwxrwx 0/0         8690350 1969-12-31 19:00 pachd/pachd-84f6794987-74hf2/pachd/logs.txt
    -rwxrwxrwx 0/0              80 1969-12-31 19:00 pachd/pachd-84f6794987-74hf2/pachd/logs-previous/error.txt
    -rwxrwxrwx 0/0         8640042 1969-12-31 19:00 pachd/pachd-84f6794987-74hf2/pachd/logs-loki.txt
    -rwxrwxrwx 0/0           22422 1969-12-31 19:00 pachd/pachd-84f6794987-74hf2/pachd/go_info.txt
    -rwxrwxrwx 0/0           11559 1969-12-31 19:00 pachd/pachd-84f6794987-74hf2/pachd/goroutine
    -rwxrwxrwx 0/0           84444 1969-12-31 19:00 pachd/pachd-84f6794987-74hf2/pachd/heap
    -rwxrwxrwx 0/0              26 1969-12-31 19:00 database/activities/error.txt
    -rwxrwxrwx 0/0              26 1969-12-31 19:00 database/row-counts/error.txt
    -rwxrwxrwx 0/0              26 1969-12-31 19:00 database/table-sizes/error.txt
    1.1M    dump.tgz
    

    We end up with data (and a long chain of error messages) even if we hit timeouts.

    opened by jrockway 1
  • dlock: add logging around lock acquisition and release

    It's often interesting to have information about when locks are acquired or lost, so this adds it around all uses of DLock. The actual calls to Lock/TryLock/Unlock are wrapped in a span, reporting how long it took to acquire or release the lock, and any errors that might have occurred. The time spent waiting for the lock is reported as the spanDuration on the DLock.Lock (etc.) span, and all messages that are logged using the returned context have a withLock and locked field, to make it clear where the context came from. (The lock timing spans also have a withLock field, but locked isn't set until the lock is actually acquired.)
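
    A rough sketch of the idea using the standard library logger in place of Pachyderm's internal pctx/log spans (DLock's real signature differs; this only shows the timing-and-annotation pattern):

    package dlocksketch

    import (
        "context"
        "log"
        "sync"
        "time"
    )

    type lockNameKey struct{}

    // lockWithLogging times the acquisition, logs how long it took, and returns
    // a context tagged with the lock name so later log lines can report which
    // lock they ran under.
    func lockWithLogging(ctx context.Context, name string, mu sync.Locker) context.Context {
        start := time.Now()
        mu.Lock()
        log.Printf("acquired lock %q after %v", name, time.Since(start))
        return context.WithValue(ctx, lockNameKey{}, name)
    }

    // unlockWithLogging releases the lock and logs how long it was held.
    func unlockWithLogging(name string, mu sync.Locker, heldSince time.Time) {
        mu.Unlock()
        log.Printf("released lock %q after holding it for %v", name, time.Since(heldSince))
    }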

    Here's what the chunk GC looks like starting up:

    [screenshot of the chunk GC log output]

    From this, we can see that we waited 21.86 seconds to take the lock, and that several GC runs have occurred while holding that lock. (If there was an error, that would also be logged.)

    The span only tracks time spent actually interacting with the locking machinery; the total time the lock was held is reported at the end though.

    When unlocking, we identify the lock by the prefix field instead of withLock. That's so that you can compare the two and see which context is being used to gate the unlocking operation vs. which lock is being unlocked.

    opened by jrockway 1