refactor: inject url path parts instead of endpoints #315

tdstein · 2024-10-25T16:32:47Z

Refactors path building responsibilities to the creating action, eliminating a ton of complexity along the way.
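As a rough illustration of what injecting path parts can look like (a minimal sketch, not the code in this PR; `Context`, `Resource`, `Content`, and the example URL are hypothetical names), the creating object hands a relative path to the child, and the full URL is assembled only when it is needed:

```python
from dataclasses import dataclass


@dataclass
class Context:
    """Hypothetical stand-in for the shared client context."""

    url: str  # base API URL, assumed to end with a slash


class Resource:
    """Receives a relative path from its creator instead of a full endpoint."""

    def __init__(self, ctx: Context, path: str):
        self._ctx = ctx
        self._path = path

    def endpoint(self) -> str:
        # The full URL is derived from the context and the injected path.
        return self._ctx.url + self._path


class Content(Resource):
    def jobs(self) -> Resource:
        # The creating action extends its own path for the child resource.
        return Resource(self._ctx, self._path + "jobs/")


ctx = Context("https://connect.example.com/__api__/")
content = Content(ctx, "v1/content/abc/")
jobs = content.jobs()
```

With this shape, the child never needs to know how its parent's URL was built; it only concatenates `ctx.url` and its own path.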

github-actions · 2024-10-25T16:33:19Z

☂️ Python Coverage

current status: ✅

Overall Coverage

Lines	Covered	Coverage	Threshold	Status
1415	1356	96%	0%	🟢

New Files

No new covered files...

Modified Files

File	Coverage	Status
src/posit/connect/content.py	99%	🟢
src/posit/connect/jobs.py	100%	🟢
src/posit/connect/resources.py	95%	🟢
src/posit/connect/vanities.py	94%	🟢
TOTAL	97%	🟢

updated for commit: b3ff1cd by action🐍

tdstein · 2024-10-25T16:34:05Z

@schloerke - I applied more of your feedback, which helped reduce the complexity of endpoint creation. I think this is in the right direction. You can ignore the docstrings for now. I'll mark the PR as ready for review once I have those fixed.

toph-allen

I think I need a bit more of an explanation about how this is working before I fully understand it.

schloerke

Drop @property
Discuss cache invalidation of self._cache

Co-authored-by: Barret Schloerke

…endpoint-injection

…-injection

tdstein · 2024-10-30T18:21:18Z

@schloerke / @toph-allen—This is ready for final review. I believe all of the points we discussed have been addressed.

I am pretty happy with the implementation now. Thanks for all of your thoughts and attention to detail!

tdstein · 2024-10-30T18:26:11Z

src/posit/connect/resources.py

+        endpoint = self._ctx.url + self._path + uid
+        response = self._ctx.session.get(endpoint)
+        result = response.json()
+        return self._to_instance(result)


Note that I removed the call to invalidate the cache that existed a few commits before. After some additional consideration, I concluded that invalidating the cache is an unwanted side effect.

I think there is still an argument for invalidating the cache or appending the instance to the cached list. But, I don't think we have a good enough understanding of the side effects to proceed with either implementation.

Why not just call .find_by() all the time?

If the cache exists, ._data returns quickly. If not, it asks the server.
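For readers unfamiliar with the pattern, the lazy behaviour described here can be sketched roughly as follows. This is an assumption-laden illustration, not the library's implementation: `LazyCollection` and the `fetch` callable are hypothetical, and `fetch` stands in for the HTTP request that lists the whole collection.

```python
class LazyCollection:
    def __init__(self, fetch):
        self._fetch = fetch  # hypothetical callable standing in for the server call
        self._cache = None

    def cached(self) -> bool:
        return self._cache is not None

    @property
    def _data(self):
        # Ask the server only on first access; afterwards reuse the cached list.
        if self._cache is None:
            self._cache = self._fetch()
        return self._cache

    def find_by(self, **conditions):
        # Linear scan over the (possibly freshly fetched) collection.
        for item in self._data:
            if all(item.get(k) == v for k, v in conditions.items()):
                return item
        return None


calls = []


def fake_fetch():
    calls.append("GET")  # record each simulated request
    return [{"key": "a"}, {"key": "b"}]


collection = LazyCollection(fake_fetch)
first = collection.find_by(key="b")   # triggers the single fetch
second = collection.find_by(key="a")  # served from the cache
```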

Then both methods have the same quirks. (Whereas find() will not alter the cache, find_by will, so a follow-up call to find() uses the cached values and behaves differently.)

I think in either situation, we end up with conflicting ideas.

If we always depend on find_by, we must fetch the entire collection before returning a single value, which would take significantly longer than a single GET request for the value.

Today, I think the obvious solution would be to always call the HTTP GET method to get the value from the server. This will sometimes be slightly slower than an in-memory list scan. But in reality, it's going to be a negligible difference. The weird edge case with this solution is when another process creates the value fetched by GET after the _cache is set. In this situation, the value returned by find will exist on the server but not in the _cache. This would probably warrant a cache invalidation. But that would take extra time to compute and may not be consistent behavior across all endpoints.
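The edge case can be reproduced with an in-memory stand-in for the server. This is only a sketch under stated assumptions: `FakeServer` and `Jobs` are hypothetical classes, `"key"` is an assumed uid field, and no real HTTP requests are made.

```python
class FakeServer:
    """In-memory stand-in for the server; not a real HTTP client."""

    def __init__(self, items):
        self.items = {item["key"]: item for item in items}

    def list(self):
        return list(self.items.values())

    def get(self, key):
        return self.items[key]


class Jobs:
    def __init__(self, server):
        self._server = server
        self._cache = None

    def find_by(self, **conditions):
        # Populates the cache on first use, then scans it.
        if self._cache is None:
            self._cache = self._server.list()
        for item in self._cache:
            if all(item.get(k) == v for k, v in conditions.items()):
                return item
        return None

    def find(self, key):
        # Always a single lookup on the server; never reads or writes the cache.
        return self._server.get(key)


server = FakeServer([{"key": "a"}])
jobs = Jobs(server)
jobs.find_by(key="a")              # cache now holds only "a"
server.items["b"] = {"key": "b"}   # another process creates "b"

from_server = jobs.find("b")       # found: the server knows about "b"
from_cache = jobs.find_by(key="b") # not found: the cache is stale
```

The two lookups disagree until the cache is invalidated, which is the inconsistency discussed above.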

tl;dr - the speedup via find_by probably isn't worth the trouble. Classic over-engineering.

schloerke · 2024-10-30T20:22:54Z

src/posit/connect/resources.py

+        if self.cached():
            conditions = {self._uid: uid}
            result = self.find_by(**conditions)
-        else:
-            endpoint = self._endpoint + uid
-            response = self._ctx.session.get(endpoint)
-            result = response.json()
-            result = self._create_instance(**result)
-
-        if not result:
-            raise ValueError(f"Failed to find instance where {self._uid} is '{uid}'")
+            if result:
+                return result

-        return result
+        endpoint = self._ctx.url + self._path + uid
+        response = self._ctx.session.get(endpoint)
+        result = response.json()
+        return self._to_instance(result)


The whole function body could be something like...

    conditions = {self._uid: uid}
    result = self.find_by(**conditions)
    if result is not None:
        return result
    # Literal braces in an f-string are written as {{ and }}.
    raise ValueError(f'Object `{{ "{self._uid}": "{uid}" }}` could not be found')

(untested)
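A self-contained version of that idea, for anyone who wants to try it outside the library. This is a sketch, not the merged code: `Collection` is a hypothetical class, and `"key"` is an assumed uid field.

```python
class Collection:
    def __init__(self, items, uid="key"):
        self._items = items  # stands in for the cached or fetched data
        self._uid = uid

    def find_by(self, **conditions):
        for item in self._items:
            if all(item.get(k) == v for k, v in conditions.items()):
                return item
        return None

    def find(self, uid):
        # Delegate to find_by and turn a miss into an explicit error.
        result = self.find_by(**{self._uid: uid})
        if result is not None:
            return result
        raise ValueError(f'Object `{{ "{self._uid}": "{uid}" }}` could not be found')


collection = Collection([{"key": "a"}])
found = collection.find("a")

try:
    collection.find("b")
    message = None
except ValueError as err:
    message = str(err)
```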

toph-allen

looks good to me, pending resolving @schloerke’s comments :)

* main: build: Embrace ruff (#319) refactor: inject url path parts instead of endpoints (#315)

tdstein and others added 10 commits October 18, 2024 09:07

feat: add jobs

0533f19

--wip-- [skip ci]

6b79912

refactor: introduce the active pattern

279fcd6

add link to parent

e349870

skip when Quarto unavailable

533839b

adds unit tests

1066ca3

adds docstrings

437c515

Update src/posit/connect/resources.py

a1ca377

applies feedback discussed in pull requests

82b9b7e

refactor: inject url path parts instead of endpoints

6b8126d

tdstein requested a review from schloerke October 25, 2024 16:32

tdstein added 2 commits October 28, 2024 11:28

update docstrings

b64f3e7

renames init arguments to path and pathinfo

f57340d

tdstein force-pushed the tdstein/jobs-endpoint-injection branch from 902d271 to f57340d October 28, 2024 16:28

minor cleanup

72b62ac

tdstein force-pushed the tdstein/jobs-endpoint-injection branch from 8b9f08a to 72b62ac October 28, 2024 16:39

tdstein requested review from toph-allen and zackverham October 28, 2024 16:47

tdstein marked this pull request as ready for review October 28, 2024 16:47

toph-allen reviewed Oct 29, 2024
