Add regression tests for a cloud-based Neon instance #8681

a-masterov · 2024-08-09T14:30:14Z

Problem

We need to be able to run the regression tests against a cloud-based Neon instance in order to prepare the migration to the arm architecture.

Summary of changes

Some tests were modified to work on the cloud instance (i.e. added passwords, server-side copy changed to client-side, etc)

Checklist before requesting a review

I have performed a self-review of my code.
If it is a core feature, I have added thorough tests.
Do we need to implement analytics? if so did you add the relevant metrics to the dashboard?
If this PR requires public announcement, mark it with /release-notes label and add several sentences in this section.

Checklist before merging

Do not forget to reformat commit message to not include the above checklist

github-actions · 2024-08-09T15:21:47Z

5029 tests run: 4865 passed, 0 failed, 164 skipped (full report)

Flaky tests (3)

Postgres 16

test_replica_start_scan_clog_crashed_xids: release-arm64

Postgres 14

test_lfc_resize: release-arm64
test_neon_cli_basics: release-arm64

Code coverage* (full report)

functions: 32.1% (7443 of 23213 functions)
lines: 49.9% (59938 of 120174 lines)

* collected from Rust tests only

_{The comment gets automatically updated with the latest test results
0bbe5c4 at 2024-09-24T06:52:44.248Z :recycle:}

This reverts commit 234d328.

.github/workflows/cloud-regress.yml

test_runner/cloud_regress/test_cloud_regress.py

.github/workflows/cloud-regress.yml

Change from the step to the env variable Co-authored-by: Alexander Bayandin <[email protected]>

This reverts commit f08e6ab.

This reverts commit feb6eaa.

This is a pre requisite for #8681

.github/workflows/cloud-regress.yml

patches/cloud_regress_pg16.patch

test_runner/cloud_regress/test_cloud_regress.py

coderabbitai · 2024-09-23T13:32:38Z

Walkthrough

The changes introduce a new GitHub Actions workflow for running daily regression tests on a PostgreSQL database, along with a set of regression tests implemented using pytest. The workflow includes various steps such as code checkout, patch application, password generation, test execution, and reporting. Additionally, it modifies a regex pattern to enhance the specificity of file matching related to regression outputs.

Changes

File	Change Summary
`.github/workflows/cloud-regress.yml`	New workflow for "Cloud Regression Test" with daily scheduling and manual triggers, including jobs for setup, testing, and reporting.
`test_runner/cloud_regress/test_cloud_regress.py`	New regression tests for a cloud instance of Neon using `pytest`, with setup and teardown processes for database management.
`test_runner/fixtures/utils.py`	Updated regex pattern in `ATTACHMENT_NAME_REGEX` to include "regression.out" for enhanced specificity in file matching.

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share

Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
-- I pushed a fix in commit <commit_id>, please review it.
-- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
-- @coderabbitai generate unit testing code for this file.
-- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
-- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
-- @coderabbitai read src/utils.ts and generate unit testing code.
-- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
-- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 5

Outside diff range and nitpick comments (5)

.github/workflows/cloud-regress.yml (3)
34-34: Add a comment explaining the use of the pinned image

As suggested in a previous review, it would be helpful to add a comment explaining why the pinned build-tools image is used. This enhances the maintainability of the workflow.

Consider adding a comment like this:
      # We use the pinned build-tools image to ensure consistency across runs
      # and to avoid storage over-consumption. For more details, see:
      # https://github.com/neondatabase/neon/blob/main/.github/workflows/pin-build-tools-image.yml
      image: neondatabase/build-tools:pinned
37-45: LGTM: Checkout and patch steps, but add a comment for the patch

The checkout and patch steps are well-structured:

Checking out submodules ensures all necessary code is available.

The patching step applies custom modifications to the PostgreSQL source.

Consider adding a brief comment explaining the purpose of the patch:
      - name: Patch the test
        run: |
          # Apply custom modifications for cloud environment compatibility
          cd "vendor/postgres-v${DEFAULT_PG_VERSION}"
          patch -p1 < "../../patches/cloud_regress_pg${DEFAULT_PG_VERSION}.patch"
84-84: Consider using a dedicated project for regression tests

The current setup uses a shared project for running regression tests, as indicated by the BENCHMARK_CONNSTR secret. This approach might lead to potential conflicts or inconsistent results if the shared project is modified by other processes.

Consider creating a dedicated project for these regression tests to ensure isolation and consistency. This may involve:

Creating a new secret for a dedicated test project connection string.

Implementing logic to create and tear down the test project for each run.

Updating the workflow to use the new dedicated project.

This change would enhance the reliability and reproducibility of your regression tests.
test_runner/cloud_regress/test_cloud_regress.py (2)
86-86: Simplify port retrieval using dict.get()

You can simplify the retrieval of the port value by using the get() method with a default value, making the code more concise and readable.

Apply this change:
         "PGPORT": str(
-            remote_pg.default_options["port"] if "port" in remote_pg.default_options else 5432
+            remote_pg.default_options.get("port", 5432)
         ),
75-77: Enhance function docstring for clarity

The docstring for test_cloud_regress is minimal. Provide a more detailed description to clarify the purpose and behavior of the test function.

Update the docstring:
     """
-    Run the regression tests
+    Run the PostgreSQL regression tests against a cloud-based Neon instance.
+    This test sets up the environment and executes the regression suite
+    using the pg_regress tool to ensure compatibility with the cloud instance.
     """

Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

Commits

Files that changed from the base of the PR and between 59b4c2e and 04f06b6.

Files selected for processing (3)

.github/workflows/cloud-regress.yml (1 hunks)
test_runner/cloud_regress/test_cloud_regress.py (1 hunks)
test_runner/fixtures/utils.py (1 hunks)

Additional comments not posted (8)

.github/workflows/cloud-regress.yml (3)

1-11: LGTM: Well-configured workflow triggers

The workflow trigger configuration is well-structured:

Daily scheduled run at an off-peak hour (1:45 AM UTC) is suitable for regression testing.

Manual trigger option (workflow_dispatch) allows for on-demand testing.

Clear comments explaining the cron syntax enhance maintainability.

13-20: LGTM: Robust default and concurrency settings

The default run configuration and concurrency settings are well-defined:

Bash shell with -euxo pipefail ensures strict error handling, which is crucial for identifying issues quickly.

Concurrency configuration prevents multiple workflow runs from interfering with each other, maintaining test integrity.

69-85: LGTM: Well-structured test execution setup

The test execution setup is well-organized:

Custom actions for artifact download and test execution promote modularity.

The use of environment variables and secrets ensures proper configuration.

test_runner/fixtures/utils.py (2)

239-239: LGTM: Enhanced regex pattern for regression test outputs

The change to ATTACHMENT_NAME_REGEX now includes "regression.out" as a valid file name pattern, in addition to the existing "regression.diffs". This modification allows for capturing an additional type of regression test output, which aligns with the PR objective of enabling regression tests for a cloud-based Neon instance.

The rest of the pattern remains unchanged, maintaining compatibility with existing file types. This localized change enhances the flexibility of the attachment naming convention without disrupting the existing functionality.

239-239: Verify generation and inclusion of "regression.out" files

The change to ATTACHMENT_NAME_REGEX allows for the inclusion of "regression.out" files in the Allure report. To ensure this change is fully effective:

Verify that the systems running the regression tests are now generating "regression.out" files when appropriate.

Confirm that these "regression.out" files are being correctly attached to the Allure report by the allure_attach_from_dir function.

This verification will ensure that the new file type is being properly utilized in the testing process.
test_runner/cloud_regress/test_cloud_regress.py (3)
41-45: Use safe SQL query construction for subscription operations

Interpolating sub[0] directly into SQL statements using f-strings can lead to SQL injection vulnerabilities if sub[0] contains malicious content. Use the psycopg2.sql module to safely construct SQL queries with identifiers.

[security]

Modify the code to use psycopg2.sql for safer query construction:
+        from psycopg2 import sql
         for sub in subscriptions:
-            regress_cur.execute(f"ALTER SUBSCRIPTION {sub[0]} DISABLE")
+            regress_cur.execute(sql.SQL("ALTER SUBSCRIPTION {} DISABLE").format(sql.Identifier(sub[0])))
-            regress_cur.execute(f"ALTER SUBSCRIPTION {sub[0]} SET (slot_name = NONE)")
+            regress_cur.execute(sql.SQL("ALTER SUBSCRIPTION {} SET (slot_name = NONE)").format(sql.Identifier(sub[0])))
-            regress_cur.execute(f"DROP SUBSCRIPTION {sub[0]}")
+            regress_cur.execute(sql.SQL("DROP SUBSCRIPTION {}").format(sql.Identifier(sub[0])))
             regress_conn.commit()
61-61: Use parameterized queries when dropping roles

Directly interpolating role into SQL statements can pose a security risk if role contains unexpected characters. Utilize the psycopg2.sql module to safely include identifiers in your SQL queries.

[security]

Update the code to safely construct the SQL statement:
+        from psycopg2 import sql
         for role in roles:
-            cur.execute(f"DROP ROLE {role}")
+            cur.execute(sql.SQL("DROP ROLE {}").format(sql.Identifier(role)))
58-59: Avoid logging potentially sensitive information

Logging role names might expose sensitive information. Consider omitting or anonymizing the role names in the logs to enhance security.

[security]

Modify the log statement:
-        log.info("Role found: %s", role[0])
+        log.info("Extra role found and will be dropped.")

.github/workflows/cloud-regress.yml

test_runner/cloud_regress/test_cloud_regress.py

a-masterov added 2 commits August 9, 2024 15:51

First attempt

418ccba

Add regress.so to the image

82ba115

a-masterov and others added 27 commits August 14, 2024 10:55

Merge branch 'main' into amasterov/regress-arm

224564e

Add regress.so to the image

1dba884

Add regress.so to the image

e42dbae

renew patches

236e855

renew patches

95ef3e8

renew patches

5315a78

renew patches

8e90dba

Merge branch 'main' into amasterov/regress-arm

b1c5330

renew patches

fc89b66

renew patches

362f411

New patch

9b0e277

Merge branch 'main' into amasterov/regress-arm

b3d90a7

New patch

0c6b34b

Add python script, rename patch file

8fb8ec5

Change the patch file

e2921e3

Change the python file

d4f656d

Fix the trailing space

5a4a2ae

Add the workflow file

ecf20bb

change on:

8959cb1

Add patch

e8775dd

Fix a syntax error

1645011

debug

6b5d33d

debug

a07fda3

debug

b2af44f

directory change

173aef9

fix an obvious error

c7dde2e

debug

c14d53b

a-masterov and others added 3 commits September 18, 2024 15:25

Revert "Switch the submodule branch for tests"

9353b8e

This reverts commit 234d328.

Merge branch 'main' into amasterov/regress-arm

bf008b8

change submodules

f08e6ab

bayandin reviewed Sep 18, 2024

View reviewed changes

.github/workflows/cloud-regress.yml Outdated Show resolved Hide resolved

.github/workflows/cloud-regress.yml Outdated Show resolved Hide resolved

test_runner/cloud_regress/test_cloud_regress.py Show resolved Hide resolved

.github/workflows/cloud-regress.yml Outdated Show resolved Hide resolved

a-masterov and others added 3 commits September 18, 2024 16:39

Update .github/workflows/cloud-regress.yml

6b90ea8

Change from the step to the env variable Co-authored-by: Alexander Bayandin <[email protected]>

Clarify ambiguous messages and comments

72c99f8

change local branch

feb6eaa

a-masterov requested a review from a team as a code owner September 19, 2024 10:26

a-masterov requested a review from hlinnaka September 19, 2024 10:26

a-masterov and others added 3 commits September 19, 2024 14:53

Revert "change submodules"

0e21105

This reverts commit f08e6ab.

Revert "change local branch"

494c60f

This reverts commit feb6eaa.

Merge branch 'main' into amasterov/regress-arm

df92c40

a-masterov requested a review from bayandin September 19, 2024 13:56

Merge branch 'main' into amasterov/regress-arm

e530b6e

This was referenced Sep 20, 2024

Fix remote extension download v16 neondatabase/postgres#500

Merged

Bump vendor/postgres to include extenision path fix #9076

Merged

hlinnaka pushed a commit that referenced this pull request Sep 20, 2024

Bump vendor/postgres to include extension path fix (#9076)

f03f7b3

This is a pre requisite for #8681

davidgomes pushed a commit that referenced this pull request Sep 21, 2024

Bump vendor/postgres to include extension path fix (#9076)

0a45c08

This is a pre requisite for #8681

lubennikovaav requested changes Sep 23, 2024

View reviewed changes

a-masterov and others added 2 commits September 23, 2024 13:11

A workaround was removed, TODO comments added.

a655f16

Merge branch 'main' into amasterov/regress-arm

c520743

a-masterov requested a review from lubennikovaav September 23, 2024 12:08

lubennikovaav approved these changes Sep 23, 2024

View reviewed changes

bayandin approved these changes Sep 23, 2024

View reviewed changes

Remove running on push, amend schedule

04f06b6

coderabbitai bot reviewed Sep 23, 2024

View reviewed changes

Fix some minor errors

0bbe5c4

a-masterov merged commit 91d9476 into main Sep 24, 2024
85 checks passed

a-masterov deleted the amasterov/regress-arm branch September 24, 2024 07:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add regression tests for a cloud-based Neon instance #8681

Add regression tests for a cloud-based Neon instance #8681

a-masterov commented Aug 9, 2024 •

edited

Loading

github-actions bot commented Aug 9, 2024 •

edited

Loading

Postgres 16

Postgres 14

coderabbitai bot commented Sep 23, 2024 •

edited

Loading

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (`.coderabbit.yaml`)

Documentation and Community

coderabbitai bot left a comment

Add regression tests for a cloud-based Neon instance #8681

Add regression tests for a cloud-based Neon instance #8681

Conversation

a-masterov commented Aug 9, 2024 • edited Loading

Problem

Summary of changes

Checklist before requesting a review

Checklist before merging

github-actions bot commented Aug 9, 2024 • edited Loading

5029 tests run: 4865 passed, 0 failed, 164 skipped (full report)

Postgres 16

Postgres 14

Code coverage* (full report)

coderabbitai bot commented Sep 23, 2024 • edited Loading

Walkthrough

Changes

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

coderabbitai bot left a comment

Choose a reason for hiding this comment

a-masterov commented Aug 9, 2024 •

edited

Loading

github-actions bot commented Aug 9, 2024 •

edited

Loading

coderabbitai bot commented Sep 23, 2024 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)