[BACK] Support running on sample input and run in CI by acornet · Pull Request #46 · dataforgoodfr/13_eclaireur_public

acornet · 2025-02-20T22:48:12Z

Add test config to run against a test sample input.
- read / write intermediary results to other locations
Add GH action to run the pipeline
- skip pre-commit and test run for changes in front/
- skip test run for changes in data-analyst
Fix side effect of save_csv function, which was modifying the dataframe

acornet · 2025-02-20T23:30:00Z

@cgoudet qu'en penses-tu? c'était deux fois plus rapide sur mon mac, 5 minutes j'ai peur que cela soit un peu long.

acornet · 2025-02-21T12:37:18Z

@cgoudet j'ai fait quelques optimisation supplémentaires et désormais la CI passe en 3 minutes, ce qui acceptable selon moi :)

acornet · 2025-02-21T13:09:03Z

note: splitting the workflow in two, to be more granular on when things are ran

cgoudet

Je suis assez sceptique sur la strategie. Au final on a un code avec énormément de ìf TestHelper.usingFullInput()`. Cela nuit à la lisibilité du code et le code est très dépendant du fait d'être en phase de test.

Je pense qu'il faut réorganizer le test voir le code pour que le code lui meme soit agnostique d'être en phase test, dégradé ou prod.

Voici une contre proposition :

le test se déclenche via pytest d'un fichier du type tests/back/test_end_to_end.py
On fourni à l'entrée de la pipeline un fichier de configuration dédié : les fichiers d'input comme les chemins de sauvegarde.
On mock les appels API : ton travail pour remplacer des appels web par des lectures de fichier est nickel pour ca.
On mock get_data_path par un fichier temporaire avec tempfile.

A dispo pour échanger!

cgoudet · 2025-02-21T19:59:05Z

back/data/test/ofgl-base-communes-consolidee.test.csv

@@ -0,0 +1,145 @@
+Exercice;Outre-mer;Code Insee 2023 Région;Nom 2023 Région;Code Insee 2023 Département;Nom 2023 Département;Code Siren 2023 EPCI;Nom 2023 EPCI;Strate population 2023;Commune rurale;Commune de montagne;Commune touristique;Tranche revenu par habitant;Présence QPV;Code Insee 2023 Commune;Nom 2023 Commune;Catégorie;Code Siren Collectivité;Code Insee Collectivité;Libellé Budget;Agrégat;Montant BP;Montant BA;Montant flux BP-BA;Montant;Montant en millions;Population totale;Montant en € par habitant;Compte 2023 Disponible;ordre_analyse1_section1;ordre_analyse1_section2;ordre_analyse1_section3;ordre_analyse2_section1;ordre_analyse2_section2;ordre_analyse2_section3;ordre_analyse3_section1;ordre_analyse3_section2;ordre_analyse3_section3;ordre_analyse4_section1;annee_join;Population totale du dernier exercice;siren


Je conseillerais de mettre ce fichier dans tests/back/fixtures/. La bonne pratique est de ne pas mélanger le code et les tests. Et techniquement ce fichier est un input pour un test.

cgoudet · 2025-02-21T20:02:02Z

back/scripts/loaders/base_loader.py

+            # Get the content type of the file from the headers
+            response = requests.head(file_url)
+            content_type = response.headers.get("content-type")
+            # logger.info(f"Content type : {content_type}")


This is dead code and can be removed.

cgoudet · 2025-02-21T20:05:02Z

back/scripts/utils/dataframe_operation.py

+
+def normalize_column_names(df):
+    """This modify the dataframe in place."""
+    df.columns = [


C'est une mauvaise pratique de modifier des datasets en place.

Il faudrait plutot return df.rename(columns=lambda x : re.sub(r"[.-]", "_", c.lower()).

Après je valide le fait d'avoir cette logique dans une fonction dédiée.

Peux tu ajouter les type hint tand que l'on y est?

back/scripts/utils/files_operation.py

acornet

merci beaucoup pour les retours @cgoudet , je suis d'accord avec toi que la solution n'était pas satisfaite et difficile à maintenir. j'ai changé d'approche et j'ai capitalisé sur le travail de @RiwsPy dans #47 en créant un fichier de config de test, ce qui minimise les changements apporté dans le code. dis moi ce que tu en penses!

back/scripts/utils/files_operation.py

cgoudet

Minimal style comments.

back/scripts/utils/dataframe_operation.py

back/scripts/test-utils/select_test_citites.py

cgoudet · 2025-02-24T12:25:41Z

back/scripts/test-utils/select_test_citites.py

+    Path.cwd() / "back" / "data" / "test" / "ofgl-base-communes-consolidee.test.csv",
+    sep=";",
+    index=False,
+)


Maybe make this module callable by wrapping it into a function and add the entreypoint at the end.

acornet force-pushed the back/ac/test-mode branch 2 times, most recently from 5bea548 to 039d8f0 Compare February 20, 2025 23:01

acornet changed the base branch from main to back/ac/simple-sql-export February 20, 2025 23:02

acornet force-pushed the back/ac/test-mode branch from 1f34898 to ad281e9 Compare February 20, 2025 23:26

acornet requested a review from cgoudet February 20, 2025 23:27

acornet marked this pull request as ready for review February 20, 2025 23:27

acornet force-pushed the back/ac/test-mode branch from 4a3a582 to 5bcbe14 Compare February 20, 2025 23:38

Base automatically changed from back/ac/simple-sql-export to main February 21, 2025 10:05

acornet force-pushed the back/ac/test-mode branch from 5bcbe14 to edcbfc2 Compare February 21, 2025 10:10

acornet force-pushed the back/ac/test-mode branch from 605ae65 to cd80e73 Compare February 21, 2025 13:04

acornet force-pushed the back/ac/test-mode branch 3 times, most recently from 6022559 to 2ea5fcb Compare February 21, 2025 15:41

cgoudet reviewed Feb 21, 2025

View reviewed changes

acornet force-pushed the back/ac/test-mode branch from 2ea5fcb to d811a87 Compare February 24, 2025 11:58

Select test cities script

b836bcd

acornet force-pushed the back/ac/test-mode branch from d811a87 to 02c0771 Compare February 24, 2025 11:58

acornet added 3 commits February 24, 2025 13:08

Create test config

b35975f

Fix save_csv side effect

e1aa75b

Add GH action

0de4a49

acornet force-pushed the back/ac/test-mode branch from 02c0771 to 78a26bb Compare February 24, 2025 12:09

acornet commented Feb 24, 2025

View reviewed changes

back/scripts/utils/files_operation.py Show resolved Hide resolved

do not modify in place

1eb036c

acornet force-pushed the back/ac/test-mode branch from 78a26bb to 1eb036c Compare February 24, 2025 12:12

acornet requested a review from cgoudet February 24, 2025 12:17

cgoudet approved these changes Feb 24, 2025

View reviewed changes

paths

7837ef1

acornet added 2 commits February 24, 2025 13:36

nit PR

b1f7a57

README

c395e1b

acornet merged commit a92a7f4 into main Feb 24, 2025
2 checks passed

acornet deleted the back/ac/test-mode branch February 24, 2025 12:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BACK] Support running on sample input and run in CI#46

[BACK] Support running on sample input and run in CI#46
acornet merged 8 commits intomainfrom
back/ac/test-mode

acornet commented Feb 20, 2025 •

edited

Loading

Uh oh!

acornet commented Feb 20, 2025

Uh oh!

acornet commented Feb 21, 2025

Uh oh!

acornet commented Feb 21, 2025

Uh oh!

cgoudet left a comment

Uh oh!

cgoudet Feb 21, 2025

Uh oh!

cgoudet Feb 21, 2025

Uh oh!

cgoudet Feb 21, 2025

Uh oh!

Uh oh!

acornet left a comment

Uh oh!

Uh oh!

cgoudet left a comment

Uh oh!

Uh oh!

Uh oh!

cgoudet Feb 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -0,0 +1,145 @@
		Exercice;Outre-mer;Code Insee 2023 Région;Nom 2023 Région;Code Insee 2023 Département;Nom 2023 Département;Code Siren 2023 EPCI;Nom 2023 EPCI;Strate population 2023;Commune rurale;Commune de montagne;Commune touristique;Tranche revenu par habitant;Présence QPV;Code Insee 2023 Commune;Nom 2023 Commune;Catégorie;Code Siren Collectivité;Code Insee Collectivité;Libellé Budget;Agrégat;Montant BP;Montant BA;Montant flux BP-BA;Montant;Montant en millions;Population totale;Montant en € par habitant;Compte 2023 Disponible;ordre_analyse1_section1;ordre_analyse1_section2;ordre_analyse1_section3;ordre_analyse2_section1;ordre_analyse2_section2;ordre_analyse2_section3;ordre_analyse3_section1;ordre_analyse3_section2;ordre_analyse3_section3;ordre_analyse4_section1;annee_join;Population totale du dernier exercice;siren

Conversation

acornet commented Feb 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

acornet commented Feb 20, 2025

Uh oh!

acornet commented Feb 21, 2025

Uh oh!

acornet commented Feb 21, 2025

Uh oh!

cgoudet left a comment

Choose a reason for hiding this comment

Uh oh!

cgoudet Feb 21, 2025

Choose a reason for hiding this comment

Uh oh!

cgoudet Feb 21, 2025

Choose a reason for hiding this comment

Uh oh!

cgoudet Feb 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

acornet left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cgoudet left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

cgoudet Feb 24, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

acornet commented Feb 20, 2025 •

edited

Loading