@komoroko

Problem

When a CSV file contains line breaks inside a cell, the generated Markdown table is corrupted: the embedded newline splits a single table row across several lines.
CSV files like this are reasonably common in real-world and enterprise datasets, so supporting them makes the conversion process more robust.

Example CSV

名前,年齢,住所
佐藤太郎,30,東京
三木英子,25,大阪
髙橋淳,35,名古屋
鈴木一郎,"28

cellB5",福岡

In this case, the line break inside the quoted cell causes the Markdown table structure to break.
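The failure can be reproduced with a minimal sketch. The previous converter code is not shown in this PR description, so `naive_markdown_table` below is a hypothetical stand-in that assumes cell text is written into table rows unchanged; the CSV is a shortened version of the example above.

```python
import csv
import io

# Shortened version of the example CSV; the last record has a quoted
# cell that contains two line breaks.
CSV_TEXT = '名前,年齢,住所\n佐藤太郎,30,東京\n鈴木一郎,"28\n\ncellB5",福岡\n'


def naive_markdown_table(csv_text: str) -> str:
    """Hypothetical naive converter: copies cell text into rows as-is."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, body = rows[0], rows[1:]
    lines = [
        "| " + " | ".join(header) + " |",
        "| " + " | ".join("---" for _ in header) + " |",
    ]
    lines += ["| " + " | ".join(row) + " |" for row in body]
    return "\n".join(lines)


rows = list(csv.reader(io.StringIO(CSV_TEXT)))
table = naive_markdown_table(CSV_TEXT)

# The CSV parser sees 3 logical records (header plus 2 data rows),
# so a valid table would have 4 lines including the separator row.
print(len(rows))
# The raw newlines in the cell push the output to 6 physical lines,
# which breaks the last table row apart.
print(len(table.splitlines()))
```

The CSV parser itself reads the quoted cell correctly; under this assumption, the breakage comes from emitting the raw newline into a format where each table row must fit on one line.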

Solution

To address this issue, the CSV-to-Markdown conversion now uses pandas-based HTML conversion, consistent with the existing handling of xlsx and xls files. With this approach, multiline cells are handled correctly and the table structure is preserved.
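As a rough illustration of the pandas-based step, the sketch below reads a shortened version of the example CSV and renders it as an HTML table. It is not the code from this PR: the variable names and the `dtype=str` choice are assumptions made for the example, and the later HTML-to-Markdown step is left out because the PR description does not show it.

```python
import io

import pandas as pd

# Shortened version of the example CSV with a multiline quoted cell.
CSV_TEXT = '名前,年齢,住所\n佐藤太郎,30,東京\n鈴木一郎,"28\n\ncellB5",福岡\n'

# dtype=str is an assumption for this sketch: it keeps every cell as text
# so the multiline value is not mixed with inferred numeric types.
df = pd.read_csv(io.StringIO(CSV_TEXT), dtype=str)

# The quoted cell is parsed as one value, so there are 2 data rows.
multiline_cell = df.iloc[1, 1]

# Render the frame as an HTML table; each record stays inside its own <tr>,
# whatever line breaks its cells contain.
html = df.to_html(index=False)
print(html)
```

Because HTML marks rows with `<tr>` elements rather than with line endings, the embedded newline no longer affects where a row begins or ends.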

Result

  • CSV files containing line breaks within individual cells are now converted into valid Markdown tables.
  • Behavior is consistent with Excel (.xlsx, .xls) conversions.
  • Markdown generation for CSV inputs is more robust.

@komoroko
Author

@microsoft-github-policy-service agree
