feat: support markdown #2943

ikatyang · 2017-09-30T16:00:58Z

Main:

Use remark-parse with commonmark option enabled (GFM is based on commonmark)
Commonmark Spec
enforce
- header: #-style
- emphasis: _
- strong: **
- listItem: -(always), +(switch between two style for continuous lists)
- listItem (ordered): 1. (always), 1)(switch between two style for continuous lists)
- thematicBreak: - - -(always), * * * (in list)
- indented code block: 4 whitespaces
- fenced code block: ```(always), ~~~(in js template)
- table: leading/trailing| and align table cell separator and respect alignment
fill word by word (whitespace-based).

See the formatted part of the snapshot file.

export[xxx] = `

unformatted
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
formatted

`;

Side:

extend align's n to be string available. (use for quoteblock)

Question:

Are the styles above ok?
~~Should name this parser markdown (current) or remark?~~ Ans: markdown

NOTE:

the content of ~~code~~/html/yaml still remains the same.
~~does not support break (trailing two spaces).~~
remark is available on astexplorer.

TODO:


format the content of code
~~escape html entities (<, >, &)~~ html entities remain the same (<, etc.).
pass the test case: https://raw.githubusercontent.com/adamschwartz/github-markdown-kitchen-sink/master/TEST.md
add a special case for  not to print additional line break
use ~~~-style code block for markdown in template string
pass all the examples in the spec using AST_COMPARE (607/616)
figure out a better way to escape necessary characters
anything else?

Closes #2444

ikatyang · 2017-10-07T13:57:50Z

@Graham42

That's a test snapshot file, the top part is unformatted, the bottom part is formatted. 😅

Looks like:

export[xxx] = `

unformatted
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
formatted

`;

Graham42 · 2017-10-07T14:06:46Z

O! well that explains it. That looks much prettier 💯

azu · 2017-10-11T13:59:52Z

I have a question.
Does this markdown formatter support multi-bye language like Japanese or emoji?

It seem that current implementaion has two problems.

First, current implementation will be broken in Chinese-Japanese-Korean(CJK) and emoji.
Because, some printing algorithm depened on string.length.

For example, "❤️".length is 2. Following algorithm jolt out of alignment.

https://github.com/prettier/prettier/pull/2943/files#diff-385bd78c43a57ae55923bfea744a5ae3R359

This issue is known as East Asian Width or Unicode problem.

I know that East Asian Width problem is very difficult.( I don't know perfect solution...)
Some library get unicode length.

Second, Following tokenizer can not tokenize non-English text.
Because, non-English text like cheniese doesn't put a space between words.

    .split(/(\s+)/g)

https://github.com/prettier/prettier/pull/2943/files#diff-0f09e16c6ee7c5a40fef83b315027002R149

It will be resolved by using tokenizer like nlcst-parse-japanese, parse-english, and rakutenma.
But, It is not realistic and is not perfect. because toknizer is heavy weight(File size is large and Parse speed is slow)

Toknizer example is Text Tokenizer · Hivemall User Manual

Thanks.

ikatyang · 2017-10-11T15:03:02Z

@azu

Thanks for the suggestion, I've already thought about it and just merged #3003 to fix the printer first, the CJK support is working in progress now, but it should be in a separate PR I think, this PR is somehow too large.

I think there's no need to use tokenizer for CJK, since AFAIK they can be broke in any place, and that's how CJK books printed, e.g.

一串很長很長很長很長很長很長很長很長很長很長很長的中文字，一串很長很長很長很長很長很
長很長很長很長很長很長的中文字，一串很長很長很長很長很長很長很長很長很長很長很長的中
文字，一串很長很長很長很長很長很長很長很長很長很長很長的中文字。

For the string width thing, use string-width should be enough, but we have to disable the stripAnsi feature.

@azz

Do we have any concern to merge this PR? I'd like to send some followed up PRs instead of pushing to the current branch since this PR is somehow too large to review easily.

azz · 2017-10-11T22:48:18Z

I've been incrementally reviewing the changes so we're fine to merge!

vjeux · 2017-10-12T05:06:53Z

So cool!

nhoizey · 2017-10-12T10:05:55Z

Amazing!

Is it possible to also support Mardown dialects, such as Kramdown, used by Jekyll?

ikatyang · 2017-10-13T05:31:51Z

@nhoizey

It seems kramdown does not follow the commonmark spec so that it should have its own printer instead of merging into this one.

For new language support, see #3017 (comment).

nhoizey · 2017-10-13T08:40:44Z

@ikatyang thanks, I'll see what I can do!

lipis · 2017-10-13T13:48:36Z

Now we should format all *.md files in this project! :)

lydell · 2017-10-13T15:43:58Z

@lipis Good idea!

lipis · 2017-10-13T20:59:51Z

#3022 quite a few things to take care of though

azz · 2017-10-14T03:58:50Z

Big Data research part two: https://bigquery.cloud.google.com/savedquery/652929483875:69be015faec8444c8a2e7d055eeb32f8

`* item`	`- item`	`+ item`
642,106	413,627	10,954

type	regex
`* item`	`/^\s\\s/gm`
`- item`	`/^\s*-\s/gm`
`+ item`	`/^\s*\+\s/gm`

* item wins!

ikatyang · 2017-10-14T04:05:49Z

There's also the +-style list item, we should do research for it too, and take the top two results to be used in:

listItem: -(always), +(switch between two style for continuous lists)

(current)

<ul>
  <li>123</li>
  <li>123</li>
</ul>
<ul>
  <li>456</li>
  <li>456</li>
</ul>

azz · 2017-10-14T04:13:36Z

+ is not very popular. Edited my previous comment.

revelt · 2018-03-23T09:46:11Z

src/printer-markdown.js

+    contents.push(rowContents);
+  }, "children");
+
+  const columnMaxWidths = contents.reduce(


We should also limit the max length of the columns. For example, All-Contributors tables contain HTML code and table dash columns processed by Prettier get very very long.

Can you open a new issue so we can track this better?

ikatyang added 30 commits September 21, 2017 23:03

feat(markdown): inital implementation

d25469f

feat(markdown): support strong

50aeda5

fix: add missing default value

63de920

feat(markdown): support inlineCode

41d0f43

feat: support delete

f1cce5b

feat: support link

a59dbcc

feat: support image

1ae442a

feat: support blockquote

c8308ee

feat: support heading

1e41713

feat: support code

703db97

feat: support yaml

956eca7

feat: support html

3085364

feat: support list

57d549b

feat: support thematicBreak

0b42021

feat: support table

9fb9f71

feat: support linkReference

9b85280

feat: support imageReference

032a1c0

feat: support definition

fba4662

feat: support footnote

a982f35

feat: support footnoteReference

219356e

feat: support footnoteDefinition

cecd425

test(cli): update snapshots

bbd7837

refactor: extract SINGLE_LINE_NODE_TYPES

efa96e3

refactor: printChildren

709093f

fix: correct newlines

f764d72

test: add trailing newline

f31025a

fix: blockquote formatting

9a99ac7

fix: node types

5eead94

fix: break line correctly

a5c64cb

fix: remove unnecessary properties to make AST_COMPARE happy

7c15481

azz approved these changes Oct 11, 2017

View reviewed changes

azz merged commit 9f6f3e7 into prettier:master Oct 11, 2017

ikatyang deleted the feat/markdown branch October 11, 2017 23:04

ikatyang mentioned this pull request Oct 12, 2017

Support CJK and emoji in markdown #3013

Closed

sergioramos mentioned this pull request Oct 12, 2017

format markdown yldio/joyent-portal#753

Merged

ikatyang mentioned this pull request Oct 14, 2017

Prettier Markdown files #3022

Closed

ikatyang mentioned this pull request Oct 14, 2017

Should use the most popular unordered list style #3025

Closed

SimenB mentioned this pull request Oct 28, 2017

Fix docs overlapping navigation jestjs/jest#4781

Merged

azz mentioned this pull request Nov 7, 2017

Markdown: Not all * is converted into _ #3170

Closed

azz mentioned this pull request Dec 21, 2017

Scrape github for prettier option usage #3538

Open

kachkaev mentioned this pull request Feb 8, 2018

Use Remark beautifier for markdown files by default Glavin001/atom-beautify#2004

Merged

6 tasks

revelt reviewed Mar 23, 2018

View reviewed changes

lydell mentioned this pull request Apr 14, 2018

Markdown table formatting causes merge conflicts. #4314

Closed

lock bot added the locked-due-to-inactivity Please open a new issue and fill out the template instead of commenting. label Jul 5, 2018

lock bot locked as resolved and limited conversation to collaborators Jul 5, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support markdown #2943

feat: support markdown #2943

ikatyang commented Sep 30, 2017 •

edited

ikatyang commented Oct 7, 2017 •

edited

Graham42 commented Oct 7, 2017

azu commented Oct 11, 2017 •

edited

ikatyang commented Oct 11, 2017

azz commented Oct 11, 2017

vjeux commented Oct 12, 2017

nhoizey commented Oct 12, 2017

ikatyang commented Oct 13, 2017

nhoizey commented Oct 13, 2017

lipis commented Oct 13, 2017 •

edited

lydell commented Oct 13, 2017

lipis commented Oct 13, 2017

azz commented Oct 14, 2017 •

edited

ikatyang commented Oct 14, 2017 •

edited

azz commented Oct 14, 2017

revelt Mar 23, 2018

j-f1 Mar 23, 2018

revelt Mar 23, 2018

feat: support markdown #2943

feat: support markdown #2943

Conversation

ikatyang commented Sep 30, 2017 • edited

ikatyang commented Oct 7, 2017 • edited

Graham42 commented Oct 7, 2017

azu commented Oct 11, 2017 • edited

ikatyang commented Oct 11, 2017

azz commented Oct 11, 2017

vjeux commented Oct 12, 2017

nhoizey commented Oct 12, 2017

ikatyang commented Oct 13, 2017

nhoizey commented Oct 13, 2017

lipis commented Oct 13, 2017 • edited

lydell commented Oct 13, 2017

lipis commented Oct 13, 2017

azz commented Oct 14, 2017 • edited

ikatyang commented Oct 14, 2017 • edited

azz commented Oct 14, 2017

revelt Mar 23, 2018

Choose a reason for hiding this comment

j-f1 Mar 23, 2018

Choose a reason for hiding this comment

revelt Mar 23, 2018

Choose a reason for hiding this comment

ikatyang commented Sep 30, 2017 •

edited

ikatyang commented Oct 7, 2017 •

edited

azu commented Oct 11, 2017 •

edited

lipis commented Oct 13, 2017 •

edited

azz commented Oct 14, 2017 •

edited

ikatyang commented Oct 14, 2017 •

edited