Suggest add an option to ignore special encoding characters

Hi, this tool works well in many cases. But I found two problems.

1. Encoding problem

If a file contains other encoding characters, e.g., Chinese characters and ½, an exception will occur in _extract_comments_ method.

I added "errors='ignore'" in the following statement on my local computer, and it can ignore the above special characters and continue to extract the rest characters of a comment.
```python
def extract_comments(filename, mime=None):
    with open(filename, 'r', errors='ignore') as code: 
```
So I think we can provide this option to users and let them determine to ignore or not.

2. Complex string

The tool throws an exception when parser [this](https://github.com/88250/symphony/blob/master/src/main/java/org/b3log/symphony/Server.java) java file. I found the cause may be the complex string in line 99. 

Thanks for your tool, it helps me a lot. Hope better~

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Suggest add an option to ignore special encoding characters #20

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Suggest add an option to ignore special encoding characters #20

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions