𓆤DBay 𓁛Mirage
DBay plugin to mirror, parse and process text files (HTML, CSV, ...) in SQL
This module has been pulled from a preliminary version in dbay-rustybuzz.
To Do
- table mrg_datasources
  - [–] there might be datasources we do want to import data from without mirroring them, so there should be a flag for that.
  - [–] there may be direct input or results of HTTP queries etc., so better use URLs instead of file system paths
- HTML:
[+] empty lines between paragraphs should be preserved
[–] initial blank lines are missing from HTML output
[–] what to do for trailing blank lines?
[–] tag registry so we can decide whether a tag
  - is block
  - allows parsing inside (cf. <script>, <code>)
[–] run all inserts to mirage HTML in a single transaction
[–] consider adding CFG to walk_par_rows() to indicate whether to keep or to skip empty/blank lines
[–] implement datasources with direct text input
[–] in mrg_wspars, use field mrg_*mirror.mat or constant txt = '' instead of function call
[+] accept <!doctype> tags (turn into comments)
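
One possible shape for the tag registry item above, as a minimal TypeScript sketch. Nothing in it is taken from the plugin: the names TagInfo, TagRegistry, register and lookup are hypothetical, as are the default for unregistered tags (inline, contents parsed) and the sample entries.

```typescript
// Hypothetical sketch; none of these names exist in DBay Mirage.
interface TagInfo {
  isBlock: boolean;     // does the tag open a block-level element?
  parseInside: boolean; // should markup inside the tag be parsed?
}

// Assumed fallback for tags that were never registered.
const DEFAULT_TAG_INFO: TagInfo = { isBlock: false, parseInside: true };

class TagRegistry {
  private readonly tags = new Map<string, TagInfo>();

  // Register a tag; properties left out fall back to the defaults.
  register(name: string, info: Partial<TagInfo>): this {
    this.tags.set(name.toLowerCase(), { ...DEFAULT_TAG_INFO, ...info });
    return this;
  }

  // Look up a tag case-insensitively; unknown tags get the defaults.
  lookup(name: string): TagInfo {
    return this.tags.get(name.toLowerCase()) ?? DEFAULT_TAG_INFO;
  }
}

// Sample entries, chosen for illustration only.
const registry = new TagRegistry()
  .register("p", { isBlock: true })
  .register("div", { isBlock: true })
  .register("script", { parseInside: false })
  .register("code", { parseInside: false });
```

A parser could then call registry.lookup(tagName) when it meets an opening tag and skip tokenizing the contents whenever parseInside is false.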