Merge pull request #1 from chinese-poetry/master

Update data source
Kenn Zhang
2020-04-23 16:31:21 +08:00
committed by GitHub
35 changed files with 14157 additions and 14150 deletions

1
.gitignore vendored Normal file
View File

@@ -0,0 +1 @@
.idea

View File

@@ -1,7 +1,6 @@
language: python
python:
- "2.7"
- "3.6"
- "3.7"
install:
- pip install flake8 -r requirements.txt
before_script: flake8 . --count --select=E9,F63,F7,F82 --show-source --statistics

View File

@@ -7,7 +7,7 @@
<h2 align="center">chinese-poetry: The Most Comprehensive Database of Classical Chinese Poetry and Literature</h2>
<p align="center">
<a href="https://travis-ci.org/chinese-poetry/chinese-poetry" rel="nofollow">
<a href="https://travis-ci.com/chinese-poetry/chinese-poetry" rel="nofollow">
<img height="28px" alt="Build Status" src="https://img.shields.io/travis/chinese-poetry/chinese-poetry?style=for-the-badge" style="max-width:100%;">
</a>
<a href="https://github.com/chinese-poetry/chinese-poetry/blob/master/LICENSE">
@@ -24,6 +24,7 @@
</a>
</p>
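The most comprehensive database of classical Chinese literature, containing 55,000 Tang poems, 260,000 Song poems, 21,000 Song ci, and other classical collections. It covers nearly 14,000 poets of the Tang and Song dynasties and 1,500 ci writers of the Northern and Southern Song. The data comes from the internet.
**Why build this repository?** Classical poetry is a treasure of the Chinese nation and of the whole world, and we should pass it on. Printed collections exist, but most people do not own them, so in a sense these vast anthologies remain out of reach. Electronic copies are easy to duplicate, which is why this open-source database was created. It is distributed as JSON so that you can start your own project easily.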
@@ -90,7 +91,7 @@
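- Submit a PR directly, or discuss in an issue, to improve this database. In principle any non-religious classical verse form is welcome; controversial data requires a community vote to decide whether it is included. When a PR corrects a line of verse, please cite the source. For more rules, see the [contribution guidelines](https://github.com/chinese-poetry/chinese-poetry/wiki/%E5%8F%82%E4%B8%8E%E8%B4%A1%E7%8C%AE%E8%A7%84%E8%8C%83).
- If you cannot take part directly, you can support and motivate me to keep improving this database through a [recurring Patreon sponsorship](https://www.patreon.com/jackeygao). If you prefer not to sponsor on a recurring basis, you can make a one-time donation via [Alipay](https://github.com/jackeyGao/JackeyGao.github.io/blob/master/static/images/alipay.png) or a [WeChat reward code](https://github.com/jackeyGao/JackeyGao.github.io/blob/master/static/images/wechat.png) (please leave your email address in the note).
- If you cannot take part directly, you can support and motivate me to keep improving this database through a [recurring Patreon sponsorship](https://www.patreon.com/jackeygao). If you prefer not to sponsor on a recurring basis, you can make a one-time donation via [Alipay](https://github.com/jackeyGao/JackeyGao.github.io/blob/master/static/images/alipay.png) or a [WeChat reward code](https://github.com/jackeyGao/JackeyGao.github.io/blob/master/static/images/wechat.jpg) (please leave your email address in the note).
- For suggestions or complaints, email me at gaojunqi@outlook.com.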
@@ -98,7 +99,7 @@
### Sponsors
**xber1986**
[上海逆行信息科技](http://www.desmix.com/)
### Contributors

File diff suppressed because it is too large (11 files)

View File

@@ -90,7 +90,7 @@
"一棹横江,问讯盟鸥,太守谓谁。",
"道皇华使者,光风洒落。",
"元宵三五,乐与民俱。",
"宝金鞯,玉梅钗燕,斗鸭阑干花影嬉。",
"宝金鞯,玉梅钗燕,斗鸭阑干花影嬉。",
"人迎笑,似玉京春浅,长是灯时。",
"风流不减人知。",
"算岳牧词人谁似之。",
@@ -104,7 +104,7 @@
"author": "林实之",
"paragraphs": [
"客星堂下水,碧浮空、烟树几重重。",
"想故人当日,论情蓬,际会云龙。",
"想故人当日,论情蓬,际会云龙。",
"底事泥涂轩冕,不肯作三公。",
"千仞钓江浒,此意谁同。",
"应笑赤松黄石,效痴儿成事,犹自言功。",
@@ -346,7 +346,7 @@
"横江一抹是平沙。",
"沙上几千家。",
"得到人家尽处,依然水接天涯。",
"危栏送目,翩翩去,点点归鸦。",
"危栏送目,翩翩去,点点归鸦。",
"渔唱不知何处,多应只在芦花。"
],
"rhythmic": "朝中措"
@@ -446,7 +446,7 @@
"大华□,□□□。",
"今古□,□陈迹。",
"甚牛山□□,□□□□。",
"□□□嫌□薄,高怀□□□□□。",
"□□□嫌□薄,高怀□□□□□。",
"□□□、黄鹤□□□,□相识。"
],
"rhythmic": "满江红",
@@ -3622,7 +3622,7 @@
"故里山遥春霭碧。",
"为想繁枝,清梦何曾息。",
"缧带霜英人不摘。",
"纷纷日暮飘席。",
"纷纷日暮飘席。",
"休抱离肠凭酒力。",
"只有轻纨,依约应传得。",
"白发未归空自惜。",
@@ -7308,7 +7308,7 @@
{
"author": "无名氏",
"paragraphs": [
"今夜荼风起。",
"今夜荼风起。",
"应是玉消琼碎。",
"淡荡满城春,恼破愁人春睡。",
"须醉。",
@@ -7320,7 +7320,7 @@
{
"author": "无名氏",
"paragraphs": [
"司春有序,排次到荼。",
"司春有序,排次到荼。",
"远预报,在庭知。",
"蕊珠宫里晨妆罢,披香殿下晓班齐。",
"探花正、驱使问,菊花期。",
@@ -7475,7 +7475,7 @@
"蜡灯春酒风光夕。",
"锦浪龙须花六尺。",
"月波寒。",
"玉琅。",
"玉琅。",
"无情又是,华星送宝鞍。"
],
"rhythmic": "梅花引"
@@ -8999,7 +8999,7 @@
{
"author": "无名氏",
"paragraphs": [
"杏花著雨胭脂透。"
"杏花著雨胭脂透。"
],
"rhythmic": "失调名"
},

File diff suppressed because it is too large (2 files)

View File

@@ -6,7 +6,7 @@
"先借椒盘劝金斗。",
"坐间和气,压尽一番梅柳。",
"掖庭频寓直,君恩厚。",
"天两宫,南山齐寿。",
"天两宫,南山齐寿。",
"况有仙丹在公手。",
"论功医国,合在药王之右。",
"不妨千岁饮,长生酒。"
@@ -432,8 +432,8 @@
"author": "霍安人",
"paragraphs": [
"十月小春天,梅飘香细。",
"九叶尧已呈瑞。",
"寿阳仙子,暂降羽衣环。",
"九叶尧已呈瑞。",
"寿阳仙子,暂降羽衣环。",
"林间风味,别人难比。",
"齐眉共庆,劝声鼎沸。",
"有子知书继家世。",

File diff suppressed because it is too large (7 files)

View File

@@ -13,13 +13,13 @@
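Quan Tang Shi (《全唐诗》) and Quan Song Shi (《全宋诗》) are stored in Traditional Chinese. Convert them yourself if needed, but note that converted characters do not always fit the context.
This collection still contains many errors that need correcting; PRs with fixes are welcome. My energy takes priority, but the spirit of Yugong moving the mountains remains.
This collection still contains many errors that need correcting; PRs with fixes are welcome. My energy is limited, but the spirit of Yugong moving the mountains remains.
## Data layout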
*poet.tang.[0-99000].json*
*poet.tang.[0-57000].json*
*poet.song.[0-57000].json*
*poet.song.[0-254000].json*
Each JSON file contains 1,000 poems.
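The naming scheme above can be sketched in Python as follows. This is a minimal illustration, not code from the repository: the helper names `shard_names` and `load_shard` are hypothetical, and it assumes that offsets advance in steps of 1000 and that each file holds a top-level JSON array.

```python
import json
from pathlib import Path


def shard_names(prefix, last_offset, step=1000):
    """Return the file names prefix.0.json ... prefix.<last_offset>.json.

    Assumption: offsets advance by `step` (1000 poems per file).
    """
    return [f"{prefix}.{offset}.json" for offset in range(0, last_offset + 1, step)]


def load_shard(path):
    """Parse one shard and return its records (assumed to be a JSON array)."""
    # Pass the encoding explicitly so the platform default is not used.
    with Path(path).open(encoding="utf-8") as handle:
        records = json.load(handle)
    if not isinstance(records, list):
        raise ValueError(f"{path}: expected a JSON array")
    return records
```

Under these assumptions, `shard_names("poet.tang", 57000)` lists the Tang shards from `poet.tang.0.json` through `poet.tang.57000.json`, and each name can be passed to `load_shard` once joined with the data directory.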
@@ -57,7 +57,7 @@
{
"name": "太宗皇帝",
"desc": "帝姓李氏,諱世民,神堯次子,聰明英武。貞觀之治,庶幾成康,功德兼隆。由漢以來,未之有也。而銳情經術,初建秦邸,即開文學館,召名儒十八人爲學士。既即位,殿左置弘文館,悉引內學士,番宿更休。聽朝之間,則與討論典籍,雜以文詠。或日昃夜艾,未嘗少怠。詩筆草隸,卓越前古。至於天文秀發,沈麗高朗,有唐三百年風雅之盛,帝實有以啓之焉。在位二十四年,諡曰文。集四十卷。館閣書目,詩一卷,六十九首。今編詩一卷。"
},
}
]
```

View File

@@ -1717,7 +1717,7 @@
"穿花蛺蝶深深見,點水蜻蜓款款飛。",
"傳與風光共流轉,暫時相賞莫相違。"
]
},
}
]
},
{

View File

@@ -10,7 +10,7 @@
"chapter": "一 東",
"paragraphs": [
"雲對雨,雪對風,晚照對晴空。來鴻對去燕,宿鳥對鳴蟲。三尺劍,六鈞弓,嶺北對江東。人間清暑殿,天上廣寒宮。兩岸曉煙楊柳綠,一園春雨杏花紅。兩鬢風霜,途次早行之客;一蓑煙雨,溪邊晚釣之翁。",
"沿對革,異對同,白叟對黃童。江風對海霧,牧子對漁翁。顏巷陋,阮途窮,冀北對遼東。池中濯足水,門外打頭風。帝講經同泰寺,漢皇置酒未央宮。塵慮縈心,懶撫七絃綠綺;霜華滿鬢,羞看百鍊青銅。",
"沿對革,異對同,白叟對黃童。江風對海霧,牧子對漁翁。顏巷陋,阮途窮,冀北對遼東。池中濯足水,門外打頭風。帝講經同泰寺,漢皇置酒未央宮。塵慮縈心,懶撫七絃綠綺;霜華滿鬢,羞看百鍊青銅。",
"貧對富,塞對通,野叟對溪童。鬢皤對眉綠,齒皓對脣紅。天浩浩,日融融,佩劍對彎弓。半溪流水綠,千樹落花紅。野渡燕穿楊柳雨,芳池魚戲芰荷風。女子眉纖,額下現一彎新月;男兒氣壯,胸中吐萬丈長虹。"
]
},

View File

@@ -10,10 +10,10 @@
"chapter": "天文",
"paragraphs": [
"混沌初開,乾坤始奠。氣之輕清上浮者爲天,氣之重濁下凝者爲地。日月五星,謂之七政;天地與人,謂之三才。日爲衆陽之宗,月乃太陰之象。虹名螮蝀,乃天地之淫氣;月裏蟾蜍是月魄之精光。",
"風欲起而石燕飛,天將雨而商羊舞。旋風名爲羊角,閃電號曰雷鞭。青女乃霜之神,素娥即月之號。雷部至捷之鬼曰律令,雷部推車之女阿香。雲師系是豐隆,雪神乃是滕六。歘火、謝仙,俱掌雷火;飛廉、箕伯,悉是風神。",
"風欲起而石燕飛,天將雨而商羊舞。旋風名爲羊角,閃電號曰雷鞭。青女乃霜之神,素娥即月之號。雷部至捷之鬼曰律令,雷部推車之女阿香。雲師系是豐隆,雪神乃是滕六。歘火、謝仙,俱掌雷火;飛廉、箕伯,悉是風神。",
"列缺乃電之神,望舒是月之御。甘霖、甘澍,僅指時雨;玄穹、彼蒼,悉稱上天。雪花飛六出,先兆豐年;日上已三竿,乃雲時晏。蜀犬吠日,比人所見甚稀;吳牛喘月,笑人畏懼過甚。",
"望切者,若雲霓之望;恩深者,如雨露之恩。參商二星,其出沒不相見;牛女兩宿,惟七夕一相逢。后羿妻,奔月宮而爲嫦娥;傅說死,其精神託於箕尾。披星戴月,謂早夜之奔馳;沐雨櫛風,謂風塵之勞苦。事非有意,譬如雲出無心;恩可遍施,乃曰陽春有腳。",
"饋物致敬,曰敢效獻曝之忱;託人轉移,曰全賴回天之力。感救死之恩,曰再造;誦再生之德,曰二天。勢易盡者若冰山,事相懸者如天壤。晨星謂賢人廖落,雷同謂言語相符。心多過慮,何異杞人憂天;事不量力,不殊夸父追。",
"饋物致敬,曰敢效獻曝之忱;託人轉移,曰全賴回天之力。感救死之恩,曰再造;誦再生之德,曰二天。勢易盡者若冰山,事相懸者如天壤。晨星謂賢人廖落,雷同謂言語相符。心多過慮,何異杞人憂天;事不量力,不殊夸父追。",
"如夏日之可畏,是謂趙盾;如冬日之可愛,是謂趙衰。齊婦含冤,三年不雨;鄒衍下獄,六月飛霜。父仇不共戴天,子道須當愛日。",
"盛世黎民,嬉遊於光天化日之下;太平天子,上召夫景星慶雲之祥。夏時大禹在位,上天雨金;春秋孝經既成,赤虹化玉。箕好風,畢好雨,比庶人願欲不同;風從虎,雲從龍,比君臣會合不偶。雨暘時若,系是休徵;天地交泰,稱斯盛世。"
]
@@ -26,7 +26,7 @@
"金城湯池,謂城池之鞏固;礪山帶河,乃封建之誓盟。帝都曰京師,故鄉曰梓里。蓬萊弱水,惟飛仙可渡;方壺員嶠,乃仙子所居。滄海桑田,謂世事之多變;河清海晏,兆天下之昇平。水神曰馮夷,又曰陽侯,火神曰祝融,又曰回祿。海神曰海若,海眼曰尾閭。",
"望人包容曰海涵,謝人思澤曰河潤。無繫累者曰江湖散人,負豪氣者曰湖海之士。問舍求田,原無大志;掀天揭地,方是奇才。憑空起事,謂之平地風波;獨立不移,謂之中流砥柱。黑子、彈丸,漫言至小之邑;咽喉、右臂,皆言要害之區。",
"獨立難持,曰一木焉能支大廈;英雄自恃,曰丸泥亦可封函關。事先敗而後成,曰失之東隅,收之桑榆;事將成而終止,曰爲山九仞,功虧一簣。以蠡測海,喻人之見小;精衛銜石,比人之徒勞。跋涉謂行路艱難,康莊謂道路平坦。磽地曰不毛之地,美田曰膏腴之田。",
"得物無所用,曰如獲石田;爲己大成,日誕登道岸。淄澠之滋味可辨,涇渭之清濁當分。泌水樂飢,隱居不仕;東山高臥,謝職求安。聖人出則黃河清,太守廉則越石見。美俗曰仁裏,惡俗曰互鄉。里名勝母,曾子不入;邑號朝歌,墨翟回車。",
"得物無所用,曰如獲石田;爲己大成,曰诞登道岸。淄澠之滋味可辨,涇渭之清濁當分。泌水樂飢,隱居不仕;東山高臥,謝職求安。聖人出則黃河清,太守廉則越石見。美俗曰仁裏,惡俗曰互鄉。里名勝母,曾子不入;邑號朝歌,墨翟回車。",
"擊壤而歌,堯帝黎民之自得;讓畔而耕,文王百姓之相推。費長房有縮地之方,秦始皇有鞭石之法。堯有九年之水患,湯有七年之旱災。商鞅不仁而阡陌開,夏桀無道而伊洛竭。道不拾遺,由在上有善政;海不揚波,知中國有聖人。"
]
},

View File

@@ -1 +1 @@
pytest==3.1.0
pytest==5.3.2

View File

@@ -227,7 +227,7 @@
"section": "召南",
"content": [
"野有死麕,白茅包之。有女怀春,吉士诱之。",
"林有朴,野有死鹿。白茅纯束,有女如玉。",
"林有朴,野有死鹿。白茅纯束,有女如玉。",
"舒而脱脱兮,无感我帨兮,无使尨也吠。"
]
},
@@ -702,7 +702,7 @@
"chapter": "国风",
"section": "王风",
"content": [
"中谷有蓷,暵其干矣。有女仳离,慨其矣。慨其矣,遇人之艰难矣。",
"中谷有蓷,暵其干矣。有女仳离,慨其矣。慨其矣,遇人之艰难矣。",
"中谷有蓷,暵其修矣。有女仳离,条其歗矣。条其歗矣,遇人之不淑矣。",
"中谷有蓷,暵其湿矣。有女仳离,啜其泣矣。啜其泣矣,何嗟及矣。"
]
@@ -3387,8 +3387,8 @@
},
{
"title": "那",
"chapter": "颂",
"section": "之什",
"chapter": "颂",
"section": "之什",
"content": [
"猗与那与!置我鞉鼓。奏鼓简简,衎我烈祖。汤孙奏假,绥我思成。",
"鞉鼓渊渊,嘒嘒管声。既和且平,依我磬声。于赫汤孙!穆穆厥声。",
@@ -3398,8 +3398,8 @@
},
{
"title": "烈祖",
"chapter": "颂",
"section": "之什",
"chapter": "颂",
"section": "之什",
"content": [
"嗟嗟烈祖!有秩斯祜。申锡无疆,及尔斯所。既载清酤,赉我思成。",
"亦有和羹,既戒既平。鬷假无言,时靡有争。绥我眉寿,黄耇无疆。",
@@ -3409,8 +3409,8 @@
},
{
"title": "玄鸟",
"chapter": "颂",
"section": "之什",
"chapter": "颂",
"section": "之什",
"content": [
"天命玄鸟,降而生商,宅殷土芒芒。古帝命武汤,正域彼四方。",
"方命厥后,奄有九有。商之先后,受命不殆,在武丁孙子。武丁孙子,武王靡不胜。",
@@ -3420,8 +3420,8 @@
},
{
"title": "长发",
"chapter": "颂",
"section": "之什",
"chapter": "颂",
"section": "之什",
"content": [
"浚哲维商,长发其祥。洪水芒芒,禹敷下土方。外大国是疆,幅陨既长。有娀方将,帝立子生商。",
"玄王桓拨,受小国是达,受大国是达。率履不越,遂视既发。相士烈烈。海外有截。",
@@ -3434,8 +3434,8 @@
},
{
"title": "殷武",
"chapter": "颂",
"section": "之什",
"chapter": "颂",
"section": "之什",
"content": [
"挞彼殷武,奋伐荆楚。深入其阻,裒荆之旅。有截其所,汤孙之绪。",
"维女荆楚,居国南乡。昔有成汤,自彼氐羌,莫敢不来享,莫敢不来王。曰商是常。",

View File

@@ -6,7 +6,7 @@
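## Notes
Distributed in Traditional Chinese; each subdirectory is a chapter.
Distributed in Traditional Chinese; each file is a separate chapter.
## Data format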

View File

@@ -1,5 +1,4 @@
#! -*- coding: utf-8 -*-
# import sqlite3
# -*- coding: utf-8 -*-
import os
import json
import sys
@@ -15,10 +14,11 @@ def check_json(f, _dir):
    with open(filepath) as file:
        try:
            _ = json.loads(file.read())
            sys.stdout.write(f"{filepath} 校验成功")
            return True
        except:
            sys.stderr.write(traceback.format_exc())
            assert False, u"校验(%s)失败" % f
            assert False, f"{filepath} 校验失败"
def __check_path__(path):
@@ -41,3 +41,8 @@ test_nantang2 = functools.partial(__check_path__, u'./wudai/nantang/')
test_youmengying = functools.partial(__check_path__, u'./youmengying/')
test_sishuwujing = functools.partial(__check_path__, u'./sishuwujing/')
test_yuanqu = functools.partial(__check_path__, u'./yuanqu/')
test_mengxue = functools.partial(__check_path__, u'./mengxue')
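For reference, the per-directory check these tests perform can be sketched as a standalone helper. This is an illustration under stated assumptions, not the repository's implementation: the function name `invalid_json_files` is hypothetical, files are read as UTF-8 explicitly, and only parse and decode errors are caught instead of using a bare `except`.

```python
import json
from pathlib import Path


def invalid_json_files(directory):
    """Return (file name, error message) pairs for *.json files that fail to parse."""
    failures = []
    for path in sorted(Path(directory).glob("*.json")):
        try:
            json.loads(path.read_text(encoding="utf-8"))
        except (json.JSONDecodeError, UnicodeDecodeError) as exc:
            # Record the failure and keep going so one run reports every bad file.
            failures.append((path.name, str(exc)))
    return failures
```

A check like this would flag the kind of error fixed elsewhere in this commit, where a trailing comma before a closing `]` was removed: Python's `json` module rejects such input. Collecting failures rather than asserting on the first one is a design choice that trades early exit for a complete report.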

File diff suppressed because one or more lines are too long