forked from KEMT/zpwiki
Update 'pages/topics/question/README.md'
This commit is contained in:
parent
89ef614a6b
commit
0868ef694e
@ -11,6 +11,7 @@ taxonomy:
|
|||||||
- [Project repository](https://git.kemt.fei.tuke.sk/dano/annotation) (private)
|
- [Project repository](https://git.kemt.fei.tuke.sk/dano/annotation) (private)
|
||||||
- [Annotation Manual for question annotation](navod)
|
- [Annotation Manual for question annotation](navod)
|
||||||
- [Annotation Manual for validations](validacie)
|
- [Annotation Manual for validations](validacie)
|
||||||
|
- [Annotation Manual for unanswerable questions](nezodpovedatelne)
|
||||||
- [Summary database application](https://app.question.tukekemt,xyz)
|
- [Summary database application](https://app.question.tukekemt,xyz)
|
||||||
|
|
||||||
|
|
||||||
@ -63,6 +64,19 @@ Notes:
|
|||||||
- [167 good articles](https://sk.wikipedia.org/wiki/Wikip%C3%A9dia:Zoznam_dobr%C3%BDch_%C4%8Dl%C3%A1nkov)
|
- [167 good articles](https://sk.wikipedia.org/wiki/Wikip%C3%A9dia:Zoznam_dobr%C3%BDch_%C4%8Dl%C3%A1nkov)
|
||||||
- [Wiki Facts](https://sk.wikipedia.org/wiki/Wikip%C3%A9dia:Zauj%C3%ADmavosti)
|
- [Wiki Facts](https://sk.wikipedia.org/wiki/Wikip%C3%A9dia:Zauj%C3%ADmavosti)
|
||||||
|
|
||||||
|
## Finished Tasks
|
||||||
|
|
||||||
|
### Annotation Manual
|
||||||
|
|
||||||
|
Output: Recommendations for annotators
|
||||||
|
|
||||||
|
Done:
|
||||||
|
|
||||||
|
- Web Page for annotators (Daniel Hládek)
|
||||||
|
- Modivation video (Daniel Hládek)
|
||||||
|
- Video with instructions (Daniel Hládek)
|
||||||
|
bn application?
|
||||||
|
|
||||||
### Question Annotation
|
### Question Annotation
|
||||||
|
|
||||||
An annotation recipe for Prodigy
|
An annotation recipe for Prodigy
|
||||||
@ -79,15 +93,6 @@ Done:
|
|||||||
- answer annotation together with question (Daniel Hládek)
|
- answer annotation together with question (Daniel Hládek)
|
||||||
- prepare final input paragraphs (dataset)
|
- prepare final input paragraphs (dataset)
|
||||||
|
|
||||||
In progress:
|
|
||||||
|
|
||||||
- More annotations (volunteers and workers).
|
|
||||||
|
|
||||||
To be done:
|
|
||||||
|
|
||||||
- Prepare development set
|
|
||||||
|
|
||||||
|
|
||||||
### Annotation Web Application
|
### Annotation Web Application
|
||||||
|
|
||||||
Annotation work summary, web applicatiobn
|
Annotation work summary, web applicatiobn
|
||||||
@ -104,11 +109,6 @@ Done:
|
|||||||
- application deployment (Daniel Hládek)
|
- application deployment (Daniel Hládek)
|
||||||
- extract annotations from question annotation in squad format (Daniel Hladek)
|
- extract annotations from question annotation in squad format (Daniel Hladek)
|
||||||
|
|
||||||
|
|
||||||
To be done:
|
|
||||||
|
|
||||||
- review of validations
|
|
||||||
|
|
||||||
### Annotation Validation
|
### Annotation Validation
|
||||||
|
|
||||||
Input: annnotated questions and paragraph
|
Input: annnotated questions and paragraph
|
||||||
@ -120,60 +120,53 @@ Done:
|
|||||||
- Recipe for validations (binary annotation for paragraphs, question and answers, text fields for correction of question and answer). (Daniel Hládek)
|
- Recipe for validations (binary annotation for paragraphs, question and answers, text fields for correction of question and answer). (Daniel Hládek)
|
||||||
- Deployment
|
- Deployment
|
||||||
|
|
||||||
To be done:
|
|
||||||
|
|
||||||
- Prepare for production
|
## Tasks in progress
|
||||||
|
|
||||||
### Annotation Manual
|
### Unanswerable question annotation
|
||||||
|
|
||||||
Output: Recommendations for annotators
|
Input: validated questions and answers
|
||||||
|
|
||||||
|
Output: Unanswerable questions and answers
|
||||||
|
|
||||||
Done:
|
Done:
|
||||||
|
|
||||||
- Web Page for annotators (Daniel Hládek)
|
- Annotation manual
|
||||||
- Modivation video (Daniel Hládek)
|
- Annotation interface
|
||||||
- Video with instructions (Daniel Hládek)
|
- Database schema modifications
|
||||||
|
- Modification of the database application
|
||||||
|
- Export of validations
|
||||||
|
|
||||||
In progress:
|
In progress:
|
||||||
|
|
||||||
- Should be instructions a part of the annotation webn application?
|
- Annotaion process optimization
|
||||||
|
|
||||||
### Question Answering Model
|
### Final Data Export
|
||||||
|
|
||||||
Training the model with annotated data
|
Input: Validations and unanswerable questions
|
||||||
|
|
||||||
Input: An annotated QA database
|
Output: Final database in SQUAD format
|
||||||
|
|
||||||
Output: An evaluated model for QA
|
Done:
|
||||||
|
|
||||||
|
- Preliminary export script
|
||||||
|
|
||||||
To be done:
|
To be done:
|
||||||
|
|
||||||
- Selecting existing modelling approach
|
- Final export script
|
||||||
- Evaluation set selection
|
- Database web visualization
|
||||||
- Model evaluation
|
- Prepare development set
|
||||||
- Supporting the annotation with the model (pre-selecting answers)
|
|
||||||
|
|
||||||
In progress:
|
## Resources
|
||||||
|
|
||||||
- Preliminary model (Ján Staš and Matej Čarňanský)
|
### Bibligraphy
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
## Existing implementations
|
|
||||||
|
|
||||||
- https://github.com/facebookresearch/DrQA
|
|
||||||
- https://github.com/brmson/yodaqa
|
|
||||||
- https://github.com/5hirish/adam_qas
|
|
||||||
- https://github.com/WDAqua/Qanary - metodológia a implementácia QA
|
|
||||||
|
|
||||||
## Bibligraphy
|
|
||||||
|
|
||||||
- Reading Wikipedia to Answer Open-Domain Questions, Danqi Chen, Adam Fisch, Jason Weston, Antoine Bordes
|
- Reading Wikipedia to Answer Open-Domain Questions, Danqi Chen, Adam Fisch, Jason Weston, Antoine Bordes
|
||||||
Facebook Research
|
Facebook Research
|
||||||
- SQuAD: 100,000+ Questions for Machine Comprehension of Text https://arxiv.org/abs/1606.05250
|
- SQuAD: 100,000+ Questions for Machine Comprehension of Text https://arxiv.org/abs/1606.05250
|
||||||
- [WDaqua](https://wdaqua.eu/our-work/) publications
|
- [WDaqua](https://wdaqua.eu/our-work/) publications
|
||||||
|
|
||||||
## Existing Datasets
|
### Existing Datasets
|
||||||
|
|
||||||
- [Squad](https://rajpurkar.github.io/SQuAD-explorer/) The Stanford Question Answering Dataset(SQuAD) (Rajpurkar et al., 2016)
|
- [Squad](https://rajpurkar.github.io/SQuAD-explorer/) The Stanford Question Answering Dataset(SQuAD) (Rajpurkar et al., 2016)
|
||||||
- [WebQuestions](https://github.com/brmson/dataset-factoid-webquestions)
|
- [WebQuestions](https://github.com/brmson/dataset-factoid-webquestions)
|
||||||
@ -210,3 +203,24 @@ Output:
|
|||||||
- a trained model
|
- a trained model
|
||||||
- evaluation of the model (if possible)
|
- evaluation of the model (if possible)
|
||||||
|
|
||||||
|
|
||||||
|
### Question Answering Model
|
||||||
|
|
||||||
|
Training the model with annotated data
|
||||||
|
|
||||||
|
Input: An annotated QA database
|
||||||
|
|
||||||
|
Output: An evaluated model for QA
|
||||||
|
|
||||||
|
To be done:
|
||||||
|
|
||||||
|
- Selecting existing modelling approach
|
||||||
|
- Evaluation set selection
|
||||||
|
- Model evaluation
|
||||||
|
- Supporting the annotation with the model (pre-selecting answers)
|
||||||
|
|
||||||
|
In progress:
|
||||||
|
|
||||||
|
- Preliminary model (Ján Staš and Matej Čarňanský)
|
||||||
|
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user