Enter the repo's root directory and run "git clone https://github.com/google-research/albert.git".
Then run "python setup.py develop" to install the KnowledgeExtractor package.
import knowledgeextractor as ke

config_file = "xxxx"  # e.g. repo/config/crf_albert_model.json
nermodel = ke.nermodels.crf_albert.NERModel(config_file)
query_data = {"guid": "test1", "text": "this is just a test snippet!"}
query_list = [query_data]
results = nermodel.predict(query_list)
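A small sketch of batching several texts into one call; the guid values and texts here are arbitrary examples, and this assumes predict accepts any list of such dicts:

texts = ["first snippet", "second snippet"]
# Each query is a dict with a caller-chosen "guid" and the raw "text".
query_list = [{"guid": f"q{i}", "text": t} for i, t in enumerate(texts)]
results = nermodel.predict(query_list)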
Training needs JSON records like the following (the example label_type 疾病和诊断 means "disease and diagnosis"):
{ "originalText":"xxxxx",
"entities": [ {
"label_type": "疾病和诊断",
"start_pos": 19,
"end_pos": 27
},
...
] }
You can write such JSON strings into a data.json file, one JSON record per line.
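For example, a minimal sketch of writing such a file with the standard json module (the records list is a hypothetical stand-in for your own annotations):

import json

# Hypothetical example records in the format shown above.
records = [
    {
        "originalText": "xxxxx",
        "entities": [
            {"label_type": "疾病和诊断", "start_pos": 19, "end_pos": 27},
        ],
    },
]

with open("data.json", "w", encoding="utf-8") as f:
    for record in records:
        # ensure_ascii=False keeps non-ASCII labels readable in the file
        f.write(json.dumps(record, ensure_ascii=False) + "\n")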
Run repo/test/crf_gen_taggers.py; in its main module, specify max_sequence_length and the source file (the path to data.json). Then run split_files.py in the same path to generate the result files, i.e. the directory where the generated train.json, dev.json, and test.json are stored.
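split_files.py itself is not reproduced here; as a rough illustration only, an equivalent split might look like the following (the 8/1/1 ratio and the shuffling are assumptions, not the script's documented behavior):

import random

with open("data.json", encoding="utf-8") as f:
    lines = [line for line in f if line.strip()]

random.shuffle(lines)
n_train = int(len(lines) * 0.8)
n_dev = int(len(lines) * 0.1)

# Write each slice back out as one JSON record per line.
splits = {
    "train.json": lines[:n_train],
    "dev.json": lines[n_train:n_train + n_dev],
    "test.json": lines[n_train + n_dev:],
}
for name, subset in splits.items():
    with open(name, "w", encoding="utf-8") as out:
        out.writelines(subset)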
Run repo/test/run_crf.sh, for which you need to prepare the initial weights of a Chinese ALBERT model (assume your model is stored in path/sharedModels/albert_base_zh/, which contains the weights, vocabulary, and config file for ALBERT).
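Before running the script, it can help to confirm the checkpoint directory is complete; a trivial check (the path is the placeholder used above):

import os

model_dir = "path/sharedModels/albert_base_zh/"
# The directory should contain the checkpoint weights, the vocabulary,
# and the ALBERT config file.
for name in sorted(os.listdir(model_dir)):
    print(name)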
The service is exposed at http://ip:port/methodCore (the port is specified in tornado_server.json, and ip is the IP address of the machine that runs the service).
{ "query_list":[
{"guid":"id1 str",
"text":"a test text-1."},
{"guid":"id2 str",
"text":"a test text-2."}
]
}
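Assuming the endpoint accepts this body as a JSON POST (the server code is not shown here, so treat this as a sketch), a minimal client with the third-party requests library:

import requests

# "ip" and "port" are placeholders; the port comes from tornado_server.json.
url = "http://ip:port/methodCore"
payload = {
    "query_list": [
        {"guid": "id1 str", "text": "a test text-1."},
        {"guid": "id2 str", "text": "a test text-2."},
    ]
}
response = requests.post(url, json=payload)
result = response.json()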
{ "predictions":[
{"words":["list of words of text-1"],
"tags":["list of tags for each word]},
{
"words":["list of words of text-2"],
"tags":["list of tags for each word]
}
],
"query_list":[....(the query list defined above)]
}
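Since the response echoes query_list, each prediction can be paired back with its originating query; this assumes the two lists come back in the same order:

# Continues from the client sketch above.
for query, prediction in zip(result["query_list"], result["predictions"]):
    print(query["guid"])
    for word, tag in zip(prediction["words"], prediction["tags"]):
        print(" ", word, tag)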