14-01-2021 дата публикации
Номер: US20210011937A1
Принадлежит:
A method comprising receiving digital documents, a query statement, and a summary length constraint; identifying, for each of said digital documents, a sentence subset, based, at least in part, on said query statement, a modified version of said summary length constraint, and a first set of quality objectives, generating, for each of said sentence subsets, a random forest representation; iteratively (i) sampling, from each of said random forest representations, a plurality of tokens to create a corresponding candidate document summary, based, at least in part, on weights assigned to each of said tokens, (ii) assigning a quality ranking to said candidate document summary, based, at least in part, on said first set of quality objectives and a second set of quality objectives, and (iii) adjusting said weights, based, at least in part, on said quality rankings; and outputting a highest ranking said candidate document as a compressed summary. 1. A system comprising:at least one hardware processor; and receive, as input, one or more digital documents, a query statement, and a summary length constraint,', 'automatically identify, for each of said one or more digital documents, a sentence subset, based, at least in part, on said query statement, a modified version of said summary length constraint, and a first set of quality objectives,', 'automatically generate, for each of said sentence subsets, a random forest representation,', 'iteratively:', '(i) automatically sample, from each of said random forest representations, a plurality of tokens to create a corresponding candidate document summary, based, at least in part, on weights assigned to each of said tokens,', '(ii) automatically assign a quality ranking to said candidate document summary, based, at least in part, on said first set of quality objectives and a second set of quality objectives, and', '(iii) automatically adjust said weights, based, at least in part, on said quality rankings, and', 'automatically output ...
Подробнее