
Summarising scientific publications from a tech-transfer perspective#

This is a simplified example illustrating how great-ai is used in practice at ScoutinScience. The subpages show great-ai in action by going over the lifecycle of fine-tuning and deploying a BERT-based software service.

Proprietary data

The purpose of this example is to show you different ways in which great-ai can assist you. The exact NLP task being solved is not central. Because of this, and because appropriate training data is difficult to obtain, the proprietary dataset used for the experiments is not shared.

Objectives#

  1. You will see how the great_ai.utilities module can integrate into your Data Science workflow.
  2. You will see how great_ai.large_file can be used to version and store your trained model.
  3. You will see how GreatAI should be used to prepare your model for a robust and responsible deployment.
  4. You will see multiple ways of customising your deployment.

Overview#

One of the core features of the ScoutinScience platform is summarising research papers from a tech-transfer perspective. In short, extractive summarisation is the preferred approach: a binary classifier is trained on clients' judgements of how interesting each sentence is. Thus, the classifier's inputs are individual sentences, and the expected output is a binary label showing whether a sentence is "worthy" of appearing in the tech-transfer summary. Explaining each decision is imperative, since ScoutinScience is committed to applying only explainable AI (XAI) methods wherever feasible.
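To make this framing concrete, here is a schematic sketch of such an extractive pipeline. The naive sentence splitter and the dummy scoring function are placeholders, not the platform's actual implementation:

```python
# Schematic sketch: split a paper into sentences, score each one with the
# binary classifier, and keep the sentences predicted to be "worthy".
from typing import Callable, List


def summarise(paper: str, score: Callable[[str], float], threshold: float = 0.5) -> List[str]:
    sentences = [s.strip() for s in paper.split(".") if s.strip()]  # naive splitter
    return [s for s in sentences if score(s) >= threshold]


# Dummy scorer standing in for the trained classifier
print(summarise(
    "We review related work. Our prototype was licensed to an industry partner.",
    score=lambda s: 1.0 if "prototype" in s else 0.0,
))
```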

Success

You are ready to start the tutorial. Feel free to return to the summary section once you're finished.

Summary#

Data notebook#

We load and analyse the data: we calculate the inter-rater reliability and assess the feasibility of an AI-based approach by measuring the accuracy of a trivial baseline method.
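A minimal sketch of these two checks is shown below. The rater arrays stand in for the proprietary annotations, and Cohen's kappa is used here only as one common choice of reliability metric:

```python
from sklearn.metrics import accuracy_score, cohen_kappa_score

rater_a = [1, 0, 1, 1, 0, 0, 1, 0]  # "worthy" labels from one annotator
rater_b = [1, 0, 0, 1, 0, 0, 1, 1]  # labels from another annotator

# Inter-rater reliability: agreement between annotators beyond chance
print("Cohen's kappa:", cohen_kappa_score(rater_a, rater_b))

# Trivial baseline: always predict the majority class
majority = max(set(rater_a), key=rater_a.count)
print("Baseline accuracy:", accuracy_score(rater_a, [majority] * len(rater_a)))
```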

Training notebook#

We simply fine-tune SciBERT.

After training and evaluation, the model is exported using great_ai.save_model. For more details, check out the configuration how-to page.
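For orientation, a condensed sketch of that step could look like the following. The dataset, hyperparameters, and model key are illustrative, and the save_model call is assumed from great-ai's public examples rather than copied from the notebook:

```python
from datasets import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

from great_ai import save_model  # assumed import path of the export helper

MODEL_NAME = "allenai/scibert_scivocab_uncased"
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME, num_labels=2)

# Tiny illustrative dataset: sentences with a binary "worthy" label
data = Dataset.from_dict({
    "text": ["We patented a novel catalyst.", "Section 2 reviews prior work."],
    "label": [1, 0],
}).map(lambda row: tokenizer(row["text"], truncation=True, padding="max_length", max_length=64))

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="checkpoints", num_train_epochs=1),
    train_dataset=data,
)
trainer.train()

# Version and upload the fine-tuned weights so the deployment can fetch them later
save_model(model, key="scibert-sentence-classifier")  # key name is illustrative
```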

Deployment notebook#

We customise the GreatAI configuration, create custom caching for the model and implement an inference function that can be hardened by wrapping it in a GreatAI instance. We also extract the attention weights as a quasi-explanation.

Finally, we test the model's inference function through the GreatAI dashboard. The only thing left is to deploy the hardened service properly.
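As a rough sketch (with decorator names assumed from great-ai's public examples and a hypothetical model key), the hardened inference function might be structured like this:

```python
from great_ai import GreatAI, use_model


@GreatAI.create  # wraps the plain function into a monitored, traced service
@use_model("scibert-sentence-classifier")  # hypothetical key; injects `model`
def predict_interestingness(sentence: str, model) -> float:
    """Return the probability that `sentence` is worth keeping in the summary."""
    # Stand-in scoring logic; the real notebook runs the fine-tuned SciBERT and
    # additionally extracts attention weights as a quasi-explanation.
    return float(model.predict_proba([sentence])[0][1])
```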

Additional files#

Some additional files are required for deploying the service built in the notebooks: for example, the configuration file for S3 and MongoDB, and a Dockerfile for building a custom image. These are gathered and shown on a separate page.

