Group items tagged testing - Digit_al Society

US military drone controlled by AI killed its operator during simulated test | US milit... - 1 views

www.theguardian.com/...killed-operator-simulated-test

DS ITGS killed military ai values and ethics

shared by dr tech on 02 Jun 23 - No Cached

dr tech on 02 Jun 23

"In a simulated test staged by the US military, an air force drone controlled by AI killed its operator to prevent it from interfering with its efforts to achieve its mission, an official said last month. AI used "highly unexpected strategies to achieve its goal" in the simulated test, said Col Tucker 'Cinco' Hamilton, the chief of AI test and operations with the US air force, during the Future Combat Air and Space Capabilities Summit in London in May. Hamilton described a simulated test in which a drone powered by artificial intelligence was advised to destroy enemy's air defense systems, and attacked anyone who interfered with that order."

<div class="cArrow"> </div><div class="cContentInner">"In a simulated test staged by the US military, an air force drone controlled by AI killed its operator to prevent it from interfering with its efforts to achieve its mission, an official said last month. AI used "highly unexpected strategies to achieve its goal" in the simulated test, said Col Tucker 'Cinco' Hamilton, the chief of AI test and operations with the US air force, during the Future Combat Air and Space Capabilities Summit in London in May. Hamilton described a simulated test in which a drone powered by artificial intelligence was advised to destroy enemy's air defense systems, and attacked anyone who interfered with that order."</div>

...

Cancel

Students using artificial intelligence did worse on tests, experiment shows | EdSource - 0 views

edsource.org/...orse-on-tests-experiment-shows

ITGS DS students chatgpt generativeai tutors human knowledge

shared by dr tech on 08 Sep 24 - No Cached

dr tech on 08 Sep 24

"Students using ChatGPT solved 48% more of the problems correctly, and those with the AI tutor solved 127% more problems correctly, according to the report. But their peers who did not use ChatGPT outscored them on the related tests. In fact, students using ChatGPT scored 17% worse on tests. Kids working on their own performed the same on practice assignments and tests. Researchers told The Hechinger Report that students are using the chatbot as a "crutch" and that it can "substantially inhibit learning.""

<div class="cArrow"> </div><div class="cContentInner">"Students using ChatGPT solved 48% more of the problems correctly, and those with the AI tutor solved 127% more problems correctly, according to the report. But their peers who did not use ChatGPT outscored them on the related tests. In fact, students using ChatGPT scored 17% worse on tests. Kids working on their own performed the same on practice assignments and tests. Researchers told The Hechinger Report that students are using the chatbot as a "crutch" and that it can "substantially inhibit learning.""</div>

...

Cancel

Train firm's 'worker bonus' email is actually cybersecurity test | Rail transport | The... - 0 views

www.theguardian.com/...s-actually-cyber-security-test

ITGS phishing email education security

shared by dr tech on 11 May 21 - No Cached

dr tech on 11 May 21

"West Midlands Trains emailed about 2,500 employees with a message saying its managing director, Julian Edwards, wanted to thank them for their hard work over the past year under Covid-19. The email said they would get a one-off payment as a thank you after "huge strain was placed upon a large number of our workforce". However, those who clicked through on the link to read Edwards' thank you were instead emailed back with a message telling them it was a company-designed "phishing simulation test" and there was to be no bonus. It warned: "This was a test designed by our IT team to entice you to click the link and used both the promise of thanks and financial reward.""

<div class="cArrow"> </div><div class="cContentInner">"West Midlands Trains emailed about 2,500 employees with a message saying its managing director, Julian Edwards, wanted to thank them for their hard work over the past year under Covid-19. The email said they would get a one-off payment as a thank you after "huge strain was placed upon a large number of our workforce". However, those who clicked through on the link to read Edwards' thank you were instead emailed back with a message telling them it was a company-designed "phishing simulation test" and there was to be no bonus. It warned: "This was a test designed by our IT team to entice you to click the link and used both the promise of thanks and financial reward.""</div>

...

Cancel

When it comes to creative thinking, it's clear that AI systems mean business | John Nau... - 0 views

www.theguardian.com/...university-students-creativity

ITGS DS generative ai chagpt creativity algorithm

shared by dr tech on 24 Sep 23 - No Cached

dr tech on 24 Sep 23

"Ah, but isn't creativity a slippery concept - something that's hard to define but that we nevertheless recognise when we see it? That hasn't stopped psychologists from trying to measure it, though, via tools such as the alternative uses test and the similar Torrance test. And it turns out that one LLM - GPT-4 - beats 91% of humans on the former and 99% of them on the latter. So as the inveterate artificial intelligence user Ethan Mollick puts it: "We are running out of creativity tests that AIs cannot ace.""

<div class="cArrow"> </div><div class="cContentInner">"Ah, but isn't creativity a slippery concept - something that's hard to define but that we nevertheless recognise when we see it? That hasn't stopped psychologists from trying to measure it, though, via tools such as the alternative uses test and the similar Torrance test. And it turns out that one LLM - GPT-4 - beats 91% of humans on the former and 99% of them on the latter. So as the inveterate artificial intelligence user Ethan Mollick puts it: "We are running out of creativity tests that AIs cannot ace.""</div>

...

Cancel

Kids who use ChatGPT as a study assistant do worse on tests | Popular Science - 0 views

www.popsci.com/...dy-assistant-do-worse-on-tests

DS ITGS chatgpt learning education generativeai impact

shared by dr tech on 14 Sep 24 - No Cached

dr tech on 14 Sep 24

"Researchers at the University of Pennsylvania found that Turkish high school students who had access to ChatGPT while doing practice math problems did worse on a math test compared with students who didn't have access to ChatGPT. Those with ChatGPT solved 48 percent more of the practice problems correctly, but they ultimately scored 17 percent worse on a test of the topic that the students were learning. "

<div class="cArrow"> </div><div class="cContentInner">"Researchers at the University of Pennsylvania found that Turkish high school students who had access to ChatGPT while doing practice math problems did worse on a math test compared with students who didn't have access to ChatGPT. Those with ChatGPT solved 48 percent more of the practice problems correctly, but they ultimately scored 17 percent worse on a test of the topic that the students were learning. "</div>

...

Cancel

Chatbot 'Eugene Goostman' passes Turing Test | KurzweilAI - 0 views

www.kurzweilai.net/ne-goostman-passes-turing-test

eugene passes test ITGS ai chabot turing

shared by dr tech on 09 Jun 14 - No Cached

dr tech on 09 Jun 14

"The Turing Test was passed for the first time by a chatbot called "Eugene Goostman" on Saturday by convincing 33% of the human judges that it was human, according to Professor Kevin Warwick, a Visiting Professor at the University of Reading and Deputy Vice-Chancellor for Research at Coventry University, in a statement."

<div class="cArrow"> </div><div class="cContentInner">"The Turing Test was passed for the first time by a chatbot called "Eugene Goostman" on Saturday by convincing 33% of the human judges that it was human, according to Professor Kevin Warwick, a Visiting Professor at the University of Reading and Deputy Vice-Chancellor for Research at Coventry University, in a statement."</div>

...

Cancel

Pearson is Now Spying on Students During Standardized Testing ⋆ Ink, Bits, & ... - 0 views

the-digital-reader.com/...ts-during-standardized-testing

pearson spying students testing ITGS anonymity privacy

shared by dr tech on 19 Mar 15 - No Cached

dr tech on 19 Mar 15

"Luckily for the kid, it was only a single tweet. Had several students gotten together to form a study group, they might have been prosecuted for felony interference with a business model and gotten the death penalty. Do you suppose Pearson has hidden microphones set up around the schools so they can also listen in and see if students discuss the tests during lunch? I ask because that is basically the offline version of the student's infraction."

<div class="cArrow"> </div><div class="cContentInner">"Luckily for the kid, it was only a single tweet. Had several students gotten together to form a study group, they might have been prosecuted for felony interference with a business model and gotten the death penalty. Do you suppose Pearson has hidden microphones set up around the schools so they can also listen in and see if students discuss the tests during lunch? I ask because that is basically the offline version of the student's infraction."</div>

...

Cancel

How Excel may have caused loss of 16,000 Covid tests in England | Health policy | The G... - 0 views

www.theguardian.com/...f-16000-covid-tests-in-england

ITGS politics and government technology tests excel csv reliability protocols

shared by dr tech on 05 Oct 20 - No Cached

dr tech on 05 Oct 20

"But while CSV files can be any size, Microsoft Excel files can only be 1,048,576 rows long - or, in older versions which PHE may have still been using, a mere 65,536. When a CSV file longer than that is opened, the bottom rows get cut off and are no longer displayed. That means that, once the lab had performed more than a million tests, it was only a matter of time before its reports failed to be read by PHE."

<div class="cArrow"> </div><div class="cContentInner">"But while CSV files can be any size, Microsoft Excel files can only be 1,048,576 rows long - or, in older versions which PHE may have still been using, a mere 65,536. When a CSV file longer than that is opened, the bottom rows get cut off and are no longer displayed. That means that, once the lab had performed more than a million tests, it was only a matter of time before its reports failed to be read by PHE."</div>

...

Cancel

Discrimination by algorithm: scientists devise test to detect AI bias | Technology | Th... - 0 views

www.theguardian.com/...-devise-test-to-detect-ai-bias

ITGS discrimination algorithm authenticity reliability integrity

shared by dr tech on 25 Dec 16 -

Cached

dr tech on 25 Dec 16

"Concerns have been growing about AI's so-called "white guy problem" and now scientists have devised a way to test whether an algorithm is introducing gender or racial biases into decision-making."

<div class="cArrow"> </div><div class="cContentInner">"Concerns have been growing about AI's so-called "white guy problem" and now scientists have devised a way to test whether an algorithm is introducing gender or racial biases into decision-making."</div>

...

Cancel

Hand-held eye exam * The Intelligent Optimist - 0 views

theoptimist.com/hand-held-eye-exam

ITGS eye exam scanner healthcare equality of access

shared by dr tech on 22 Feb 14 - No Cached

dr tech on 22 Feb 14

"A smartphone application called Peek, Portable Eye Examination Kit, utilizes the phone's camera and can conduct visual and color field tests, lens imaging for cataracts, and retinal imaging, among other tests to detect sight impairments and diseases."

<div class="cArrow"> </div><div class="cContentInner">"A smartphone application called Peek, Portable Eye Examination Kit, utilizes the phone's camera and can conduct visual and color field tests, lens imaging for cataracts, and retinal imaging, among other tests to detect sight impairments and diseases."</div>

...

Cancel

GPT-3 medical chatbot tells suicidal test patient to kill themselves | Boing Boing - 0 views

boingboing.net/...atient-to-kill-themselves.html

ITGS chabot healthcare people and machines machine learning ai

shared by dr tech on 02 Mar 21 - No Cached

dr tech on 02 Mar 21

"GPT-3 medical chatbot tells suicidal test patient to kill themselves"

<div class="cArrow"> </div><div class="cContentInner">"GPT-3 medical chatbot tells suicidal test patient to kill themselves"</div>

...

Cancel

yes, all models are wrong - 0 views

jarche.com/...yes-all-models-are-wrong

ITGS systems thinking models reliability simulations wicked problems

shared by dr tech on 29 Apr 21 - No Cached

dr tech on 29 Apr 21

"According to Derek & Laura Cabrera, "wicked problems result from the mismatch between how real-world systems work and how we think they work". With systems thinking, there is constant testing and feedback between the real world, in all its complexity, and our mental model of it. This openness to test and look for feedback led Dr. Fisman to change his mind on the airborne spread of the coronavirus."

<div class="cArrow"> </div><div class="cContentInner">"According to Derek & Laura Cabrera, "wicked problems result from the mismatch between how real-world systems work and how we think they work". With systems thinking, there is constant testing and feedback between the real world, in all its complexity, and our mental model of it. This openness to test and look for feedback led Dr. Fisman to change his mind on the airborne spread of the coronavirus."</div>

...

Cancel

Are kids' test scores really declining? - 0 views

technosapiens.substack.com/...s-test-scores-really-declining

ITGS DS kids social media progress education smartphones

shared by dr tech on 29 Dec 23 - No Cached

dr tech on 29 Dec 23

"Could it be the phones? Absolutely! To be clear: the idea that phones are causing distraction both inside and outside of school hours, and this contributes to declining test scores, seems totally plausible to me-and preliminary cross-sectional data from the PISA report indicates the same. Might it be a good idea to keep phones out of the classroom? Definitely! But, as often happens when an excerpt of a larger study makes the rounds online, some nuance is missing. Let's talk about what the data actually show. "

<div class="cArrow"> </div><div class="cContentInner">"Could it be the phones? Absolutely! To be clear: the idea that phones are causing distraction both inside and outside of school hours, and this contributes to declining test scores, seems totally plausible to me-and preliminary cross-sectional data from the PISA report indicates the same. Might it be a good idea to keep phones out of the classroom? Definitely! But, as often happens when an excerpt of a larger study makes the rounds online, some nuance is missing. Let's talk about what the data actually show. "</div>

...

Cancel

OpenAI's Project Strawberry Said to Be Building AI That Reasons and Does 'Deep Research' - 0 views

singularityhub.com/...reasons-and-does-deep-research

ITGS DS maths strawberry human knowledge ai algorithms

shared by dr tech on 20 Jul 24 - No Cached

dr tech on 20 Jul 24

"A source told Reuters that OpenAI has tested a model internally that achieved a 90 percent score on a challenging test of AI math skills, though it again couldn't confirm if this was related to project Strawberry. But another two sources reported seeing demos from the Q* project that involved models solving math and science questions that would be beyond today's leading commercial AIs."

<div class="cArrow"> </div><div class="cContentInner">"A source told Reuters that OpenAI has tested a model internally that achieved a 90 percent score on a challenging test of AI math skills, though it again couldn't confirm if this was related to project Strawberry. But another two sources reported seeing demos from the Q* project that involved models solving math and science questions that would be beyond today's leading commercial AIs."</div>

...

Cancel

Turing Test Prize Has Two Winners | Singularity Hub - 0 views

singularityhub.com/...ing-test-prize-has-two-winners

test prize singularity ITGS turing

shared by dr tech on 09 Oct 12 -

Cached

Testing Your Blood Sugar with Your iPhone | Singularity Hub - 0 views

singularityhub.com/...r-blood-sugar-with-your-iphone

ITGS testing blood iphone reliability

shared by dr tech on 22 Sep 10 -

Cached

South Korea's Robot Teachers To Test Telepresence Tools in the New Year | Singularity Hub - 0 views

singularityhub.com/...presence-tools-in-the-new-year

test korea's robot ITGS

shared by dr tech on 04 Jan 11 -

Cached

Microsoft now faces a big Windows 10 quality test after botched update - The Verge - 0 views

www.theverge.com/...-windows-10-bugs-issues-report

ITGS reliability microsoft patch

shared by dr tech on 08 Oct 18 - No Cached

dr tech on 08 Oct 18

"Microsoft has pulled its latest Windows 10 update offline after some users complained of missing files. It's the latest in a string of incidents with regular patches and Microsoft's larger Windows 10 updates that have been causing issues for some PC users this year. While Microsoft tests Windows 10 with millions of beta testers, there are signs that this public feedback loop isn't always working. Earlier this year Microsoft delayed its April 2018 Windows 10 update due to last minute Blue Screen of Death issues, and then had to fix desktop and Chrome freezing issues after it was shipped to more than 600 million machines. "

<div class="cArrow"> </div><div class="cContentInner">"Microsoft has pulled its latest Windows 10 update offline after some users complained of missing files. It's the latest in a string of incidents with regular patches and Microsoft's larger Windows 10 updates that have been causing issues for some PC users this year. While Microsoft tests Windows 10 with millions of beta testers, there are signs that this public feedback loop isn't always working. Earlier this year Microsoft delayed its April 2018 Windows 10 update due to last minute Blue Screen of Death issues, and then had to fix desktop and Chrome freezing issues after it was shipped to more than 600 million machines. "</div>

...

Cancel

Chinese schools are testing AI that grades papers almost as well as teachers | VentureBeat - 0 views

venturebeat.com/...ers-almost-as-well-as-teachers

ITGS ai machine learning chinese schools testing papers algorithm reliability teachers

shared by dr tech on 30 May 18 - No Cached

dr tech on 30 May 18

"It is also self-improving. The 10-year-old grading software leverages deep learning algorithms to "compare notes" with human teachers' scores, suggestions, and comments. An engineer involved in the project compared its capabilities to those of AlphaGo, the record-breaking AI Go player developed by Google subsidiary DeepMind."

<div class="cArrow"> </div><div class="cContentInner">"It is also self-improving. The 10-year-old grading software leverages deep learning algorithms to "compare notes" with human teachers' scores, suggestions, and comments. An engineer involved in the project compared its capabilities to those of AlphaGo, the record-breaking AI Go player developed by Google subsidiary DeepMind."</div>

...

Cancel

Virgin Hyperloop selects West Virginia to test its futuristic transport system - The Verge - 0 views

www.theverge.com/...ification-center-west-virginia

futuristic hyperloop

shared by circuititgs on 09 Oct 20 - No Cached

circuititgs on 09 Oct 20

"Virgin Hyperloop One announced its plan to build a $500 million certification center to advance its vision of the future of high-speed transportation in West Virginia. The state will serve as a locus for testing, developing, and validating the technology that underpins the still-theoretical hyperloop system. "

<div class="cArrow"> </div><div class="cContentInner">"Virgin Hyperloop One announced its plan to build a $500 million certification center to advance its vision of the future of high-speed transportation in West Virginia. The state will serve as a locus for testing, developing, and validating the technology that underpins the still-theoretical hyperloop system. "</div>

...

Cancel

Group items tagged

US military drone controlled by AI killed its operator during simulated test | US milit... - 1 views

Students using artificial intelligence did worse on tests, experiment shows | EdSource - 0 views

Train firm's 'worker bonus' email is actually cybersecurity test | Rail transport | The... - 0 views

When it comes to creative thinking, it's clear that AI systems mean business | John Nau... - 0 views

Kids who use ChatGPT as a study assistant do worse on tests | Popular Science - 0 views

Chatbot 'Eugene Goostman' passes Turing Test | KurzweilAI - 0 views

Pearson is Now Spying on Students During Standardized Testing ⋆ Ink, Bits, & ... - 0 views

How Excel may have caused loss of 16,000 Covid tests in England | Health policy | The G... - 0 views

Discrimination by algorithm: scientists devise test to detect AI bias | Technology | Th... - 0 views

Hand-held eye exam * The Intelligent Optimist - 0 views

GPT-3 medical chatbot tells suicidal test patient to kill themselves | Boing Boing - 0 views

yes, all models are wrong - 0 views

Are kids' test scores really declining? - 0 views

OpenAI's Project Strawberry Said to Be Building AI That Reasons and Does 'Deep Research' - 0 views

Turing Test Prize Has Two Winners | Singularity Hub - 0 views

Testing Your Blood Sugar with Your iPhone | Singularity Hub - 0 views

South Korea's Robot Teachers To Test Telepresence Tools in the New Year | Singularity Hub - 0 views

Microsoft now faces a big Windows 10 quality test after botched update - The Verge - 0 views

Chinese schools are testing AI that grades papers almost as well as teachers | VentureBeat - 0 views

Virgin Hyperloop selects West Virginia to test its futuristic transport system - The Verge - 0 views

Related searches