Skip to main content

Home/ Digit_al Society/ Group items tagged testing

Rss Feed Group items tagged

dr tech

US military drone controlled by AI killed its operator during simulated test | US milit... - 1 views

  •  
    "In a simulated test staged by the US military, an air force drone controlled by AI killed its operator to prevent it from interfering with its efforts to achieve its mission, an official said last month. AI used "highly unexpected strategies to achieve its goal" in the simulated test, said Col Tucker 'Cinco' Hamilton, the chief of AI test and operations with the US air force, during the Future Combat Air and Space Capabilities Summit in London in May. Hamilton described a simulated test in which a drone powered by artificial intelligence was advised to destroy enemy's air defense systems, and attacked anyone who interfered with that order."
dr tech

Students using artificial intelligence did worse on tests, experiment shows | EdSource - 0 views

  •  
    "Students using ChatGPT solved 48% more of the problems correctly, and those with the AI tutor solved 127% more problems correctly, according to the report. But their peers who did not use ChatGPT outscored them on the related tests. In fact, students using ChatGPT scored 17% worse on tests.  Kids working on their own performed the same on practice assignments and tests.  Researchers told The Hechinger Report that students are using the chatbot as a "crutch" and that it can "substantially inhibit learning.""
dr tech

Train firm's 'worker bonus' email is actually cybersecurity test | Rail transport | The... - 0 views

  •  
    "West Midlands Trains emailed about 2,500 employees with a message saying its managing director, Julian Edwards, wanted to thank them for their hard work over the past year under Covid-19. The email said they would get a one-off payment as a thank you after "huge strain was placed upon a large number of our workforce". However, those who clicked through on the link to read Edwards' thank you were instead emailed back with a message telling them it was a company-designed "phishing simulation test" and there was to be no bonus. It warned: "This was a test designed by our IT team to entice you to click the link and used both the promise of thanks and financial reward.""
dr tech

When it comes to creative thinking, it's clear that AI systems mean business | John Nau... - 0 views

  •  
    "Ah, but isn't creativity a slippery concept - something that's hard to define but that we nevertheless recognise when we see it? That hasn't stopped psychologists from trying to measure it, though, via tools such as the alternative uses test and the similar Torrance test. And it turns out that one LLM - GPT-4 - beats 91% of humans on the former and 99% of them on the latter. So as the inveterate artificial intelligence user Ethan Mollick puts it: "We are running out of creativity tests that AIs cannot ace.""
dr tech

Kids who use ChatGPT as a study assistant do worse on tests | Popular Science - 0 views

  •  
    "Researchers at the University of Pennsylvania found that Turkish high school students who had access to ChatGPT while doing practice math problems did worse on a math test compared with students who didn't have access to ChatGPT. Those with ChatGPT solved 48 percent more of the practice problems correctly, but they ultimately scored 17 percent worse on a test of the topic that the students were learning. "
dr tech

Chatbot 'Eugene Goostman' passes Turing Test | KurzweilAI - 0 views

  •  
    "The Turing Test was passed for the first time by a chatbot called "Eugene Goostman" on Saturday by convincing 33% of the human judges that it was human, according to Professor Kevin Warwick, a Visiting Professor at the University of Reading and Deputy Vice-Chancellor for Research at Coventry University, in a statement."
dr tech

Pearson is Now Spying on Students During Standardized Testing ⋆ Ink, Bits, & ... - 0 views

  •  
    "Luckily for the kid, it was only a single tweet. Had several students gotten together to form a study group, they might have been prosecuted for felony interference with a business model and gotten the death penalty. Do you suppose Pearson has hidden microphones set up around the schools so they can also listen in and see if students discuss the tests during lunch? I ask because that is basically the offline version of the student's infraction."
dr tech

How Excel may have caused loss of 16,000 Covid tests in England | Health policy | The G... - 0 views

  •  
    "But while CSV files can be any size, Microsoft Excel files can only be 1,048,576 rows long - or, in older versions which PHE may have still been using, a mere 65,536. When a CSV file longer than that is opened, the bottom rows get cut off and are no longer displayed. That means that, once the lab had performed more than a million tests, it was only a matter of time before its reports failed to be read by PHE."
dr tech

Discrimination by algorithm: scientists devise test to detect AI bias | Technology | Th... - 0 views

  •  
    "Concerns have been growing about AI's so-called "white guy problem" and now scientists have devised a way to test whether an algorithm is introducing gender or racial biases into decision-making."
dr tech

Hand-held eye exam * The Intelligent Optimist - 0 views

  •  
    "A smartphone application called Peek, Portable Eye Examination Kit, utilizes the phone's camera and can conduct visual and color field tests, lens imaging for cataracts, and retinal imaging, among other tests to detect sight impairments and diseases."
dr tech

GPT-3 medical chatbot tells suicidal test patient to kill themselves | Boing Boing - 0 views

  •  
    "GPT-3 medical chatbot tells suicidal test patient to kill themselves"
dr tech

yes, all models are wrong - 0 views

  •  
    "According to Derek & Laura Cabrera, "wicked problems result from the mismatch between how real-world systems work and how we think they work". With systems thinking, there is constant testing and feedback between the real world, in all its complexity, and our mental model of it. This openness to test and look for feedback led Dr. Fisman to change his mind on the airborne spread of the coronavirus."
dr tech

Are kids' test scores really declining? - 0 views

  •  
    "Could it be the phones? Absolutely! To be clear: the idea that phones are causing distraction both inside and outside of school hours, and this contributes to declining test scores, seems totally plausible to me-and preliminary cross-sectional data from the PISA report indicates the same. Might it be a good idea to keep phones out of the classroom? Definitely! But, as often happens when an excerpt of a larger study makes the rounds online, some nuance is missing. Let's talk about what the data actually show. "
dr tech

OpenAI's Project Strawberry Said to Be Building AI That Reasons and Does 'Deep Research' - 0 views

  •  
    "A source told Reuters that OpenAI has tested a model internally that achieved a 90 percent score on a challenging test of AI math skills, though it again couldn't confirm if this was related to project Strawberry. But another two sources reported seeing demos from the Q* project that involved models solving math and science questions that would be beyond today's leading commercial AIs."
dr tech

Microsoft now faces a big Windows 10 quality test after botched update - The Verge - 0 views

  •  
    "Microsoft has pulled its latest Windows 10 update offline after some users complained of missing files. It's the latest in a string of incidents with regular patches and Microsoft's larger Windows 10 updates that have been causing issues for some PC users this year. While Microsoft tests Windows 10 with millions of beta testers, there are signs that this public feedback loop isn't always working. Earlier this year Microsoft delayed its April 2018 Windows 10 update due to last minute Blue Screen of Death issues, and then had to fix desktop and Chrome freezing issues after it was shipped to more than 600 million machines. "
dr tech

Chinese schools are testing AI that grades papers almost as well as teachers | VentureBeat - 0 views

  •  
    "It is also self-improving. The 10-year-old grading software leverages deep learning algorithms to "compare notes" with human teachers' scores, suggestions, and comments. An engineer involved in the project compared its capabilities to those of AlphaGo, the record-breaking AI Go player developed by Google subsidiary DeepMind."
circuititgs

Virgin Hyperloop selects West Virginia to test its futuristic transport system - The Verge - 0 views

  •  
    "Virgin Hyperloop One announced its plan to build a $500 million certification center to advance its vision of the future of high-speed transportation in West Virginia. The state will serve as a locus for testing, developing, and validating the technology that underpins the still-theoretical hyperloop system. "
1 - 20 of 110 Next › Last »
Showing 20 items per page