Skip to main content

Google research lets sign language switch ‘active speaker’ in video calls

 https://www.mapleidentity.com/forums/showthread.php?tid=2560


https://tfaforum.org/showthread.php?tid=13609


https://forummafia.net/showthread.php?tid=62180


http://prayformypet.com/board/showthread.php?tid=26739


http://forum.realtor-room.ru/showthread.php?tid=1786


http://www.nksoa.org/mybb/showthread.php?tid=34927


http://forum.engesoftbi.com.br/showthread.php?tid=27341


https://mysourcetelevision.com/forum/showthread.php?tid=149469


https://ekgelirrehberi.com/showthread.php?tid=10733&pid=11251#pid11251

An aspect of video calls that many of us take for granted is the way they can switch between feeds to highlight whoever’s speaking. Great — if speaking is how you communicate. Silent speech like sign language doesn’t trigger those algorithms, unfortunately, but this research from Google  might change that.


It’s a real-time sign language detection engine that can tell when someone is signing (as opposed to just moving around) and when they’re done. Of course it’s trivial for humans to tell this sort of thing, but it’s harder for a video call system that’s used to just pushing pixels.


A new paper from Google researchers, presented (virtually, of course) at ECCV, shows how it can be done efficiency and with very little latency. It would defeat the point if the sign language detection worked but it resulted in delayed or degraded video, so their goal was to make sure the model was both lightweight and reliable.


https://www.fiftyeyes.com/showthread.php?tid=714


https://insigniagsdrivers.co.uk/showthread.php?tid=92178


http://forum.naronanews.com/showthread.php?tid=7174


http://concerns.sportshouse.com.ph/showthread.php?tid=7752


http://concerns.sportshouse.com.ph/showthread.php?tid=43851


https://forum.hacknslashworld.com/viewtopic.php?f=9&t=43584


https://forum.hacknslashworld.com/viewtopic.php?t=5428


https://forum.hacknslashworld.com/viewtopic.php?t=11650


https://forum.hacknslashworld.com/viewtopic.php?t=470


https://forum.hacknslashworld.com/viewtopic.php?t=8145


https://forum.hacknslashworld.com/viewtopic.php?t=7301


https://forum.hacknslashworld.com/viewtopic.php?t=191


https://forum.hacknslashworld.com/viewtopic.php?t=16293


https://forum.hacknslashworld.com/viewtopic.php?t=12916


https://forum.hacknslashworld.com/viewtopic.php?t=6431


https://forum.hacknslashworld.com/viewtopic.php?t=13507


https://forum.hacknslashworld.com/viewtopic.php?t=8554


http://www.flyingfish.nl/forum/viewtopic.php?t=1929920


http://www.flyingfish.nl/forum/viewtopic.php?p=2326980


http://www.flyingfish.nl/forum/viewtopic.php?p=2333407

The system first runs the video through a model called PoseNet, which estimates the positions of the body and limbs in each frame. This simplified visual information (essentially a stick figure) is sent to a model trained on pose data from video of people using German Sign Language, and it compares the live image to what it thinks signing looks like.


This simple process already produces 80 percent accuracy in predicting whether a person is signing or not, and with some additional optimizing gets up to 91.5 percent accuracy. Considering how the “active speaker” detection on most calls is only so-so at telling whether a person is talking or coughing, those numbers are pretty respectable.


In order to work without adding some new “a person is signing” signal to existing calls, the system pulls clever a little trick. It uses a virtual audio source to generate a 20 kHz tone, which is outside the range of human hearing, but noticed by computer audio systems. This signal is generated whenever the person is signing, making the speech detection algorithms think that they are speaking out loud.

http://www.flyingfish.nl/forum/viewtopic.php?p=2405895


http://www.flyingfish.nl/forum/viewtopic.php?p=2386502


http://www.flyingfish.nl/forum/viewtopic.php?p=2407424


http://www.flyingfish.nl/forum/viewtopic.php?p=2341767


http://www.flyingfish.nl/forum/viewtopic.php?p=2331743


https://modelcarsforum.com/showthread.php?tid=24000


http://detimgn.iboards.ru/viewtopic.php?f=50&t=16348

Comments

Popular posts from this blog

GET TECHNICAL FORUMS

http://www.streathamcommonforum.co.uk/viewtopic.php?f=14&t=21768 http://www.cyklistikakrnov.com/forum/viewtopic.php?t=89069 http://fms.misionsucre.gob.ve/foro/viewtopic.php?t=902593 http://forum.prokarters.co.uk/viewtopic.php?f=2&t=545030 https://techninjahub.blogspot.com/2019/05/get-technology-ideas-from-here.html https://technicalweb85.blogspot.com/2019/05/get-technical-support-by-visiting-this.html https://www.ex-ttcommunity.com/forum/viewtopic.php?t=239190 http://understandanxiety.org/anxiety-forum/viewtopic.php?t=44589 http://www.skyarn.fr/forum/viewtopic.php?t=59733 http://www.trungvitlon.com/viewtopic.php?t=2215 http://www.taflan.org/viewtopic.php?t=297889 http://cafe103.info/phpBB/viewtopic.php?t=95110 http://forum.rethia.net/viewtopic.php?t=1331399 https://coalpail.com/coal-forum/viewtopic.php?t=12562 http://frlegends.net/showthread.php?tid=11133 http://forum.packbel.by/viewtopic.php?t=51682 http://pure-arrogance.de/forum/viewtopic.php?...

Finary wants to create the wealth management dashboard for the next generation

 Meet Finary, a new French startup that wants to change how you manage your savings, investments, mortgage, real estate assets and cryptocurrencies. The company lets you aggregate all your accounts across various banks and financial institutions so that you can track your wealth comprehensively over time. After attending Y Combinator, the startup has just closed a $2.7 million (€2.2 million) seed round led by Speedinvest with Kima Ventures and angel investors, such as Raphaël Vullierme also participating. https://www.redheronation.org/forums/showthread.php?tid=892 http://forum.naronanews.com/showthread.php?tid=19123 https://crackx.to/Thread-Mega-nz-voucher-codes http://kaikodai.com/viewtopic.php?f=16&t=60576 https://whitehatcommunity.com/showthread.php?pid=217878&tid=148248 http://hanabilkova.svet-stranek.cz/nakup/41 http://mobile.jaksezijespolecnicim.stranky1.cz/forum/ http://maskedavengerstudios.blogspot.com/2014/07/batman66-king-tut.html https://emrebaransel.blogspot.com...

Daily Crunch: Jio and Google set November 4 rollout for India’s $87 JioPhone Next

Hello and welcome to Daily Crunch for October 29, 2021. If you feel a little snowed-under after all the news from the week, we understand. This week saw Facebook change its name, new hardware from Google and Samsung, Apple laptops reviews, Sequoia revamping its entire structure, Big Tech earnings, issues at Ro, and eighty-eleven startup funding rounds and product launches. But we made it through, so let’s go back over today’s biggest news and then get right into this weekend! —Alex The TechCrunch Top 3 Public cloud revenues reach $45B: In the third quarter, the value of public cloud revenues from Google, Microsoft and Amazon hit $45 billion, a figure good for a $180 billion run rate. That figure underscores how far the cloud has come in recent years and represents spend from a host of companies big and small, tech and otherwise. TechCrunch dug into what impact the chip shortage is, and isn’t, having on growth amongst the public cloud lords, in case you were curious about that particula...