Spyke

Supportless 5-axis 3D printing using Prusa i3 with open5x

This was 3D printed on a converted Prusa i3 using an ongoing project called open5x. The preprint article can be found at the link below: https://arxiv.org/abs/2202.11426 Git...

machinelearning · Machine Learning by Deliverator

BayesFlow: Amortized Bayesian Workflows With Neural Networks

Modern Bayesian inference involves a mixture of computational techniques for estimating, validating, and drawing conclusions from probabilistic models as part of principled workflows for data analysis. Typical problems in Bayesian workflows are the approximation of intractable posterior distributions for diverse model types and the comparison of competing models of the same process in terms of their complexity and predictive performance. This manuscript introduces the Python library BayesFlow for simulation-based training of established neural network architectures for amortized data compression and inference. Amortized Bayesian inference, as implemented in BayesFlow, enables users to train custom neural networks on model simulations and re-use these networks for any subsequent application of the models. Since the trained networks can perform inference almost instantaneously, the upfront neural network training is quickly amortized.

https://arxiv.org/abs/2306.16015

science · Science by Deliverator

Scientists have finally 'heard' the chorus of gravitational waves that ripple through the universe

NEW YORK (AP) — Scientists have observed for the first time the faint ripples caused by the motion of black holes that are gently stretching and squeezing everything in the universe.

They reported Wednesday that they were able to “hear” what are called low-frequency gravitational waves — changes in the fabric of the universe that are created by huge objects moving around and colliding in space.

“It’s really the first time that we have evidence of just this large-scale motion of everything in the universe,” said Maura McLaughlin, co-director of NANOGrav, the research collaboration that published the results in The Astrophysical Journal Letters.

https://apnews.com/article/gravitational-waves-black-holes-universe-cc0d633ec51a5dc3acb0492baf7f818a

vegetarian · Vegetarian by Deliverator

What are some of your favorite vegetarian cookbooks?

This one was passed down to me by my grandmother and it's proved to be a very useful tool for both cooking and gardening. There are non-vegetarian recipes, but overall it's a great book, and these days it's not too hard to find a suitable meat alternative.

Comments: 3

gardening · Gardening by Deliverator

Black spots on my Bolivian Torch cactus, any tips? (USA zone 6)

I did some cursory reading and learned that too much sun can cause white/black spots. I keep it next to a south-facing window, but it's not getting completely baked out on my lawn or anything. Is this normal/fixable?

Comments: 3

AskKbin · Moving to: m/AskMbin! by Deliverator

What were some "bad habits" on reddit that we should try to avoid bringing over here?

As the fediverse continues to grow, let's reflect on some of the things that we disliked most about posting/lurking on reddit and what we can do differently now that we have a chance to build something new.

Comments: 87

programming · Programming by Deliverator

The Curse of Recursion: Training on Generated Data Makes Models Forget

Stable Diffusion revolutionised image creation from descriptive text. GPT-2, GPT-3(.5) and GPT-4 demonstrated astonishing performance across a variety of language tasks. ChatGPT introduced such language models to the general public. It is now clear that large language models (LLMs) are here to stay, and will bring about drastic change in the whole ecosystem of online text and images. In this paper we consider what the future might hold. What will happen to GPT-{n} once LLMs contribute much of the language found online? We find that use of model-generated content in training causes irreversible defects in the resulting models, where tails of the original content distribution disappear. We refer to this effect as Model Collapse and show that it can occur in Variational Autoencoders, Gaussian Mixture Models and LLMs. We build theoretical intuition behind the phenomenon and portray its ubiquity amongst all learned generative models. We demonstrate that it has to be taken seriously if we are to sustain the benefits of training from large-scale data scraped from the web. Indeed, the value of data collected about genuine human interactions with systems will be increasingly valuable in the presence of content generated by LLMs in data crawled from the Internet.
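The abstract's point about the tails of the distribution disappearing can be illustrated with a toy simulation. The sketch below is not the paper's experiment: it assumes the simplest possible "model" (a single Gaussian fitted by maximum likelihood), and the function name, sample size, and generation count are all made up for illustration. Each generation is fitted only to samples drawn from the previous generation's fit, and the fitted spread tends to shrink toward zero over time.

```python
import random
import statistics


def simulate_collapse(generations=300, sample_size=10, seed=0):
    """Repeatedly fit a Gaussian to samples drawn from the previous fit.

    Returns the fitted standard deviations, one per generation
    (index 0 is the original "real data" distribution).
    All parameter values here are illustrative assumptions.
    """
    rng = random.Random(seed)
    mu, sigma = 0.0, 1.0  # generation 0: the true distribution
    sigmas = [sigma]
    for _ in range(generations):
        # "Train" only on data generated by the previous generation's model.
        samples = [rng.gauss(mu, sigma) for _ in range(sample_size)]
        mu = statistics.fmean(samples)
        sigma = statistics.pstdev(samples)  # maximum-likelihood estimate
        sigmas.append(sigma)
    return sigmas


if __name__ == "__main__":
    history = simulate_collapse()
    for g in (0, 10, 50, 100, 300):
        print(f"generation {g:3d}: fitted std = {history[g]:.3e}")
```

A small sample size is used on purpose: estimation error compounds from one generation to the next, so the narrowing shows up within a few hundred generations. Larger samples slow the effect but do not remove it in this toy setup.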

https://arxiv.org/abs/2305.17493v2

machinelearning · Machine Learning by Deliverator

The Curse of Recursion: Training on Generated Data Makes Models Forget

https://arxiv.org/abs/2305.17493v2

Comments: 2

kbinMeta · /kbin meta by Deliverator

I just noticed that notifications for replies to comments/posts are disabled by default; to enable them, go to your settings and check the relevant boxes

It might be a good idea to have comment replies turned on by default. I feel like it'll help drive discussion/engagement, but that's ultimately up to the devs.

Comments: 2

3DPrinting · 3D Printing by Deliverator

What 3D modeling software do you recommend in the year 2023?

I'm currently getting by with a mixture of DesignSpark Mechanical, FreeCAD, and OpenSCAD for prototyping/editing files. I'd love to find a good alternative that isn't from a predatory company like Autodesk.

Comments: 15

gardening · Gardening by Deliverator

Your garden doesn't like the wildfires, either. Here's how to help plants handle smoke and ash

https://apnews.com/article/garden-plants-smoke-protection-935d603a69a4d34f875aec224d44c8ed

machinelearning · Machine Learning by Deliverator

Machine Learning Beginner Info/Resources

MOOCs

Nowadays, there are quite a few really excellent online lectures to get you started. The list is too long to include them all. Each of the major MOOC sites offers several good Machine Learning classes, so please check Coursera, edX, and Udacity yourself to see which ones are interesting to you.

However, there are a few that stand out, either because they're very popular or are done by people who are famous for their work in ML. Roughly in order from easiest to hardest, those are:

Andrew Ng's ML-Class at coursera: Focused on application of techniques. Easy to understand, but mathematically very shallow. Good for beginners!
https://www.coursera.org/course/ml
Hastie/Tibshirani's Statistical Learning
https://lagunita.stanford.edu/courses/HumanitiesSciences/StatLearning/Winter2016/about
Yaser Abu-Mostafa's Learning From Data: Focuses a lot more on theory, but also doable for beginners
https://work.caltech.edu/telecourse.html
Geoff Hinton's Neural Nets for Machine Learning: As the title says, this is almost exclusively about Neural Networks.
https://www.coursera.org/course/neuralnets
Hugo Larochelle's Neural Net lectures: Again mostly on Neural Nets, with a focus on Deep Learning
http://www.youtube.com/playlist?list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH
Daphne Koller's Probabilistic Graphical Models: A very challenging class, but it has a lot of good material that few of the other MOOCs here cover
https://www.coursera.org/course/pgm

Books

The most often recommended textbooks on general Machine Learning are (in no particular order):

Hastie/Tibshirani/Friedman's Elements of Statistical Learning FREE
http://statweb.stanford.edu/%7Etibs/ElemStatLearn/
Barber's Bayesian Reasoning and Machine Learning FREE
http://web4.cs.ucl.ac.uk/staff/D.Barber/pmwiki/pmwiki.php?n=Brml.HomePage
MacKay's Information Theory, Inference and Learning Algorithms FREE
http://www.inference.phy.cam.ac.uk/itila/book.html
Goodfellow/Bengio/Courville's Deep Learning FREE
http://www.deeplearningbook.org/
Nielsen's Neural Networks and Deep Learning FREE
http://neuralnetworksanddeeplearning.com/
Graves' Supervised Sequence Labelling with Recurrent Neural Networks FREE
http://www.cs.toronto.edu/%7Egraves/preprint.pdf
Sutton/Barto's Reinforcement Learning: An Introduction; 2nd Edition FREE
https://www.dropbox.com/s/7jl597kllvtm50r/book2015april.pdf

Note that these books delve deep into math, and might be a bit heavy for complete beginners. If you don't care so much about derivations or how exactly the methods work but would rather just apply them, then the following are good practical intros:

An Introduction to Statistical Learning FREE
http://www-bcf.usc.edu/%7Egareth/ISL/
Probabilistic Programming and Bayesian Methods for Hackers FREE
http://nbviewer.ipython.org/github/CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers/blob/master/Prologue/Prologue.ipynb

There is of course a whole plethora of books that only cover specific subjects, as well as many books about surrounding fields in Math. A very good list has been collected by /u/ilsunil here

Deep Learning Resources

Karpathy's Stanford CS231n: Convolutional Neural Networks for Visual Recognition (Lecture Notes)
http://cs231n.github.io/
Video Lectures for Stanford CS231n: Convolutional Neural Networks for Visual Recognition
https://www.youtube.com/playlist?list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA
Silver's Reinforcement Learning Lectures
https://www.youtube.com/watch?v=2pWv7GOvuf0
Colah's Informational Blog
http://colah.github.io/
Bruna's UC Berkeley Stat212b: Topics Course on Deep Learning
https://joanbruna.github.io/stat212b/
Overview of Neural Network Architectures
http://www.asimovinstitute.org/neural-network-zoo/

Math Resources

Strang's Linear Algebra Lectures
https://www.youtube.com/watch?v=ZK3O402wf1c
Kolter/Do's Linear Algebra Review and Reference Notes
http://cs229.stanford.edu/section/cs229-linalg.pdf
Calculus 1
https://www.edx.org/course/calculus-1a-differentiation-mitx-18-01-1x
Introduction to Probability
https://www.edx.org/course/introduction-probability-science-mitx-6-041x-1

Programming Languages and Software

In general, the most used languages in ML are probably Python, R and MATLAB (with the latter losing more and more ground to the former two). Which one suits you better depends wholly on your personal taste. For R, a lot of functionality is either already in the standard library or can be found through various packages on CRAN. For Python, NumPy/SciPy are a must. From there, Scikit-Learn covers a broad range of ML methods.
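To give a feel for what working with Scikit-Learn looks like, here is a minimal sketch that trains a classifier on the small Iris dataset bundled with the library. The function name, the choice of logistic regression, and the split size are arbitrary assumptions made for illustration, not recommendations from the guide. It assumes scikit-learn is installed.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split


def train_iris_classifier(seed=0):
    """Fit a simple classifier on Iris and return its held-out accuracy."""
    # Iris ships with scikit-learn, so nothing is downloaded.
    X, y = load_iris(return_X_y=True)

    # Hold out a quarter of the data to estimate generalisation.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=seed, stratify=y
    )

    # Illustrative model choice; most estimators share this fit/predict API.
    model = LogisticRegression(max_iter=1000)
    model.fit(X_train, y_train)

    return accuracy_score(y_test, model.predict(X_test))


if __name__ == "__main__":
    print(f"held-out accuracy: {train_iris_classifier():.2f}")
```

Most Scikit-Learn estimators follow the same `fit` / `predict` pattern, so swapping in a different model usually means changing only the line that constructs it.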

If you just want to play around a bit and don't want to do much programming yourself, then things like Visions of Chaos, WEKA, KNIME or RapidMiner might be to your liking. Word of caution: a lot of people in this subreddit are very critical of WEKA, so even though it's listed here, it is probably not a good tool for anything more than playing around a bit. A more detailed discussion can be found here

Deep Learning Software, GPUs and Examples

There are a number of modern deep learning toolkits you can utilize to implement your models. Below, you will find some of the more popular toolkits. This is by no means an exhaustive list. Generally speaking, you should utilize whatever GPU has the most memory, highest clock speed, and most CUDA cores available to you. This was the NVIDIA Titan X from the previous generation. These frameworks are all very close in computation speed, so you should choose the one you prefer in terms of syntax.

Theano is a Python-based deep learning toolkit developed by the Montreal Institute for Learning Algorithms, a cutting-edge deep learning academic research center and home of many users of this forum. It has a large number of tutorials ranging from beginner level to cutting-edge research.

Torch is a LuaJIT-based scientific computing framework developed by Facebook Artificial Intelligence Research (FAIR) and is also in use at Twitter Cortex. The Torch blog contains examples of the Torch framework in action.

TensorFlow is a Python deep learning framework developed by Google Brain and in use at Google Brain and DeepMind. It is the newest framework around. Some TensorFlow examples may be found here. Do not ask questions on the Google Groups; ask them on Stack Overflow.

Neon is a Python-based deep learning framework built around Maxas, a custom and highly performant CUDA compiler by NervanaSys.

Caffe is an easy to use, beginner friendly deep learning framework. It provides many pretrained models and is built around a protobuf format of implementing neural networks.

Keras can be used to wrap Theano or TensorFlow for ease of use.

Datasets and Challenges for Beginners

There are a lot of good datasets here to try out your new Machine Learning skills.

Kaggle has a lot of challenges to sink your teeth into. Some even offer prize money!
http://www.kaggle.com/
The UCI Machine Learning Repository is a collection of a lot of good datasets
http://archive.ics.uci.edu/ml/
http://blog.mortardata.com/post/67652898761/6-dataset-lists-curated-by-data-scientists lists some more datasets
Here is a very extensive list of large-scale datasets of all kinds.
http://www.quora.com/Data/Where-can-I-find-large-datasets-open-to-the-public
Another dataset list
http://www.datawrangling.com/some-datasets-available-on-the-web

Research Oriented Datasets

In many papers, you will find a few datasets are the most common. Below, you can find the links to some of them.

MNIST: A small handwritten-digit dataset that is often used as a sanity check in modern research
http://yann.lecun.com/exdb/mnist/
SVHN: Similar to MNIST, but with color images of house numbers. A sanity check in most cases.
http://ufldl.stanford.edu/housenumbers/
CIFAR-10/100: CIFAR-10 and CIFAR-100 are two natural color image datasets that are often used with convolutional neural networks for image classification.
https://www.cs.toronto.edu/%7Ekriz/cifar.html

Communities

http://www.datatau.com/ is a data-science-centric Hacker News
http://metaoptimize.com/qa/ and http://stats.stackexchange.com/ are Stack Overflow-like discussion forums

ML Research

Machine Learning is a very active field of research. The two most prominent conferences are without a doubt NIPS and ICML. Both sites host PDF versions of the accepted papers, which are a great way to catch up on the most up-to-date research in the field. Other very good conferences include UAI (general AI), COLT (covers theoretical aspects) and AISTATS.

Good sources for ML papers are the Journal of Machine Learning Research, the journal Machine Learning, and arXiv.

FAQ

How much Math/Stats should I know?

That depends on how deep you want to go. For a first exposure (e.g. Ng's Coursera class) you won't need much math, but in order to understand how the methods really work, having at least an undergrad level of Statistics, Linear Algebra and Optimization won't hurt.

Comments: 3

anarchism · Anarchism by Deliverator

Nestor Makhno — Anarchy's Cossack: The Struggle for Free Soviets in Ukraine 1917-1921 (Free Audiobook)

The phenomenal life of Ukrainian peasant Nestor Makhno (1888–1934) provides the framework for this breakneck account of the downfall of the tsarist empire and the civil war that convulsed and bloodied Russia between 1917 and 1921. As in many of history's chivalric tales, clashes were fought through lightning cavalry charges and bitter hand-to-hand, saber-wielding combat. The combatants were drawn from several camps: Budyenny's Red cavalry, the Don Cossacks and Kuban Cossacks (allied with the Whites), Ukrainian nationalists, and Makhnovist partisans. Makhno, a formidable and daring strategist, headed an army of anarchist insurgents—a popular peasant movement which bore his name.

https://archive.org/details/anarchys-cossack

vegetarian · Vegetarian by Deliverator

Basic tips, tutorials and recipes (work in progress)

Recipes

https://github.com/hrs/recipes

An excellent repository of vegetarian/vegan recipes hosted on GitHub.

https://vegetatio.com/content/joyous-living-full-vegetarian-cheese-list

A list of vegetarian-friendly cheese brands.

Common non-vegetarian ingredients

Bonito Flakes (dried fish flakes). Traditional flavoring used in Thai, Chinese, Japanese, etc. soups and dishes.
Bone Marrow. Extracted from animal bones. Used in broths.
Carmine (cochineal insect extract). Used as a red food dye (natural red 4, C.I. 75470, or E120).
Castoreum (beaver anal glands). Uncommon but is used in vanilla and musk scents/flavorings.
Isinglass (fish bladders). Used in beer and wine making. See Barnivore.com for an updated list of vegetarian brands of beers and wines.
Gelatin (extracted from animal skin, bones and tissue).
Lard (rendered animal fat).
Rennet. Used in cheese making. See vegetatio.com for an updated list of vegetarian brands of cheeses.
Lipase (pre-gastric tongue root glands of lambs/calves used in cheese making). See vegetatio.com for an updated list of vegetarian brands of cheeses.
L-cysteine (feathers/hair). Used in bread making.
Lanolin (sheep oil). Used in Vitamin D supplements.
Pepsin (pig stomach). See Rennet.

Hidden animal products in popular foods

Gelatin: Gelatin is produced from pig, cow and chicken bones. (Sometimes fish bones are used for rare Passover marshmallows.) Anything with marshmallow has gelatin in it. This includes those hard little cereal marshmallows. Gelatin is also found in most gummy candy, some jellybeans, and even most Candy Corn. It's also found in everything by the Necco brand of candy (Necco wafers). It's often found in pudding or cheesecake desserts, and it's in Costco chocolate cake. Those shiny tarts with the fruit on top? Gelatin is usually in the clear film on top. Some random foods with added gelatin include instant rice and peanuts, and some wine and beer is clarified with it.
McDonald's French fries: Beef! These contain an ingredient you would not expect in a french fry: cow. This may be limited to the USA. Also look out for lard in any refried beans or in Mexican food in general.
Prepared pie crusts: More often than not these contain lard instead of vegetable shortening. At most health food stores you can find a veg-friendly alternative, but most at the mega-marts are lard laden.
Texas Roadhouse sweet potatoes: I thought this was safe, but they're rubbed down in bacon grease before cooking.
Cheese rennet: Unless it says vegetable rennet, you can assume your cheese is made with animal rennet, which comes from the stomach of cows. I don't know how much rennet stays in the final product (or if it is flushed with the whey), but it's good to know.
Hidden anchovies: Lots of items like sauces, dressings, and flavorings contain Worcestershire sauce, which itself contains anchovies.
Chicken stock/fat: I find this in so many products, including soups, chips, rice, and noodle dishes. It's one you can find almost anywhere.

Vegetarian Protein Guide

Quinoa. Protein: 8 grams per 1-cup serving, cooked
Buckwheat. Protein: 6 grams per 1-cup serving, cooked
Soy. Protein: 12 grams per ½-cup serving (firm tofu), 15 grams per 3-ounce serving (tempeh), 18 grams per 3.5-ounce serving (natto), 17 grams per 1-cup serving (edamame)
Mycoprotein (Quorn). Protein: 13 grams per ½-cup serving
Rice and beans. Protein: 8 grams per 1-cup serving
Hummus and pita. Protein: 7 grams per 1 whole-wheat pita and 2 tablespoons hummus
Spirulina. Protein: 4 grams per 1 tablespoon
Peanut butter sandwich. Protein: 15 grams per 2-slice sandwich with 2 tablespoons peanut butter

(source https://greatist.com/health/complete-vegetarian-proteins)
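If you want to add up a day's intake from the figures in the guide, a few lines of Python will do it. This is only a rough sketch: the per-serving numbers are copied from the guide, while the dictionary keys, the function name, and the example menu are made up for illustration.

```python
# Grams of protein per listed serving, copied from the guide.
# The key names are illustrative labels, not part of the source.
PROTEIN_PER_SERVING = {
    "quinoa (1 cup, cooked)": 8,
    "buckwheat (1 cup, cooked)": 6,
    "firm tofu (1/2 cup)": 12,
    "tempeh (3 oz)": 15,
    "natto (3.5 oz)": 18,
    "edamame (1 cup)": 17,
    "mycoprotein (1/2 cup)": 13,
    "rice and beans (1 cup)": 8,
    "hummus and pita": 7,
    "spirulina (1 tbsp)": 4,
    "peanut butter sandwich": 15,
}


def total_protein(servings):
    """Sum protein in grams for a mapping of food name to number of servings."""
    return sum(PROTEIN_PER_SERVING[food] * count for food, count in servings.items())


if __name__ == "__main__":
    # Hypothetical day: one bowl of quinoa, one portion of tempeh,
    # one peanut butter sandwich, and two servings of hummus and pita.
    menu = {
        "quinoa (1 cup, cooked)": 1,
        "tempeh (3 oz)": 1,
        "peanut butter sandwich": 1,
        "hummus and pita": 2,
    }
    print(f"{total_protein(menu)} g protein")
```

The example menu works out to 8 + 15 + 15 + 2 × 7 = 52 grams.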
