Talk:Integer square root
This article is rated Start-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | ||||||||||||||||||||||||||||
|
Request for a picture
editIf someone with Maple, Mathematica, etc. and would graph a function of isqrt, that would be very appreciated. I have use of Maple over Putty, but it displays graphs with ASCII characters, and it's hideous to look at. Iamunknown 09:26, Sep 13, 2004 (UTC)-
- On second thought, is a graph necessary? Opinions, please. Iamunknown 18:53, 28 July 2006 (UTC)
It might be nice to have a plot for every specific function with its own article. To give a quick visual insight to what's going on. If the scale isn't terrible. WikiSlothBlobThing (talk) 02:13, 24 July 2021 (UTC)
Remarks about the writing
editHi Olegalexandrov. I was planning on editing the page again, but wanted to consult you beforehand. I think that the apostrophe in the bottom link needs to be escaped; otherwise Wikipedia fails to recognize the enclosed text as a link. On the flip-side, thanks for cleaning up after me! I messed up on standardization of 'x' and 'n', 'x_n', and 'valid' is much better than 'correct'. I am wondering, however, about your excess of links. For the words (take 'number theory' for example) that are throughout the document, I personally think that only one reference should be linked, presumably at the top. Since that was a bulk of your edit, though, I decided to ask you beforehand; I didn't want to completely reverse your edit! ('Recursive function' is also linked more than once.) Also, I completely realize my ambiguity in the statement, "The beauty of Newton's Iteration for finding the integer square root of a number n is that it can use solely integers," and was wondering what you thought about this statement: "The beauty of Newton's Iteration as used to find the integer square root of a number n is its inherent ability to stay within the set of integers, requiring no use of floating-point arithmetic." That was definitely what I trying to get at. Thanks! --Iamunknown 08:45, 11 Dec 2004 (UTC)
- Thanks for the info about the excess links; I will be reverting that soon. I do mean that every operation can be performed in integers. About the statement,
I mean the function itself is wholly algebraic. It uses fractions, yes, but fractions are merely two integers placed atop and beneath one another. Keeping that into perspective, you could retain their integer identites and continue on with your calculation (still remaining in the set of integers) and when you have reached the desired accuracy, then do your final division.The beauty of Newton's Iteration as used to find the integer square root of a number n is its inherent ability to stay within the set of integers, requiring no use of floating-point arithmetic,
- --Iamunknown 21:50, 11 Dec 2004 (UTC)
- Got it now! You meant to say that you use rational numbers, not integers. Then it all makes sense! So maybe you should write on the integer square root page that finding the square root directly gives you irrational numbers, while doing Newton's method gives you rational numbers. What do you think? --Olegalexandrov 22:15, 11 Dec 2004 (UTC)
- Excellent! That is much easier to understand than what I previously had written. I'll definitely add that right now. Thanks, Olegalexandrov! --Iamunknown 22:29, 11 Dec 2004 (UTC)
Division-free square root
editIs anyone interested in the divison-free algorithm for the square root? Obviously it only converges linearly rather than quadratically, but, especially in binary, it might be faster anyway. —The preceding unsigned comment was added by 80.175.250.218 (talk • contribs) 10:46, 31 October 2005 (UTC).
- Actually, now that I think about it, it's only purely division-free in binary, not in other bases. —The preceding unsigned comment was added by 80.175.250.218 (talk • contribs) 10:48, 31 October 2005 (UTC).
Sure... I had to code up a division- (and multiplication)-free version for an 8 bit micro that had no divide or multiply instruction - you're welcome to use this if you want it. The code has been verified against all 65535 inputs:
u_int8_t IterSqrt(u_int16_t b) {
u_int8_t i, j;
u_int16_t step, prev, tot, isquared, jsquared;
if (b <= 1) return b;
i = j = 0; isquared = prev = tot = 0; step = 1;
for (;;) { // maximum 255 iterations, but cheap compared to the 5 divides in software needed by Newton-Raphson.
tot += step; step += 1; jsquared = tot+prev; prev = tot; // calculate squares using 'table of differences' method.
if (jsquared == 0) return 255U; // overflow and loop-end test in one!
if (b >= isquared && b < jsquared) return i; // test plausible square root: sqrt(b) = i IF i^2 <= b < (i+1)^2
i += 1; isquared = jsquared;
} // loop contains two full adds, two increments, and 3 comparisons.
}
Algorithm in terms of integers
editThe algorithm given in the article
with works if you do integer calculations, too (truncate when dividing). Then the algorithm terminates when . And as far as I can see, it actually converges a little faster. Wouldn't it make more sense for the article to do everything in terms of integers, instead of rationals? dbenbenn | talk 09:45, 18 March 2006 (UTC)
If you use truncation when dividing, the algorithm doesn't work when n=3, or any square less one. In that case it'll oscillate between sqrt(n+1) and sqrt(n+1)-1. CHL (aka yse) (talk) 14:04, 30 November 2008 (UTC)
The description of the algorithm in the article should explicitly state what kind of variables are used in the algorithm. It is very easy for someone to think that the variables are integers, for example.
My opinion is that the algorithm used should be in terms of integer variables. It seems that the context in which someone is most likely to look at the algorithm is when they are operating in integer domains, and algorithms based on non-integer computations will be of little interest in such situations. —Preceding unsigned comment added by 71.112.25.123 (talk) 00:18, 7 August 2009 (UTC)
Integer square root code in C
editWould it be appropriate to post the following C code in the article? It is fast and tested:
#include <stdio.h> /* * Return the truncated integer square root of "y" using longs. * Return -1 on error. */ long lsqrt(long y) { long x_old, x_new; long testy; int nbits; int i; if (y <= 0) { if (y != 0) { fprintf(stderr, "Domain error in lsqrt().\n"); return -1L; } return 0L; } /* select a good starting value using binary logarithms: */ nbits = (sizeof(y) * 8) - 1; /* subtract 1 for sign bit */ for (i = 4, testy = 16L;; i += 2, testy <<= 2L) { if (i >= nbits || y <= testy) { x_old = (1L << (i / 2L)); /* x_old = sqrt(testy) */ break; } } /* x_old >= sqrt(y) */ /* use the Babylonian method to arrive at the integer square root: */ for (;;) { x_new = (y / x_old + x_old) / 2L; if (x_old <= x_new) break; x_old = x_new; } return x_old; }
I have thoroughly tested this code and guarantee it is correct. --Gesslein 16:20, 2 July 2006 (UTC)
- I do not know. I started a Integer square root codebase on Wikisource which was eventually deleted, because (although the programs in each language worked) the programs were original contributions. I think that applies to this article and would thus forbid you to place that code in the article, but I will forward the query to Charles Matthews, a prominent mathematics administrator. This article itself, however, may be an original contribution. He will be more likely to know than I. Iamunknown 20:03, 18 July 2006 (UTC)
- I thought the terms of Wikipedia edits were "No original research". Contributions should always be welcome :-) --Gesslein 18:53, 28 July 2006 (UTC)
- I agree that contributions should always be welcome. The proposed mathematics convention for algorithms, however, suggests that "Algorithms, like all other content in Wikipedia, should be traceable to reliable, published sources. Every presentation of an algorithm (both pseudocode and sample code) should include a citation." I agree with this, even though abiding by it currently prevents your code from inclusion. Iamunknown 19:34, 28 July 2006 (UTC)
- Thanks for your thoughts on this. The algorithm used is Newton's method (also called the Babylonian method) described in the article. It gets a speed increase from using binary logarithms to select the starting value. --Gesslein 19:40, 28 July 2006 (UTC)
The thing about Newton-Raphson is that it iterates so quickly that a loop is seldom required - if the maximum integer size is known, then a fixed number of iterations can be used inline instead. For example:
u_int16_t isqrt32_nr(u_int32_t number) {
u_int32_t guess2, guess1 = number, guess = number;
if (number <= 1) return number;
while (guess1) { guess >>= 1; guess1 >>= 2; } // ballpark - half the number of bits
guess1 = (guess + number / guess) >> 1; // enough Newton-Raphson iterations for 32 bit input
guess2 = (guess1 + number / guess1) >> 1;
guess = (guess2 + number / guess2) >> 1; if (guess == guess1) return guess2;
guess1 = (guess + number / guess) >> 1; if (guess1 == guess2) return guess;
guess2 = (guess1 + number / guess1) >> 1; if (guess2 == guess) return guess1;
return (u_int16_t)guess2;
}
(Also verified against all 2^32 unsigned inputs. Takes about 10 minutes to test on my old i686 32-bit PC) — Preceding unsigned comment added by 70.124.38.160 (talk) 01:18, 22 May 2023 (UTC)
Extend definition to include zero
editI would like to suggest that the definition is extended from 'a positive integer n' to 'a non-negative integer n', which would be in line with the definition of the (real-number domain) square root.
If that is accepted, it should be stated that the algorithm is only suitable for finding isqrt(n), n > 0.
Stopping criterion
editCurrently, there is a statement in the Algorithm section with an unsourced statement that implies , which is not actually difficult to prove (contrary to what the text suggests). Of course, I can't cite myself, since that's original research. I'm going to remove the claim that the proof is not trivial; if anybody objects, reply here. (Oh, and no offence to the person who wrote that it isn't trivial.) CHL (aka yse) (talk) 07:28, 20 October 2009 (UTC)
Funny that you should nitpick over the claim that the proof is trivial, while you ignore that the claim is not true. The stopping criterion should be , else if the root is in between 2 integers, the error could carry you over to the next number. 5.102.253.21 (talk) —Preceding undated comment added 15:43, 12 February 2014 (UTC)
Having looked back at my old "proof", it turned out to be rather shoddy and nonrigorous, but I have since retried proving it and the original stopping criterion of really is sufficient. Some algebraic manipulation on gives . Assuming that , we have . Splitting this into the cases where is a perfect square and where is not leads straightforwardly in either case to . CHL (aka yse) (talk) 10:43, 12 December 2014 (UTC)
irrational for "almost all"?
editI doubt that √n is irrational for almost all n. I can easily give infinitely many n, such that √n is rational (even an integer). Maybe something like "for most n" was meant, though strictly this is not correct either, as there are exactly as many square numbers as non-square numbers ... --134.102.204.124 (talk) 16:17, 21 February 2012 (UTC)
- It is irrational for almost all n, insofar as it is rational for a set of measure 0. CRGreathouse (t | c) 18:50, 21 February 2012 (UTC)
- There are much more irrational numbers than rational numbers. — Preceding unsigned comment added by 129.97.171.175 (talk) 21:41, 3 March 2014 (UTC)
Notes ad Revision 08-12-2021 (01:11)
editAd 'Algorithm using linear search'
edit- In the previous revision(s) the C-code was changed. The type of the functions was changed from
unsigned long long int
toint
.
- Comment: In a fragment further down [1] the type used was
unsigned long
.[2] An article should be consistent in style, so a mix of data types should be based on a reason, which here I do not see. I admit that by introducingunsigned long long int
myself I did not stick to this design principle. I fixed this in the current release by changing the type throughout tounsigned long
. Whatever the outcome shall be, I am in favour of keepingunsigned
, to visualize that negative numbers are disallowed.
- The code
x = 1; while( x * x <= y ) x = x + 1;
- was changed to
for( x = 1; x * x <= y; ++x ) ;
- This is a change of syntax only. As a professional C-veteran myself I have no problems at all with this
for
-statement. My decision to use an equivalentwhile
is based on the consideration that the present article is not an article about C. The readers can be expected to know some Pascal or Algol or BASIC or ... . Pseudocode too. But they are not necessarily experts in C. Afor
statement like this will pull many off. On the other hand the C-programmers also understand thewhile
-construct. So why not stay on common ground?
Ad 'Algorithm using binary search'
edit- The assignments
L = 0; R = y;
are illegal without previous declaration of these variables. The code does not compile. I fixed this.
- In the previous revision the test
while( L != R - 1 )
was changed towhile( L < R - 1 )
. The author hailed this as an increase of robustness. I disagree. For awhile (condition){...}
loop to be valid, the condition must at some point turn False. If the != condition is never met then the loop will run forever. An unexpected eternal loop is made more unlikely by strengthening the termination condition to <. As the procedure is designed to reach an = situation, the defensive reflex to allow illegitimate variable values to become even more illegitimate cannot be called 'more robust'. On the theoretical side see the following informal example.
- An inline variable declaration
int const
was introduced. This is not possible in many other languages. The attention is deflected to considerations about const and scope. Avoid! Keep it simple.
- I further simplified the code, keeping the proposed name change
M
instead ofx
.
- It was incorrectly mentioned that the linear search on input 2000000 would need 2000000 steps. 1414 steps are needed.
Reply
editIn 'Algorithm using linear search', you convinced me to keep the while loop, which indeed makes to code more understandible to non-C-programmers. However, for a similar reason, I'd still suggest to keep the involved types (throughout the whole article) as easy as possible. That is, long should be avoided, as it is very specific to C. My personal taste would also omit unsigned, but you have a point in explicitly disallowing negative numbers.
In 'Algorithm using binary search', you convinced me that != in the loop condition is more robust than < . On the other hand, the latter indicates the loop invariant L <= R in a better way. If you insist on !=, I won't object any further. As for the declaration of M, as far as I know, many languages allow new local declarations at the beginning of a block, i.e. immediately after a {. I found programs easier to understand if variable scopes are as small as possible, and if immutible variables are used whenever possible (using const in C); however, for a demonstration program of this small size, it doesn't make much difference, so I don't want to argue any further about this, either. In the while condition, I'm still in favor of omitting the parantheses; the involved operator priorities are the same in all programming languages I know, and in mathematics, so the expression is understandible for anybody without parantheses. Finally, you are right with 1414 vs. 2000000. - Jochen Burghardt (talk) 10:23, 8 December 2021 (UTC)
Notes ad Revision 27-02-2022 (16:51)
editFrom the section Introductory remark I removed the sentence (which I had introduced myself):
- "One can define to be the number for which there is an algorithm — more precisely: "Turing machine" — which, given a second input parameter , terminates with the decimal representation of truncated to decimal digits."
My reasons for removal were:
- It looks like the definition of is self-referential, as appears in the definiens as well as in the definiendum. It is not, but a second thougth is required.
- The notion of "algorithm" is too broad, as it carries on its back the Church–Turing thesis. "Turing machine" is a more restrained concept. On the other hand I did not want to talk about Turing machines in this article.
- The removed sentence is not neccessary for the remark I make. The sentence might even obnubilate my point to some extent.
My revision has been reverted with the comment: "unexplained content removal".
As the content removal is explained here, I will "undo".
Suggest correction or removal of "... using subtraction section"
editAs written, the program does not produce correct results for all input (e.g. for input 8 the program produces 3 and not the desired 2). This may have been an attempt to modify the algorithm to avoid needing to repeatedly insert zeros into digits as described in the referenced paper (see step R2 on page 1), which would involve more complicated arithmetic. 2604:3D08:6881:4A00:D9A0:6FE3:4F1F:17DE (talk) 17:50, 12 September 2022 (UTC)
Alternate methods
editOne approach described at https://leetcode.com/problems/sqrtx/editorial/ (paywalled) uses the Exponential Identity to rewrite sqrt(x) into e^((1/2)lnx. Obviously you need to then be able to calculate exponentials and logarithms
Another approach (probably only realistic in languages with fixed-width integer types) is to make lookup tables. (I.e. [0-1):0, [1-2): 1, [2-4):2, [4-9): 3, [9, 16): 4, [16-25): 5, [25-36): 6, [36-49): 7, to the desired length) which can then be binary-searched upon 2605:A601:A629:3300:9F2E:49BA:32DC:82AF (talk) 06:16, 9 November 2023 (UTC)
- Yes, the log and exponential method is commonly known, so it is a bit surprising that it isn't in the article. However, logs and exponentials take a relatively long time to calculate. Also, you want the result to be exactly the correct integer, so you have to be careful with the floating-point math, to make sure you get the right result.
- The table-lookup would be practical only for small ranges. For instance, if you wanted to take square roots of integers up to , the table would have entries, each four bytes, or 16GB in total. Bubba73 You talkin' to me? 07:51, 9 November 2023 (UTC)
The denormalization in Karatsuba's algorithm is wrong.
editIn line 59 of the algorithm the remainder is generated by using the remainder of the normalized and simply denormalize it using the bit shift.
return (s >> denormalization_shift, r >> denormalization_shift);
This is simply incorrect and I can explain: Suppose is the normalized . Then there exists a such that . The algorithm already gave us the and the remainder of . Then we have .
We can see the first problem at this point: The remainder needs to be shifted twice.
But there is an even bigger problem: is in general not devisable by . Suppose are the rightmost bits of and is the rest. We obtain . Note that has the same definition as the first return value. Everything right of it is the remainder we were looking for as our second value. Trainpilot (talk) 17:49, 11 September 2024 (UTC)