Jump to content

Year 2038 problem

From Wikipedia, the free encyclopedia
An animated visual of the bug in action. The overflow error will occur at 03:14:08 UTC on 19 January 2038.

The year 2038 problem (also known as Y2038,[1] Y2K38, Y2K38 superbug or the Epochalypse[2][3]) is a time computing problem that leaves some computer systems unable to represent times after 03:14:07 UTC on 19 January 2038.

The problem exists in systems which measure Unix time—the number of seconds elapsed since the Unix epoch (00:00:00 UTC on 1 January 1970)—and store it in a signed 32-bit integer. The data type is only capable of representing integers between −(231) and 231 − 1, meaning the latest time that can be properly encoded is 231 − 1 seconds after epoch (03:14:07 UTC on 19 January 2038). Attempting to increment to the following second (03:14:08) will cause the integer to overflow, setting its value to −(231) which systems will interpret as 231 seconds before epoch (20:45:52 UTC on 13 December 1901). The problem is similar in nature to the year 2000 problem, the difference being the Year 2000 problem had to do with base 10 numbers, whereas the Year 2038 problem involves base 2 numbers.

Analogous storage constraints will be reached in 2106, where systems storing Unix time as an unsigned (rather than signed) 32-bit integer will overflow on 7 February 2106 at 06:28:15 UTC.

Computer systems that use time for critical computations may encounter fatal errors if the year 2038 problem is not addressed. Some applications that use future dates have already encountered the bug.[4][5] The most vulnerable systems are those which are infrequently or never updated, such as legacy and embedded systems. Modern systems and software updates to legacy systems address this problem by using signed 64-bit integers instead of 32-bit integers, which will take 292 billion years to overflow—approximately 21 times the estimated age of the universe.

Cause

[edit]

Many computer systems measure time and date using Unix time, an international standard for digital timekeeping. Unix time is defined as the number of seconds elapsed since 00:00:00 UTC on 1 January 1970 (an arbitrarily chosen time based on the creation of the first Unix system), which has been dubbed the Unix epoch.[6]

Unix time has historically been encoded as a signed 32-bit integer, a data type composed of 32 binary digits (bits) which represent an integer value, with 'signed' meaning that the number can represent both positive and negative numbers, as well as zero; and is usually stored in two's complement format.[a] Thus, a signed 32-bit integer can only represent integer values from −(231) to 231 − 1 inclusive. Consequently, if a signed 32-bit integer is used to store Unix time, the latest time that can be stored is 231 − 1 (2,147,483,647) seconds after epoch, which is 03:14:07 on Tuesday, 19 January 2038.[7] Systems that attempt to increment this value by one more second to 231 seconds after epoch (03:14:08) will suffer integer overflow, inadvertently flipping the sign bit to indicate a negative number. This changes the integer value to −(231), or 231 seconds before epoch rather than after, which systems will interpret as 20:45:52 on Friday, 13 December 1901. From here, systems will continue to count up, toward zero, and then up through the positive integers again. As many computer systems use time computations to run critical functions, the bug may introduce serious problems.

Vulnerable systems

[edit]

Any system using data structures with signed 32-bit time representations has an inherent risk of failing. A full list of these data structures is virtually impossible to derive, but there are well-known data structures that have the Unix time problem:

  • File systems that use 32 bits to represent times in inodes
  • Binary file formats with 32-bit time fields
  • Databases with 32-bit time fields
  • Database query languages (such as SQL) that have UNIX_TIMESTAMP()-like commands

Embedded systems

[edit]

Embedded systems that use dates for either computation or diagnostic logging are most likely to be affected by the Y2038 problem.[1] Despite the modern 18–24 month generational update in computer systems technology, embedded systems are designed to last the lifetime of the machine in which they are a component. It is conceivable that some of these systems may still be in use in 2038. It may be impractical or, in some cases, impossible to upgrade the software running these systems, ultimately requiring replacement if the 32-bit limitations are to be corrected.

Many transportation systems from flight to automobiles use embedded systems extensively. In automotive systems, this may include anti-lock braking system (ABS), electronic stability control (ESC/ESP), traction control (TCS) and automatic four-wheel drive; aircraft may use inertial guidance systems and GPS receivers.[b] Another major use of embedded systems is in communications devices, including cell phones and Internet-enabled appliances (e.g. routers, wireless access points, IP cameras) which rely on storing an accurate time and date and are increasingly based on Unix-like operating systems. For example, the Y2038 problem makes some devices running 32-bit Android crash and not restart when the time is changed to that date.[8]

However, this does not imply that all embedded systems will suffer from the Y2038 problem, since many such systems do not require access to dates. For those that do, those systems which only track the difference between times/dates and not absolute times/dates will, by the nature of the calculation, not experience a major problem. This is the case for automotive diagnostics based on legislated standards such as CARB (California Air Resources Board).[9]

Early problems

[edit]

In May 2006, reports surfaced of an early manifestation of the Y2038 problem in the AOLserver software. The software was designed with a kludge to handle a database request that should "never" time out. Rather than specifically handling this special case, the initial design simply specified an arbitrary time-out date in the future with a default configuration specifying that requests should time out after a maximum of one billion seconds. However, one billion seconds before the 2038 cutoff date is 01:27:28 UTC on 13 May 2006, so requests sent after this time would result in a time-out date which is beyond the cutoff. This made time-out calculations overflow and return dates that were actually in the past, causing software to crash. When the problem was discovered, AOLServer operators had to edit the configuration file and set the time-out to a lower value.[4][5]

Solutions

[edit]

There is no universal solution for the Year 2038 problem. For example, in the C language, any change to the definition of the time_t data type would result in code-compatibility problems in any application in which date and time representations are dependent on the nature of the signed 32-bit time_t integer. Changing time_t to an unsigned 32-bit integer, which would extend the range to 2106[10] (specifically, 06:28:15 UTC on Sunday, 7 February 2106), would adversely affect programs that store, retrieve, or manipulate dates prior to 1970, as such dates are represented by negative numbers. Increasing the size of the time_t type to 64 bits in an existing system would cause incompatible changes to the layout of structures and the binary interface of functions.

Most operating systems designed to run on 64-bit hardware already use signed 64-bit time_t integers. Using a signed 64-bit value introduces a new wraparound date that is over twenty times greater than the estimated age of the universe: approximately 292 billion years from now.[11] The ability to make computations on dates is limited by the fact that tm_year uses a signed 32-bit integer value starting at 1900 for the year. This limits the year to a maximum of 2,147,485,547 (2,147,483,647 + 1900).[12]

Alternative proposals have been made (some of which are already in use), such as storing either milliseconds or microseconds since an epoch (typically either 1 January 1970 or 1 January 2000) in a signed 64-bit integer, providing a minimum range of 292,000 years at microsecond resolution.[13][14] In particular, Java's and JavaScript's use of 64-bit signed integers to represent absolute timestamps as "milliseconds since 1 January 1970" will work correctly for the next 292 million years. Other proposals for new time representations provide different precisions, ranges, and sizes (almost always wider than 32 bits), as well as solving other related problems, such as the handling of leap seconds. In particular, TAI64[15] is an implementation of the International Atomic Time (TAI) standard, the current international real-time standard for defining a second and frame of reference.

Implemented solutions

[edit]
  • Starting with Ruby version 1.9.2 (released on 18 August 2010), the bug with year 2038 is fixed,[16] by storing time in a signed 64-bit integer on systems with 32-bit time_t.[17]
  • Starting with NetBSD version 6.0 (released in October 2012), the NetBSD operating system uses a 64-bit time_t for both 32-bit and 64-bit architectures. Applications that were compiled for an older NetBSD release with 32-bit time_t are supported via a binary compatibility layer, but such older applications will still suffer from the Y2038 problem.[18]
  • OpenBSD since version 5.5, released in May 2014, also uses a 64-bit time_t for both 32-bit and 64-bit architectures. In contrast to NetBSD, there is no binary compatibility layer. Therefore, applications expecting a 32-bit time_t and applications using anything different from time_t to store time values may break.[19]
  • Linux originally used a 64-bit time_t for 64-bit architectures only; the pure 32-bit ABI was not changed due to backward compatibility.[20] Starting with version 5.6 of 2020, 64-bit time_t is supported on 32-bit architectures, too. This was done primarily for the sake of embedded Linux systems.[21]
  • GNU C Library since version 2.34 (released August 2021), added support for using 64-bit time_t on 32-bit platforms with appropriate Linux versions. This support can be activated by defining preprocessor macro _TIME_BITS to 64 when compiling source code.[22]
  • FreeBSD uses 64-bit time_t for all 32-bit and 64-bit architectures except 32-bit i386, which uses signed 32-bit time_t instead.[23]
  • The x32 ABI for Linux (which defines an environment for programs with 32-bit addresses but running the processor in 64-bit mode) uses a 64-bit time_t. Since it was a new environment, there was no need for special compatibility precautions.[20]
  • Network File System version 4 has defined its time fields as struct nfstime4 {int64_t seconds; uint32_t nseconds;} since December 2000.[24] Version 3 supports unsigned 32-bit values as struct nfstime3 {uint32 seconds; uint32 nseconds;};.[25] Values greater than zero for the seconds field denote dates after the 0-hour, January 1, 1970. Values less than zero for the seconds field denote dates before the 0-hour, January 1, 1970. In both cases, the nseconds (nanoseconds) field is to be added to the seconds field for the final time representation.
  • The ext4 filesystem, when used with inode sizes larger than 128 bytes, has an extra 32-bit field per timestamp, of which 30 bits are used for the nanoseconds part of the timestamp, and the other 2 bits are used to extend the timestamp range to the year 2446.[26]
  • The XFS filesystem, starting with Linux 5.10, has an optional "big timestamps" feature which extends the timestamp range to the year 2486.[27]
  • While the native APIs of OpenVMS can support timestamps up to 31 July 31086,[28] the C runtime library (CRTL) uses 32-bit integers for time_t.[29] As part of Y2K compliance work that was carried out in 1998, the CRTL was modified to use unsigned 32-bit integers to represent time; extending the range of time_t up to 7 February 2106.[30]
  • PostgreSQL since version 7.2, released 2002-02-04, stores timestamp WITHOUT TIMEZONE as 64-bit.[31][failed verification] Prior versions already stored timestamp as 64-bit.[citation needed]
  • As of MySQL 8.0.28, released in January 2022, the functions FROM_UNIXTIME(), UNIX_TIMESTAMP(), and CONVERT_TZ() handle 64-bit values on platforms that support them. This includes 64-bit versions of Linux, macOS, and Windows.[32][33] In older versions, built-in functions like UNIX_TIMESTAMP() will return 0 after 03:14:07 UTC on 19 January 2038.[34]
  • As of MariaDB 11.5.1, released in May 2024, the data type TIMESTAMP and functions FROM_UNIXTIME(), UNIX_TIMESTAMP(), and CONVERT_TZ() handle unsigned 32-bit values on 64-bit versions of Linux, macOS, and Windows.[35] This extended the range to 2106-02-07 06:28:15 and allowed users to store such timestamp values in tables without changing the storage layout and thus staying fully compatible with existing user data.
  • Starting with Visual C++ 2005, the CRT uses a 64-bit time_t unless the _USE_32BIT_TIME_T preprocessor macro is defined.[36] However, the Windows API itself is unaffected by the year 2038 bug, as Windows internally tracks time as the number of 100-nanosecond intervals since 1 January 1601 in a 64-bit signed integer, which will not overflow until year 30,828.[37]

See also

[edit]

Notes

[edit]
  1. ^ Unless otherwise specified, all the numbers provided in this article have been derived using two's complement for signed integer arithmetic.
  2. ^ GPS suffers its own time counter overflow problem known as GPS Week Number Rollover.

References

[edit]
  1. ^ a b "Is the Year 2038 problem the new Y2K bug?". The Guardian. 17 December 2014. Archived from the original on 25 January 2022. Retrieved 11 October 2018.
  2. ^ Bergmann, Arnd (6 February 2020). "The end of an Era". Linaro. Archived from the original on 7 February 2020. Retrieved 13 September 2020.
  3. ^ Wagenseil, Paul (28 July 2017). "Digital 'Epochalypse' Could Bring World to Grinding Halt". Tom's Guide. Archived from the original on 29 November 2021. Retrieved 13 September 2020.
  4. ^ a b "The Future Lies Ahead". 28 June 2006. Archived from the original on 28 November 2006. Retrieved 19 November 2006.
  5. ^ a b Weird "memory leak" problem in AOLserver 3.4.2/3.x Archived 4 January 2010 at the Wayback Machine 12 May 2006
  6. ^ "Epoch Time". unixtutoria. 15 March 2019. Archived from the original on 13 April 2023. Retrieved 13 April 2023.
  7. ^ Diomidis Spinellis (2006). Code quality: the open source perspective. Effective software development series in Safari Books Online (illustrated ed.). Adobe Press. p. 49. ISBN 978-0-321-16607-4.
  8. ^ "ZTE Blade running Android 2.2 has 2038 problems". Archived from the original on 19 May 2022. Retrieved 20 November 2018.
  9. ^ "ARB Test Methods / Procedures". ARB.ca.gov. California Air Resources Board. Archived from the original on 18 November 2016. Retrieved 12 September 2013.
  10. ^ "DRAFT: Y2038 Proofness Design". Archived from the original on 21 September 2019. Retrieved 25 May 2024.
  11. ^ "When does the 64-bit Unix time_t really end?". Archived from the original on 23 September 2022. Retrieved 24 September 2022.
  12. ^ Felts, Bob (17 April 2010). "The End of Time". Stablecross.com. Archived from the original on 11 October 2012. Retrieved 19 March 2012.
  13. ^ "Unununium Time". Archived from the original on 8 April 2006. Retrieved 19 November 2006.
  14. ^ Sun Microsystems. "Java API documentation for System.currentTimeMillis()". Archived from the original on 30 September 2017. Retrieved 29 September 2017.
  15. ^ "TAI64". Archived from the original on 26 September 2012. Retrieved 4 September 2012.
  16. ^ "Ruby 1.9.2 is released". 18 August 2010. Archived from the original on 8 April 2022. Retrieved 1 April 2022.
  17. ^ "time.c: use 64bit arithmetic even on platforms with 32bit VALUE". GitHub. Archived from the original on 3 November 2023. Retrieved 3 November 2023.
  18. ^ "Announcing NetBSD 6.0". 17 October 2012. Archived from the original on 15 January 2016. Retrieved 18 January 2016.
  19. ^ "OpenBSD 5.5 released (May 1, 2014)". 1 May 2014. Archived from the original on 22 December 2015. Retrieved 18 January 2016.
  20. ^ a b Jonathan Corbet (14 August 2013). "Pondering 2038". LWN.net. Archived from the original on 4 March 2016. Retrieved 9 March 2016.
  21. ^ "LKML: Arnd Bergmann: [GIT PULL] y2038: core, driver and file system changes". lkml.org. Archived from the original on 14 February 2020. Retrieved 30 January 2020.
  22. ^ O'Donell, Carlos (2 August 2021). "The GNU C Library version 2.34 is now available". Sourceware. Archived from the original on 30 April 2024. Retrieved 30 April 2024.
  23. ^ "arch". www.freebsd.org. Archived from the original on 26 September 2018. Retrieved 26 September 2018.
  24. ^ Haynes, Thomas; Noveck, David, eds. (March 2015). "Structured Data Types". Network File System (NFS) Version 4 Protocol. sec. 2.2. doi:10.17487/RFC7530. RFC 7530.
  25. ^ Staubach, Peter; Pawlowski, Brian; Callaghan, Brent (June 1995). "NFS Version 3 Protocol Specification". Retrieved 25 May 2024.
  26. ^ "ext4 Data Structures and Algorithms". Archived from the original on 13 September 2022. Retrieved 13 September 2022.
  27. ^ Michael Larabel (15 October 2020). "XFS File-System With Linux 5.10 Punts Year 2038 Problem To The Year 2486". Phoronix. Archived from the original on 13 September 2022. Retrieved 13 September 2022.
  28. ^ "Why is Wednesday, November 17, 1858 the base time for OpenVMS (VAX VMS)?". Stanford University. 24 July 1997. Archived from the original on 24 July 1997. Retrieved 8 January 2020.
  29. ^ "VSI C Run-Time Library Reference Manual for OpenVMS Systems" (PDF). VSI. November 2020. Archived from the original (PDF) on 17 April 2021. Retrieved 17 April 2021.
  30. ^ "OpenVMS and the year 2038". HP. Archived from the original on 17 April 2021. Retrieved 17 April 2021.
  31. ^ "PostgreSQL Release 7.2". January 2012. Archived from the original on 26 April 2024. Retrieved 25 April 2024.
  32. ^ "What Is New in MySQL 8.0". dev.mysql.com.
  33. ^ "Changes in MySQL 8.0.28 (2022-01-18, General Availability)". dev.mysql.com. Archived from the original on 8 December 2023. Retrieved 14 May 2024.
  34. ^ "MySQL Bugs: #12654: 64-bit unix timestamp is not supported in MySQL functions". bugs.mysql.com. Archived from the original on 29 March 2017. Retrieved 28 March 2017.
  35. ^ "MariaDB 11.5.1 Release Notes".
  36. ^ "Microsoft C/C++ change history 2003 - 2015". learn.microsoft.com. 25 May 2023. Retrieved 13 August 2024.
  37. ^ "About Time - Win32 apps". learn.microsoft.com. 7 January 2021. Retrieved 13 August 2024.
[edit]