Do file-extensions have any purpose (for the operating system)?
![Creative The name of the picture](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgO9GURib1T8z7lCwjOGLQaGtrueEthgQ8LO42ZX8cOfTqDK4jvDDpKkLFwf2J49kYCMNW7d4ABih_XCb_2UXdq5fPJDkoyg7-8g_YfRUot-XnaXkNYycsNp7lA5_TW9td0FFpLQ2APzKcZ/s1600/1.jpg)
![Creative The name of the picture](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhYQ0N5W1qAOxLP7t7iOM6O6AzbZnkXUy16s7P_CWfOb5UbTQY_aDsc727chyphenhyphen5W4IppVNernMMQeaUFTB_rFzAd95_CDt-tnwN-nBx6JyUp2duGjPaL5-VgNO41AVsA_vu30EJcipdDG409/s400/Clash+Royale+CLAN+TAG%2523URR8PPP.png)
up vote
66
down vote
favorite
Linux determines the type of a file via a code in the file-header. It doesn't depend on file-extensions for to know which software is to use for opening the file.
That's what I remember from my education. Please correct me in case I'm wrong!
Working a bit with Ubuntu systems recently: I see a lot of files on the systems which have extensions like .sh
, .txt
, .o
, .c
Now I'm wondering: Are these extensions are meant only for humans? So that one should get an idea what kind of file it is?
Or does they have some purpose for the operating-system too?
files file-format mime-type
 |Â
show 5 more comments
up vote
66
down vote
favorite
Linux determines the type of a file via a code in the file-header. It doesn't depend on file-extensions for to know which software is to use for opening the file.
That's what I remember from my education. Please correct me in case I'm wrong!
Working a bit with Ubuntu systems recently: I see a lot of files on the systems which have extensions like .sh
, .txt
, .o
, .c
Now I'm wondering: Are these extensions are meant only for humans? So that one should get an idea what kind of file it is?
Or does they have some purpose for the operating-system too?
files file-format mime-type
5
If you don't get a good response here, remember there is also unix.stackexchange.com
â mchid
Jul 27 '16 at 6:51
@mchid Thanks a lot. That's very kind of you.
â mizech
Jul 27 '16 at 6:52
Related, almost duplicate: askubuntu.com/questions/390015/â¦
â Zzzach...
Jul 27 '16 at 7:01
5
In Windows they do, in Linux/Unix they mostly don't. The main exception are the compression-programs -gzip
,bzip2
,xz
- and so on. These programs uses suffixes to separate the compressed version of a file from the uncompressed one they replace. Compression-programs will often complain about incorrect suffix, even though the file actually is a compressed file of the type it should handle.
â Baard Kopperud
Jul 27 '16 at 10:34
5
I think part of the problem with this question is that "the operating-system" is not a well-defined concept. What is part of the operating system, and what is an application on top of it? Not many parts of the OS (whichever OS we're talking about) care what type a file is - they just do what they're told. So distinctions about how they know are irrelevant; they do neither. Applcations, on the other hand, may well do one or both things.
â IMSoP
Jul 27 '16 at 17:23
 |Â
show 5 more comments
up vote
66
down vote
favorite
up vote
66
down vote
favorite
Linux determines the type of a file via a code in the file-header. It doesn't depend on file-extensions for to know which software is to use for opening the file.
That's what I remember from my education. Please correct me in case I'm wrong!
Working a bit with Ubuntu systems recently: I see a lot of files on the systems which have extensions like .sh
, .txt
, .o
, .c
Now I'm wondering: Are these extensions are meant only for humans? So that one should get an idea what kind of file it is?
Or does they have some purpose for the operating-system too?
files file-format mime-type
Linux determines the type of a file via a code in the file-header. It doesn't depend on file-extensions for to know which software is to use for opening the file.
That's what I remember from my education. Please correct me in case I'm wrong!
Working a bit with Ubuntu systems recently: I see a lot of files on the systems which have extensions like .sh
, .txt
, .o
, .c
Now I'm wondering: Are these extensions are meant only for humans? So that one should get an idea what kind of file it is?
Or does they have some purpose for the operating-system too?
files file-format mime-type
edited Jul 27 '16 at 15:22
asked Jul 27 '16 at 6:46
mizech
512157
512157
5
If you don't get a good response here, remember there is also unix.stackexchange.com
â mchid
Jul 27 '16 at 6:51
@mchid Thanks a lot. That's very kind of you.
â mizech
Jul 27 '16 at 6:52
Related, almost duplicate: askubuntu.com/questions/390015/â¦
â Zzzach...
Jul 27 '16 at 7:01
5
In Windows they do, in Linux/Unix they mostly don't. The main exception are the compression-programs -gzip
,bzip2
,xz
- and so on. These programs uses suffixes to separate the compressed version of a file from the uncompressed one they replace. Compression-programs will often complain about incorrect suffix, even though the file actually is a compressed file of the type it should handle.
â Baard Kopperud
Jul 27 '16 at 10:34
5
I think part of the problem with this question is that "the operating-system" is not a well-defined concept. What is part of the operating system, and what is an application on top of it? Not many parts of the OS (whichever OS we're talking about) care what type a file is - they just do what they're told. So distinctions about how they know are irrelevant; they do neither. Applcations, on the other hand, may well do one or both things.
â IMSoP
Jul 27 '16 at 17:23
 |Â
show 5 more comments
5
If you don't get a good response here, remember there is also unix.stackexchange.com
â mchid
Jul 27 '16 at 6:51
@mchid Thanks a lot. That's very kind of you.
â mizech
Jul 27 '16 at 6:52
Related, almost duplicate: askubuntu.com/questions/390015/â¦
â Zzzach...
Jul 27 '16 at 7:01
5
In Windows they do, in Linux/Unix they mostly don't. The main exception are the compression-programs -gzip
,bzip2
,xz
- and so on. These programs uses suffixes to separate the compressed version of a file from the uncompressed one they replace. Compression-programs will often complain about incorrect suffix, even though the file actually is a compressed file of the type it should handle.
â Baard Kopperud
Jul 27 '16 at 10:34
5
I think part of the problem with this question is that "the operating-system" is not a well-defined concept. What is part of the operating system, and what is an application on top of it? Not many parts of the OS (whichever OS we're talking about) care what type a file is - they just do what they're told. So distinctions about how they know are irrelevant; they do neither. Applcations, on the other hand, may well do one or both things.
â IMSoP
Jul 27 '16 at 17:23
5
5
If you don't get a good response here, remember there is also unix.stackexchange.com
â mchid
Jul 27 '16 at 6:51
If you don't get a good response here, remember there is also unix.stackexchange.com
â mchid
Jul 27 '16 at 6:51
@mchid Thanks a lot. That's very kind of you.
â mizech
Jul 27 '16 at 6:52
@mchid Thanks a lot. That's very kind of you.
â mizech
Jul 27 '16 at 6:52
Related, almost duplicate: askubuntu.com/questions/390015/â¦
â Zzzach...
Jul 27 '16 at 7:01
Related, almost duplicate: askubuntu.com/questions/390015/â¦
â Zzzach...
Jul 27 '16 at 7:01
5
5
In Windows they do, in Linux/Unix they mostly don't. The main exception are the compression-programs -
gzip
, bzip2
, xz
- and so on. These programs uses suffixes to separate the compressed version of a file from the uncompressed one they replace. Compression-programs will often complain about incorrect suffix, even though the file actually is a compressed file of the type it should handle.â Baard Kopperud
Jul 27 '16 at 10:34
In Windows they do, in Linux/Unix they mostly don't. The main exception are the compression-programs -
gzip
, bzip2
, xz
- and so on. These programs uses suffixes to separate the compressed version of a file from the uncompressed one they replace. Compression-programs will often complain about incorrect suffix, even though the file actually is a compressed file of the type it should handle.â Baard Kopperud
Jul 27 '16 at 10:34
5
5
I think part of the problem with this question is that "the operating-system" is not a well-defined concept. What is part of the operating system, and what is an application on top of it? Not many parts of the OS (whichever OS we're talking about) care what type a file is - they just do what they're told. So distinctions about how they know are irrelevant; they do neither. Applcations, on the other hand, may well do one or both things.
â IMSoP
Jul 27 '16 at 17:23
I think part of the problem with this question is that "the operating-system" is not a well-defined concept. What is part of the operating system, and what is an application on top of it? Not many parts of the OS (whichever OS we're talking about) care what type a file is - they just do what they're told. So distinctions about how they know are irrelevant; they do neither. Applcations, on the other hand, may well do one or both things.
â IMSoP
Jul 27 '16 at 17:23
 |Â
show 5 more comments
7 Answers
7
active
oldest
votes
up vote
37
down vote
accepted
Linux determines the type of a file via a code in the file header. It doesn't depend on file extensions for to know with software is to use for opening the file.
That's what I remember from my education. Please correct me in case I'm wrong!
- correctly remembered.
Are these extensions are meant only for humans?
- Yes, with a but.
When you interact with other operating systems that do depend on extensions being what they are it is the smarter idea to use those.
In Windows, opening software is attached to the extensions.
Opening a text file named "file" is harder in Windows than opening the same file named "file.txt" (you will need to switch the file open dialog from *.txt
to *.*
every time). The same goes for TAB and semi-colon separated text files. The same goes for importing and exporting e-mails (.mbox extension).
In particular when you code software. Opening a file named "software1" that is an HTML file and "software2" that is a JavaScript file becomes more difficult compared to "software.html" and "software.js".
If there is a system in place in Linux where file extensions are important, I would call that a bug. When software depends on file extensions, that is exploitable. We use an interpreter directive to identify what a file is ("the first two bytes in a file can be the characters "#!", which constitute a magic number (hexadecimal 23 and 21, the ASCII values of "#" and "!") often referred to as shebang,").
The most famous problem with file extensions was LOVE-LETTER-FOR-YOU.TXT.vbs on Windows. This is a visual basic script being shown in file explorer as a text file.
In Ubuntu when you start a file from Nautilus you get a warning what it is going to do. Executing a script from Nautilus where it wants to start some software where it is supposed to open gEdit is obvious a problem and we get a warning about it.
In command line when you execute something, you can visually see what the extension is. If it ends on .vbs I would start to become suspicious (not that .vbs is executable on Linux. At least not without some more effort ;) ).
30
I completely don't get what you wanted to say in your last sentence. First, it is a problem of hiding the extension rather than having it, second the exploit would work the same in Linux - you name a binary filereadme.txt
and make it executable. If user executed it, it does not open the editor, but runs the code. In this respect making extensions matter (but not hiding them) is more secure and easier to explain for non-savvy users. There are other differences (most notably not executing files from the current directory), but they have nothing to do with extensions.
â techraf
Jul 27 '16 at 8:07
3
@techraf Actually the file manager will probably try to open thereadme.txt
file with a text editor. I just tried with dolphin in KDE, creating a shell script adding executable permission, saving it as.txt
and clicking on it will make it open in Kate. If I rename it to.sh
then clicking on it runs it.
â Bakuriu
Jul 27 '16 at 11:22
9
linux: since make is build around rules that depend on the file extension, wouldn't this make (no pun intended) the extensions meant for more than just humans?
â bolov
Jul 27 '16 at 13:08
14
This is a monumentally wrong answer. Some parts of Linux use magic numbers to determine file types. Executing files at the command line. But other huge parts of the system use file extensions to know what to look at, whether those be the dynamic linker (which wants .so files), modprobe, build systems, plugins, libraries for python, ruby, etc. Many files don't have magic numbers,file
is heuristic-based, not definite.
â Alan Shutko
Jul 29 '16 at 0:08
3
"Linux determines the type of a file via a code in the file header" "correct" WTF? What "code in the file header" ? There is no such code, and there is no such a generic "file header" in Linux.
â leonbloy
Jul 30 '16 at 18:37
 |Â
show 15 more comments
up vote
64
down vote
There is no 100% black or white answer here.
Usually Linux does not rely on file names (and file extensions i.e. the part of the file name after the normally last period) and instead determines the file type by examining the first few bytes of its content and comparing that to a list of known magic numbers.
For example all Bitmap image files (usually with name extension .bmp
) must start with the letters BM
in their first two bytes. Scripts in most scripting languages like Bash, Python, Perl, AWK, etc. (basically everything that treats lines starting with #
as comment) may contain a shebang like #!/bin/bash
as first line. This special comment tells the system with which application to open the file.
So normally the operating system relies on the file content and not its name to determine the file type, but stating that file extensions are never needed on Linux is only half of the truth.
Applications may of course implement their file checks however they want, which includes verifying the file name and extension. An example is the Eye of Gnome (eog
, standard picture viewer) which determines the image format by the file extension and throws an error if it does not match the content. Whether this is a bug or a feature can be discussed...
However, even some parts of the operating system rely on file name extensions, e.g. when parsing your software sources files in /etc/apt/sources.list.d/
- only files with the *.list
extension get parsed all others are ignored. It's maybe not mainly used to determine the file type here but rather to enable/disable parsing of some files, but it's still a file extension that affects how the system treats a file.
And of course the human user profits most from file extensions as that makes the type of a file obvious and also allows multiple files with the same base name and different extensions like site.html
, site.php
, site.js
, site.css
etc. The disadvantage is of course that file extension and the actual file type/content do not necessarily have to match.
Additionally it's needed for cross-platform interoperability, as e.g. Windows will not know what to do with a readme
file, but only a readme.txt
.
You slightly contradict yourself here: if the standard image viewer requires a filename ending .bmp, what part of the OS are you saying relies on the file content starting "BM"? AFAIK, the only "magic numbers the kernel cares about are executable types, including the special case of#!
. Everything else is up to some application's decision.
â IMSoP
Jul 27 '16 at 18:20
@IMSoP I don't know the exact implementation ofeog
and I don't know why they care about the file name at all. This is a bug in my opinion. And of course if the file is named "bmp" but its content format does not match, there will be an error as well, of course. Of course each application decides how to verify files, but in general Linux applications should not rely on the name. Btw, you can use thefile
commend to examine file types by their content.
â Byte Commander
Jul 27 '16 at 19:12
1
The sentence I am challenging is this: "Linux ... determines the file type by examining the first few bytes". What definition of "Linux" are you using in that sentence? The existence of thefile
utility doesn't really prove anything; it's a useful tool, that could exist on any OS. What fundamental part of the OS makes runningfile
any more "correct" than globbing the file name?
â IMSoP
Jul 27 '16 at 21:51
Note that files without an extension can be associated with a program.
â isanae
Jul 28 '16 at 2:09
add a comment |Â
up vote
23
down vote
As mentioned by others, in Linux an interpreter directive method is used (storing some metadata in a file as a header or magic number so the correct interpreter can be told to read it) rather than the filename extension association method used by Windows.
This means you can create a file with almost any name you like... with a few exceptions
However
I would like to add a word of caution.
If you have some files on your system from a system that uses filename association, the files may not have those magic numbers or headers. Filename extensions are used to identify these files by applications that are able to read them, and you may experience some unexpected effects if you rename such files. For example:
If you rename a file My Novel.doc
to My-Novel
, Libreoffice will still be able to open it, but it will open as 'Untitled' and you will have to name it again in order to save it (Libreoffice adds an extension by default, so you would then have two files My-Novel
and My-Novel.odt
, which could be annoying)
More seriously, if you rename a file My Spreadsheet.xlsx to My-Spreadsheet, then try to open it with xdg-open My-Spreadsheet
you will get this (because it's actually a compressed file):
And if you rename a file My Spreadsheet.xls
to My-Spreadsheet
, when you xdg-open My-Spreadsheet
you get an error saying
error opening location: No application is registered as handling this file
(Although in both these cases it works OK if you do soffice My-Spreadsheet
)
If you then rename the extensionless file to My-Spreadsheet.ods
with mv
and try to open it you will get this:
(repair fails)
And you will have to put the original extension back on to open the file correctly (you can then convert the format if you wish)
TL;DR:
If you have non-native files with name extensions, don't remove the extensions assuming everything will be OK!
3
A new-style MS Office document (docx, xlsx, pptx etc) without file extension opens in the archive manager because those file types are actually just ordinary ZIP compressed files which contain all the XML documents and media files necessary to define the document content. The file format of a ZIP compressed directory is pretty common nowadays btw.
â Byte Commander
Jul 27 '16 at 8:21
1
Already many great answers, but just one more specific to libreoffice that I've noticed. You create a file of comma separated values (CSV) and save it as "test.csv", a window will open asking what type of separator are you using (i.e., libreoffice Calc). If you rename this file to "test.cs", for example, then libreoffice's Writer opens it. So, besides the ZIP example above, it does seem like libreoffice does make use of the file extension.
â Ray
Jul 27 '16 at 8:31
2
The linux filesystem doesn't do anything regarding file types. That is all down to the programs running on top of it.
â Peter Green
Jul 27 '16 at 15:29
@PeterGreen Yes, but the fact that the programs do assign it significance means it's not "just for humans" the way, e.g., classic MacOS had it [there were four-byte "file type" and "creator app" fields that weren't part of the file name, so the OS and applications had all the information they needed without looking at file extensions]
â Random832
Jul 27 '16 at 15:36
3
@PeterGreen The Windows filesystem doesn't do anything regarding file types either. The graphical shell (Windows Explorer) uses file extension to choose an action for double-click, but technically that's just a program running on top of the OS, just as Nautilus is. It would be perfectly possible to write a Linux file manager with that behaviour, or a Windows one which examined the file contents.
â IMSoP
Jul 27 '16 at 17:19
add a comment |Â
up vote
19
down vote
I'd like to take a different approach to this from other answers, and challenge the notion that "Linux" or "Windows" have anything to do with this (bear with me).
The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". The other common conventions for identifying the type of a file are comparing its contents against a database of known signatures (the "magic number" approach), and storing it as an extra attribute on the file system (the approach used in the original MacOS).
Since every file on a Windows or a Linux system has both a name and contents, processes which want to know the file type can use either the "extension" or the "magic number" approaches as they see fit. The metadata approach is not generally available, as there is no standard place for this attribute on most file systems.
On Windows, there is a strong tradition of using the file extension as the primary means of identifying a file; most visibly, the graphical file browser (File Manager on Windows 3.1 and Explorer on modern Windows) uses it when you double-click on a file to determine which application to launch. On Linux (and, more generally, Unix-based systems), there is more tradition for inspecting the contents; most notably, the kernel looks at the beginning of a file executed directly to determine how to run it; script files can indicate an interpreter to use by starting with #!
followed by the path to the interpreter.
These traditions influence UI design of programs written for each system, but there are plenty of exceptions, because each approach has pros and cons in different situations. Reasons to use file extensions rather than examining contents include:
- examining file contents is fairly costly compared to examining file names; so for instance "find all files named *.conf" will be a lot quicker than "find all files whose first line matches this signature"
- file contents can be ambiguous; many file formats are actually just text files treated in a special way, many others are specially-structured zip files, and defining accurate signatures for these can be tricky
- a file can genuinely be valid as more than one type; an HTML file may also be valid XML, a zip file and a GIF concatenated together remain valid for both formats
- magic number matching might lead to false positives; a file format that has no header might happen to begin with the bytes "GIF89a" and be misidentified as a GIF image
- renaming a file can be a convenient way to mark it as "disabled"; e.g. changing "foo.conf" to "foo.conf~" to indicate a backup is easier than editing the file to comment out all of its directives, and more convenient than moving it out of an autoloaded directory; similarly, renaming a .php file to .txt will tell Apache to serve its source as plain text, rather than passing it to the PHP engine
Examples of Linux programs which use file names by default (but may have other modes):
- gzip and gunzip have special handling of any file ending ".gz"
- gcc will handle ".c" files as C, and ".cc" or ".C" as C++
Windows also has a strong tradition of hiding the extension if it's "well known" and even DOS allowed a command to omit .COM, .BAT, and .EXE, automatically searching for those to determine what actual program to execute. There is no such tradition in *nix.
â Monty Harder
Jul 28 '16 at 21:59
1
This should be the accepted answer.
â Ave
Jul 31 '16 at 13:29
This is a much better answer but has one factual error... a script cannot be made executable by placing#!
at the beginning. Any file with its executable bit(s) set can executed in one of several ways.#!/bin/bash
and similar signatures just specify which interpreter to use. If no such signature is supplied, the default shell interpreter is assumed. A file containing nothing but the two words 'Hello World', but with its execution bit set, will attempt to find a 'Hello' command when run.
â DocSalvager
Aug 2 '16 at 21:33
1
@DocSalvager Good catch, that was clumsy wording as much as anything. I've reworded it a bit to make clear that the shebang doesn't make the script executable, it just changes how it is executed.
â IMSoP
Aug 2 '16 at 22:08
add a comment |Â
up vote
14
down vote
Actually, some technologies do rely on file extensions, so if you use those technologies in Ubuntu, you'll have to rely on extensions too. A few examples:
gcc
uses extensions to distinguish between C an C++ files. Without the extension it's pretty much impossible to differentiate them (imagine a C++ file with no classes).- many files (
docx
,jar
,apk
) are just particularly structured ZIP archives. While you can usually infer the type from the content, it may not always be possible (e.g. Java Manifest is optional injar
files).
Not using file extensions in such cases will only be possible with hacky workarounds and is likely to be very error-prone.
Good on you for mentioning programming, but you got most of the details wrong.gcc
is the front-end for C files, for C++ files you need either theg++
front-end or a command-line switch to specify language. More important is themake
program that decides whether to usegcc
org++
to build a particular file -- andmake
is completely dependent on filename patterns (mostly extensions) for its rule-matching.
â Ben Voigt
Jul 29 '16 at 18:50
@BenVoigt When compiling a file with a.cc
extension withgcc
, it really will be compiled as C++, and this is documented inman gcc
: "For any given input file, the file name suffix determines what kind of compilation is done:" followed by a list of extensions and how they are treated.
â hvd
Jul 30 '16 at 10:53
1
@hvd Then maybe it's the default set of libraries that goes horribly wrong if you don't use the right frontend. Anyway make is the prime example because everything it does is based on file extension.
â Ben Voigt
Jul 30 '16 at 13:28
1
@BenVoigtmake
is a good example too, butgcc
relies just as heavily on filenames. Here's an example clearer than.c
vs.cc
: For C,gcc
uses suffixes to tell if its first step is to preprocess (.c
), compile (.i
), assemble (.s
), or link (.o
). Here, I use-E
,-S
and-c
to tellgcc
where to stop, but it uses filenames to know where to start.gcc something.cc
won't link to the right libraries for C++ but it will treat the file as C++, which is why many users are confused by the error messages they get when making that mistake.
â Eliah Kagan
Jan 24 '17 at 13:56
add a comment |Â
up vote
7
down vote
Your first assumption is correct: the extensions on Linux do not matter and only are useful for humans( and other non-Unix-like OS that care about extensions ). The type of a file is determined by first 32 bits of data in the file , which is known as magic number
This is why shell scripts need #!
line - to tell operating system what interpreter to call. Without it , the shell script is just text file.
As far as file managers go, they do want to know extensions of some files, such as .desktop
files , which basically same as Window's version of shortcuts but with more capabilities. But as far as OS is concerned, it needs to know what's in the file, not what's in its name
3
This isn't quite true. There are programs that expect a specific extension. The most commonly used example is probablygunzip
which won't decompress a file if it isn't calledfoo.gz
.
â terdonâ¦
Jul 27 '16 at 9:27
That's an implementation of specific software. For the most part, utilities on unix-like systems don't expect an extension.
â Sergiy Kolodyazhnyy
Jul 27 '16 at 9:36
7
For the most part they don't, no. Your first sentence, however, claims that they are never used and only matter to humans. That isn't entirely true.gunzip
is one example,eog
is another. Also, many tools won't autocomplete names without the right extension. All I'm saying is that it's a bit more complicated than "extensions are always irrelevant".
â terdonâ¦
Jul 27 '16 at 9:40
1
1 small issue: OP asked about the operating system. 'gunzip' and 'eog' are not the operating system but decided to create their own restrictions (in case of gunzip) or method (eog). "mime types" though.
â Rinzwind
Jul 27 '16 at 17:49
1
@Serg Sure, you can define OS narrowly, and get a trivial answer to the question. It's not a particularly helpful answer, though, because the vast majority of what a user does with a computer involves software you've excluded. Note that the question contrasted "only for humans" against "the operating-system"; I don't think they meant "the kernel".
â IMSoP
Jul 29 '16 at 9:06
 |Â
show 7 more comments
up vote
5
down vote
This is a too big for a comment answer.
Keep in mind that even "extension" has a lot if different meanings.
What your talking about seems to be the 3 letters after the . DOS made the 8.3 format really popular and windows uses the .3 part to this day.
Linux has a lot of files like .conf or .list or .d or .c that have meaning, but are not really extensions in the 8.3 sense. For example Apache looks at /etc/apache2/sites-enabled/website.conf for it's configuration directive. While the system uses MIME Types and content headers and what not to determine it's a text file, Apache (by default) still isn't going to load it without it ending in .conf.
.c is another great one. Yep it's a text file, but gcc depends on main.c becoming main.o and finally main (after linking). At no time does the system use the .c, .o or no extension to have any meaning as far as content, but the stuff after the . does have some meaning. You would probably setup your SCM to ignore main.o and main.
Point being is this: Extensions are not used the way they are in windows. The kernel will not execute a .txt file because you remove the .txt part of the name. It is also very happy to execute a .txt file if the execute permission is set. That being said, they do have meaning, and are still used on a "computer level" for many things.
1
Windows is also not bound to thex.3
naming scheme any more, you have got longer extensions there as well like.doxc
,.torrent
,.part
, etc. It's just that many file formats and extensions were already defined back in the time when 8.3 naming was still a thing and later formats mostly simply adapted the convention of using up to 3 letters.
â Byte Commander
Jul 27 '16 at 10:07
I don't see how ".conf", ".c", etc, are "a different meaning" from "the 8.3 sense". The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". Not even DOS / Win3.1 required the correct extension (you could call a Word document "STUPIDN.AME" and open it with Ctrl-O in WinWord). It's just that some systems (e.g. double-click on Windows,gzip
, your Makefile, etc) may be written to use this convention to make assumptions about the correct action to take on each file.
â IMSoP
Jul 27 '16 at 16:42
@ByteCommander That's true, but the extension still determines the app used. I am not sure how to edit the answer to reflect that.
â coteyr
Jul 27 '16 at 20:16
1
@coteyr Again, it all depends what we mean by "the OS". The File Manager will certainly look up a registry key for "AME", and will tell me that "foo.txt" is a text file. But runningdir
at a command prompt will tell me no such thing; it simply won't care. Executing files is certainly an exception, on both OSes; if the question was limited to those, the answer would be that DOS/Windows only care about the name, and Unix/Linux only care about the execute permission and the first bytes of the file. Outside of that, there is always some application choosing a convention to follow.
â IMSoP
Jul 27 '16 at 21:07
1
@coteyr You forgot *.scr (screen saver binary) in Windows 3.1 and up. That said, the file extension even in DOS/Windows systems even for executables is still just a convenience. The specifics depend very much on where you draw the line of "operating system", but you can always load a binary into memory and jump into it yourself, doing the work that one normally asks of the OS to do. In MS-DOS, if you look through command.com, I'm pretty sure there's a list like EXE COM that you can edit such that it looks for other extensions if none is specified (not saying it'd be a good idea, mind you).
â Michael Kjörling
Jul 30 '16 at 11:59
 |Â
show 1 more comment
7 Answers
7
active
oldest
votes
7 Answers
7
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
37
down vote
accepted
Linux determines the type of a file via a code in the file header. It doesn't depend on file extensions for to know with software is to use for opening the file.
That's what I remember from my education. Please correct me in case I'm wrong!
- correctly remembered.
Are these extensions are meant only for humans?
- Yes, with a but.
When you interact with other operating systems that do depend on extensions being what they are it is the smarter idea to use those.
In Windows, opening software is attached to the extensions.
Opening a text file named "file" is harder in Windows than opening the same file named "file.txt" (you will need to switch the file open dialog from *.txt
to *.*
every time). The same goes for TAB and semi-colon separated text files. The same goes for importing and exporting e-mails (.mbox extension).
In particular when you code software. Opening a file named "software1" that is an HTML file and "software2" that is a JavaScript file becomes more difficult compared to "software.html" and "software.js".
If there is a system in place in Linux where file extensions are important, I would call that a bug. When software depends on file extensions, that is exploitable. We use an interpreter directive to identify what a file is ("the first two bytes in a file can be the characters "#!", which constitute a magic number (hexadecimal 23 and 21, the ASCII values of "#" and "!") often referred to as shebang,").
The most famous problem with file extensions was LOVE-LETTER-FOR-YOU.TXT.vbs on Windows. This is a visual basic script being shown in file explorer as a text file.
In Ubuntu when you start a file from Nautilus you get a warning what it is going to do. Executing a script from Nautilus where it wants to start some software where it is supposed to open gEdit is obvious a problem and we get a warning about it.
In command line when you execute something, you can visually see what the extension is. If it ends on .vbs I would start to become suspicious (not that .vbs is executable on Linux. At least not without some more effort ;) ).
30
I completely don't get what you wanted to say in your last sentence. First, it is a problem of hiding the extension rather than having it, second the exploit would work the same in Linux - you name a binary filereadme.txt
and make it executable. If user executed it, it does not open the editor, but runs the code. In this respect making extensions matter (but not hiding them) is more secure and easier to explain for non-savvy users. There are other differences (most notably not executing files from the current directory), but they have nothing to do with extensions.
â techraf
Jul 27 '16 at 8:07
3
@techraf Actually the file manager will probably try to open thereadme.txt
file with a text editor. I just tried with dolphin in KDE, creating a shell script adding executable permission, saving it as.txt
and clicking on it will make it open in Kate. If I rename it to.sh
then clicking on it runs it.
â Bakuriu
Jul 27 '16 at 11:22
9
linux: since make is build around rules that depend on the file extension, wouldn't this make (no pun intended) the extensions meant for more than just humans?
â bolov
Jul 27 '16 at 13:08
14
This is a monumentally wrong answer. Some parts of Linux use magic numbers to determine file types. Executing files at the command line. But other huge parts of the system use file extensions to know what to look at, whether those be the dynamic linker (which wants .so files), modprobe, build systems, plugins, libraries for python, ruby, etc. Many files don't have magic numbers,file
is heuristic-based, not definite.
â Alan Shutko
Jul 29 '16 at 0:08
3
"Linux determines the type of a file via a code in the file header" "correct" WTF? What "code in the file header" ? There is no such code, and there is no such a generic "file header" in Linux.
â leonbloy
Jul 30 '16 at 18:37
 |Â
show 15 more comments
up vote
37
down vote
accepted
Linux determines the type of a file via a code in the file header. It doesn't depend on file extensions for to know with software is to use for opening the file.
That's what I remember from my education. Please correct me in case I'm wrong!
- correctly remembered.
Are these extensions are meant only for humans?
- Yes, with a but.
When you interact with other operating systems that do depend on extensions being what they are it is the smarter idea to use those.
In Windows, opening software is attached to the extensions.
Opening a text file named "file" is harder in Windows than opening the same file named "file.txt" (you will need to switch the file open dialog from *.txt
to *.*
every time). The same goes for TAB and semi-colon separated text files. The same goes for importing and exporting e-mails (.mbox extension).
In particular when you code software. Opening a file named "software1" that is an HTML file and "software2" that is a JavaScript file becomes more difficult compared to "software.html" and "software.js".
If there is a system in place in Linux where file extensions are important, I would call that a bug. When software depends on file extensions, that is exploitable. We use an interpreter directive to identify what a file is ("the first two bytes in a file can be the characters "#!", which constitute a magic number (hexadecimal 23 and 21, the ASCII values of "#" and "!") often referred to as shebang,").
The most famous problem with file extensions was LOVE-LETTER-FOR-YOU.TXT.vbs on Windows. This is a visual basic script being shown in file explorer as a text file.
In Ubuntu when you start a file from Nautilus you get a warning what it is going to do. Executing a script from Nautilus where it wants to start some software where it is supposed to open gEdit is obvious a problem and we get a warning about it.
In command line when you execute something, you can visually see what the extension is. If it ends on .vbs I would start to become suspicious (not that .vbs is executable on Linux. At least not without some more effort ;) ).
30
I completely don't get what you wanted to say in your last sentence. First, it is a problem of hiding the extension rather than having it, second the exploit would work the same in Linux - you name a binary filereadme.txt
and make it executable. If user executed it, it does not open the editor, but runs the code. In this respect making extensions matter (but not hiding them) is more secure and easier to explain for non-savvy users. There are other differences (most notably not executing files from the current directory), but they have nothing to do with extensions.
â techraf
Jul 27 '16 at 8:07
3
@techraf Actually the file manager will probably try to open thereadme.txt
file with a text editor. I just tried with dolphin in KDE, creating a shell script adding executable permission, saving it as.txt
and clicking on it will make it open in Kate. If I rename it to.sh
then clicking on it runs it.
â Bakuriu
Jul 27 '16 at 11:22
9
linux: since make is build around rules that depend on the file extension, wouldn't this make (no pun intended) the extensions meant for more than just humans?
â bolov
Jul 27 '16 at 13:08
14
This is a monumentally wrong answer. Some parts of Linux use magic numbers to determine file types. Executing files at the command line. But other huge parts of the system use file extensions to know what to look at, whether those be the dynamic linker (which wants .so files), modprobe, build systems, plugins, libraries for python, ruby, etc. Many files don't have magic numbers,file
is heuristic-based, not definite.
â Alan Shutko
Jul 29 '16 at 0:08
3
"Linux determines the type of a file via a code in the file header" "correct" WTF? What "code in the file header" ? There is no such code, and there is no such a generic "file header" in Linux.
â leonbloy
Jul 30 '16 at 18:37
 |Â
show 15 more comments
up vote
37
down vote
accepted
up vote
37
down vote
accepted
Linux determines the type of a file via a code in the file header. It doesn't depend on file extensions for to know with software is to use for opening the file.
That's what I remember from my education. Please correct me in case I'm wrong!
- correctly remembered.
Are these extensions are meant only for humans?
- Yes, with a but.
When you interact with other operating systems that do depend on extensions being what they are it is the smarter idea to use those.
In Windows, opening software is attached to the extensions.
Opening a text file named "file" is harder in Windows than opening the same file named "file.txt" (you will need to switch the file open dialog from *.txt
to *.*
every time). The same goes for TAB and semi-colon separated text files. The same goes for importing and exporting e-mails (.mbox extension).
In particular when you code software. Opening a file named "software1" that is an HTML file and "software2" that is a JavaScript file becomes more difficult compared to "software.html" and "software.js".
If there is a system in place in Linux where file extensions are important, I would call that a bug. When software depends on file extensions, that is exploitable. We use an interpreter directive to identify what a file is ("the first two bytes in a file can be the characters "#!", which constitute a magic number (hexadecimal 23 and 21, the ASCII values of "#" and "!") often referred to as shebang,").
The most famous problem with file extensions was LOVE-LETTER-FOR-YOU.TXT.vbs on Windows. This is a visual basic script being shown in file explorer as a text file.
In Ubuntu when you start a file from Nautilus you get a warning what it is going to do. Executing a script from Nautilus where it wants to start some software where it is supposed to open gEdit is obvious a problem and we get a warning about it.
In command line when you execute something, you can visually see what the extension is. If it ends on .vbs I would start to become suspicious (not that .vbs is executable on Linux. At least not without some more effort ;) ).
Linux determines the type of a file via a code in the file header. It doesn't depend on file extensions for to know with software is to use for opening the file.
That's what I remember from my education. Please correct me in case I'm wrong!
- correctly remembered.
Are these extensions are meant only for humans?
- Yes, with a but.
When you interact with other operating systems that do depend on extensions being what they are it is the smarter idea to use those.
In Windows, opening software is attached to the extensions.
Opening a text file named "file" is harder in Windows than opening the same file named "file.txt" (you will need to switch the file open dialog from *.txt
to *.*
every time). The same goes for TAB and semi-colon separated text files. The same goes for importing and exporting e-mails (.mbox extension).
In particular when you code software. Opening a file named "software1" that is an HTML file and "software2" that is a JavaScript file becomes more difficult compared to "software.html" and "software.js".
If there is a system in place in Linux where file extensions are important, I would call that a bug. When software depends on file extensions, that is exploitable. We use an interpreter directive to identify what a file is ("the first two bytes in a file can be the characters "#!", which constitute a magic number (hexadecimal 23 and 21, the ASCII values of "#" and "!") often referred to as shebang,").
The most famous problem with file extensions was LOVE-LETTER-FOR-YOU.TXT.vbs on Windows. This is a visual basic script being shown in file explorer as a text file.
In Ubuntu when you start a file from Nautilus you get a warning what it is going to do. Executing a script from Nautilus where it wants to start some software where it is supposed to open gEdit is obvious a problem and we get a warning about it.
In command line when you execute something, you can visually see what the extension is. If it ends on .vbs I would start to become suspicious (not that .vbs is executable on Linux. At least not without some more effort ;) ).
edited Jul 31 '16 at 17:38
TimWolla
248214
248214
answered Jul 27 '16 at 7:01
Rinzwind
196k25375509
196k25375509
30
I completely don't get what you wanted to say in your last sentence. First, it is a problem of hiding the extension rather than having it, second the exploit would work the same in Linux - you name a binary filereadme.txt
and make it executable. If user executed it, it does not open the editor, but runs the code. In this respect making extensions matter (but not hiding them) is more secure and easier to explain for non-savvy users. There are other differences (most notably not executing files from the current directory), but they have nothing to do with extensions.
â techraf
Jul 27 '16 at 8:07
3
@techraf Actually the file manager will probably try to open thereadme.txt
file with a text editor. I just tried with dolphin in KDE, creating a shell script adding executable permission, saving it as.txt
and clicking on it will make it open in Kate. If I rename it to.sh
then clicking on it runs it.
â Bakuriu
Jul 27 '16 at 11:22
9
linux: since make is build around rules that depend on the file extension, wouldn't this make (no pun intended) the extensions meant for more than just humans?
â bolov
Jul 27 '16 at 13:08
14
This is a monumentally wrong answer. Some parts of Linux use magic numbers to determine file types. Executing files at the command line. But other huge parts of the system use file extensions to know what to look at, whether those be the dynamic linker (which wants .so files), modprobe, build systems, plugins, libraries for python, ruby, etc. Many files don't have magic numbers,file
is heuristic-based, not definite.
â Alan Shutko
Jul 29 '16 at 0:08
3
"Linux determines the type of a file via a code in the file header" "correct" WTF? What "code in the file header" ? There is no such code, and there is no such a generic "file header" in Linux.
â leonbloy
Jul 30 '16 at 18:37
 |Â
show 15 more comments
30
I completely don't get what you wanted to say in your last sentence. First, it is a problem of hiding the extension rather than having it, second the exploit would work the same in Linux - you name a binary filereadme.txt
and make it executable. If user executed it, it does not open the editor, but runs the code. In this respect making extensions matter (but not hiding them) is more secure and easier to explain for non-savvy users. There are other differences (most notably not executing files from the current directory), but they have nothing to do with extensions.
â techraf
Jul 27 '16 at 8:07
3
@techraf Actually the file manager will probably try to open thereadme.txt
file with a text editor. I just tried with dolphin in KDE, creating a shell script adding executable permission, saving it as.txt
and clicking on it will make it open in Kate. If I rename it to.sh
then clicking on it runs it.
â Bakuriu
Jul 27 '16 at 11:22
9
linux: since make is build around rules that depend on the file extension, wouldn't this make (no pun intended) the extensions meant for more than just humans?
â bolov
Jul 27 '16 at 13:08
14
This is a monumentally wrong answer. Some parts of Linux use magic numbers to determine file types. Executing files at the command line. But other huge parts of the system use file extensions to know what to look at, whether those be the dynamic linker (which wants .so files), modprobe, build systems, plugins, libraries for python, ruby, etc. Many files don't have magic numbers,file
is heuristic-based, not definite.
â Alan Shutko
Jul 29 '16 at 0:08
3
"Linux determines the type of a file via a code in the file header" "correct" WTF? What "code in the file header" ? There is no such code, and there is no such a generic "file header" in Linux.
â leonbloy
Jul 30 '16 at 18:37
30
30
I completely don't get what you wanted to say in your last sentence. First, it is a problem of hiding the extension rather than having it, second the exploit would work the same in Linux - you name a binary file
readme.txt
and make it executable. If user executed it, it does not open the editor, but runs the code. In this respect making extensions matter (but not hiding them) is more secure and easier to explain for non-savvy users. There are other differences (most notably not executing files from the current directory), but they have nothing to do with extensions.â techraf
Jul 27 '16 at 8:07
I completely don't get what you wanted to say in your last sentence. First, it is a problem of hiding the extension rather than having it, second the exploit would work the same in Linux - you name a binary file
readme.txt
and make it executable. If user executed it, it does not open the editor, but runs the code. In this respect making extensions matter (but not hiding them) is more secure and easier to explain for non-savvy users. There are other differences (most notably not executing files from the current directory), but they have nothing to do with extensions.â techraf
Jul 27 '16 at 8:07
3
3
@techraf Actually the file manager will probably try to open the
readme.txt
file with a text editor. I just tried with dolphin in KDE, creating a shell script adding executable permission, saving it as .txt
and clicking on it will make it open in Kate. If I rename it to .sh
then clicking on it runs it.â Bakuriu
Jul 27 '16 at 11:22
@techraf Actually the file manager will probably try to open the
readme.txt
file with a text editor. I just tried with dolphin in KDE, creating a shell script adding executable permission, saving it as .txt
and clicking on it will make it open in Kate. If I rename it to .sh
then clicking on it runs it.â Bakuriu
Jul 27 '16 at 11:22
9
9
linux: since make is build around rules that depend on the file extension, wouldn't this make (no pun intended) the extensions meant for more than just humans?
â bolov
Jul 27 '16 at 13:08
linux: since make is build around rules that depend on the file extension, wouldn't this make (no pun intended) the extensions meant for more than just humans?
â bolov
Jul 27 '16 at 13:08
14
14
This is a monumentally wrong answer. Some parts of Linux use magic numbers to determine file types. Executing files at the command line. But other huge parts of the system use file extensions to know what to look at, whether those be the dynamic linker (which wants .so files), modprobe, build systems, plugins, libraries for python, ruby, etc. Many files don't have magic numbers,
file
is heuristic-based, not definite.â Alan Shutko
Jul 29 '16 at 0:08
This is a monumentally wrong answer. Some parts of Linux use magic numbers to determine file types. Executing files at the command line. But other huge parts of the system use file extensions to know what to look at, whether those be the dynamic linker (which wants .so files), modprobe, build systems, plugins, libraries for python, ruby, etc. Many files don't have magic numbers,
file
is heuristic-based, not definite.â Alan Shutko
Jul 29 '16 at 0:08
3
3
"Linux determines the type of a file via a code in the file header" "correct" WTF? What "code in the file header" ? There is no such code, and there is no such a generic "file header" in Linux.
â leonbloy
Jul 30 '16 at 18:37
"Linux determines the type of a file via a code in the file header" "correct" WTF? What "code in the file header" ? There is no such code, and there is no such a generic "file header" in Linux.
â leonbloy
Jul 30 '16 at 18:37
 |Â
show 15 more comments
up vote
64
down vote
There is no 100% black or white answer here.
Usually Linux does not rely on file names (and file extensions i.e. the part of the file name after the normally last period) and instead determines the file type by examining the first few bytes of its content and comparing that to a list of known magic numbers.
For example all Bitmap image files (usually with name extension .bmp
) must start with the letters BM
in their first two bytes. Scripts in most scripting languages like Bash, Python, Perl, AWK, etc. (basically everything that treats lines starting with #
as comment) may contain a shebang like #!/bin/bash
as first line. This special comment tells the system with which application to open the file.
So normally the operating system relies on the file content and not its name to determine the file type, but stating that file extensions are never needed on Linux is only half of the truth.
Applications may of course implement their file checks however they want, which includes verifying the file name and extension. An example is the Eye of Gnome (eog
, standard picture viewer) which determines the image format by the file extension and throws an error if it does not match the content. Whether this is a bug or a feature can be discussed...
However, even some parts of the operating system rely on file name extensions, e.g. when parsing your software sources files in /etc/apt/sources.list.d/
- only files with the *.list
extension get parsed all others are ignored. It's maybe not mainly used to determine the file type here but rather to enable/disable parsing of some files, but it's still a file extension that affects how the system treats a file.
And of course the human user profits most from file extensions as that makes the type of a file obvious and also allows multiple files with the same base name and different extensions like site.html
, site.php
, site.js
, site.css
etc. The disadvantage is of course that file extension and the actual file type/content do not necessarily have to match.
Additionally it's needed for cross-platform interoperability, as e.g. Windows will not know what to do with a readme
file, but only a readme.txt
.
You slightly contradict yourself here: if the standard image viewer requires a filename ending .bmp, what part of the OS are you saying relies on the file content starting "BM"? AFAIK, the only "magic numbers the kernel cares about are executable types, including the special case of#!
. Everything else is up to some application's decision.
â IMSoP
Jul 27 '16 at 18:20
@IMSoP I don't know the exact implementation ofeog
and I don't know why they care about the file name at all. This is a bug in my opinion. And of course if the file is named "bmp" but its content format does not match, there will be an error as well, of course. Of course each application decides how to verify files, but in general Linux applications should not rely on the name. Btw, you can use thefile
commend to examine file types by their content.
â Byte Commander
Jul 27 '16 at 19:12
1
The sentence I am challenging is this: "Linux ... determines the file type by examining the first few bytes". What definition of "Linux" are you using in that sentence? The existence of thefile
utility doesn't really prove anything; it's a useful tool, that could exist on any OS. What fundamental part of the OS makes runningfile
any more "correct" than globbing the file name?
â IMSoP
Jul 27 '16 at 21:51
Note that files without an extension can be associated with a program.
â isanae
Jul 28 '16 at 2:09
add a comment |Â
up vote
64
down vote
There is no 100% black or white answer here.
Usually Linux does not rely on file names (and file extensions i.e. the part of the file name after the normally last period) and instead determines the file type by examining the first few bytes of its content and comparing that to a list of known magic numbers.
For example all Bitmap image files (usually with name extension .bmp
) must start with the letters BM
in their first two bytes. Scripts in most scripting languages like Bash, Python, Perl, AWK, etc. (basically everything that treats lines starting with #
as comment) may contain a shebang like #!/bin/bash
as first line. This special comment tells the system with which application to open the file.
So normally the operating system relies on the file content and not its name to determine the file type, but stating that file extensions are never needed on Linux is only half of the truth.
Applications may of course implement their file checks however they want, which includes verifying the file name and extension. An example is the Eye of Gnome (eog
, standard picture viewer) which determines the image format by the file extension and throws an error if it does not match the content. Whether this is a bug or a feature can be discussed...
However, even some parts of the operating system rely on file name extensions, e.g. when parsing your software sources files in /etc/apt/sources.list.d/
- only files with the *.list
extension get parsed all others are ignored. It's maybe not mainly used to determine the file type here but rather to enable/disable parsing of some files, but it's still a file extension that affects how the system treats a file.
And of course the human user profits most from file extensions as that makes the type of a file obvious and also allows multiple files with the same base name and different extensions like site.html
, site.php
, site.js
, site.css
etc. The disadvantage is of course that file extension and the actual file type/content do not necessarily have to match.
Additionally it's needed for cross-platform interoperability, as e.g. Windows will not know what to do with a readme
file, but only a readme.txt
.
You slightly contradict yourself here: if the standard image viewer requires a filename ending .bmp, what part of the OS are you saying relies on the file content starting "BM"? AFAIK, the only "magic numbers the kernel cares about are executable types, including the special case of#!
. Everything else is up to some application's decision.
â IMSoP
Jul 27 '16 at 18:20
@IMSoP I don't know the exact implementation ofeog
and I don't know why they care about the file name at all. This is a bug in my opinion. And of course if the file is named "bmp" but its content format does not match, there will be an error as well, of course. Of course each application decides how to verify files, but in general Linux applications should not rely on the name. Btw, you can use thefile
commend to examine file types by their content.
â Byte Commander
Jul 27 '16 at 19:12
1
The sentence I am challenging is this: "Linux ... determines the file type by examining the first few bytes". What definition of "Linux" are you using in that sentence? The existence of thefile
utility doesn't really prove anything; it's a useful tool, that could exist on any OS. What fundamental part of the OS makes runningfile
any more "correct" than globbing the file name?
â IMSoP
Jul 27 '16 at 21:51
Note that files without an extension can be associated with a program.
â isanae
Jul 28 '16 at 2:09
add a comment |Â
up vote
64
down vote
up vote
64
down vote
There is no 100% black or white answer here.
Usually Linux does not rely on file names (and file extensions i.e. the part of the file name after the normally last period) and instead determines the file type by examining the first few bytes of its content and comparing that to a list of known magic numbers.
For example all Bitmap image files (usually with name extension .bmp
) must start with the letters BM
in their first two bytes. Scripts in most scripting languages like Bash, Python, Perl, AWK, etc. (basically everything that treats lines starting with #
as comment) may contain a shebang like #!/bin/bash
as first line. This special comment tells the system with which application to open the file.
So normally the operating system relies on the file content and not its name to determine the file type, but stating that file extensions are never needed on Linux is only half of the truth.
Applications may of course implement their file checks however they want, which includes verifying the file name and extension. An example is the Eye of Gnome (eog
, standard picture viewer) which determines the image format by the file extension and throws an error if it does not match the content. Whether this is a bug or a feature can be discussed...
However, even some parts of the operating system rely on file name extensions, e.g. when parsing your software sources files in /etc/apt/sources.list.d/
- only files with the *.list
extension get parsed all others are ignored. It's maybe not mainly used to determine the file type here but rather to enable/disable parsing of some files, but it's still a file extension that affects how the system treats a file.
And of course the human user profits most from file extensions as that makes the type of a file obvious and also allows multiple files with the same base name and different extensions like site.html
, site.php
, site.js
, site.css
etc. The disadvantage is of course that file extension and the actual file type/content do not necessarily have to match.
Additionally it's needed for cross-platform interoperability, as e.g. Windows will not know what to do with a readme
file, but only a readme.txt
.
There is no 100% black or white answer here.
Usually Linux does not rely on file names (and file extensions i.e. the part of the file name after the normally last period) and instead determines the file type by examining the first few bytes of its content and comparing that to a list of known magic numbers.
For example all Bitmap image files (usually with name extension .bmp
) must start with the letters BM
in their first two bytes. Scripts in most scripting languages like Bash, Python, Perl, AWK, etc. (basically everything that treats lines starting with #
as comment) may contain a shebang like #!/bin/bash
as first line. This special comment tells the system with which application to open the file.
So normally the operating system relies on the file content and not its name to determine the file type, but stating that file extensions are never needed on Linux is only half of the truth.
Applications may of course implement their file checks however they want, which includes verifying the file name and extension. An example is the Eye of Gnome (eog
, standard picture viewer) which determines the image format by the file extension and throws an error if it does not match the content. Whether this is a bug or a feature can be discussed...
However, even some parts of the operating system rely on file name extensions, e.g. when parsing your software sources files in /etc/apt/sources.list.d/
- only files with the *.list
extension get parsed all others are ignored. It's maybe not mainly used to determine the file type here but rather to enable/disable parsing of some files, but it's still a file extension that affects how the system treats a file.
And of course the human user profits most from file extensions as that makes the type of a file obvious and also allows multiple files with the same base name and different extensions like site.html
, site.php
, site.js
, site.css
etc. The disadvantage is of course that file extension and the actual file type/content do not necessarily have to match.
Additionally it's needed for cross-platform interoperability, as e.g. Windows will not know what to do with a readme
file, but only a readme.txt
.
edited Jul 28 '16 at 1:44
techraf
2,75291935
2,75291935
answered Jul 27 '16 at 7:22
![](https://i.stack.imgur.com/m8DYH.jpg?s=32&g=1)
![](https://i.stack.imgur.com/m8DYH.jpg?s=32&g=1)
Byte Commander
59.2k26158267
59.2k26158267
You slightly contradict yourself here: if the standard image viewer requires a filename ending .bmp, what part of the OS are you saying relies on the file content starting "BM"? AFAIK, the only "magic numbers the kernel cares about are executable types, including the special case of#!
. Everything else is up to some application's decision.
â IMSoP
Jul 27 '16 at 18:20
@IMSoP I don't know the exact implementation ofeog
and I don't know why they care about the file name at all. This is a bug in my opinion. And of course if the file is named "bmp" but its content format does not match, there will be an error as well, of course. Of course each application decides how to verify files, but in general Linux applications should not rely on the name. Btw, you can use thefile
commend to examine file types by their content.
â Byte Commander
Jul 27 '16 at 19:12
1
The sentence I am challenging is this: "Linux ... determines the file type by examining the first few bytes". What definition of "Linux" are you using in that sentence? The existence of thefile
utility doesn't really prove anything; it's a useful tool, that could exist on any OS. What fundamental part of the OS makes runningfile
any more "correct" than globbing the file name?
â IMSoP
Jul 27 '16 at 21:51
Note that files without an extension can be associated with a program.
â isanae
Jul 28 '16 at 2:09
add a comment |Â
You slightly contradict yourself here: if the standard image viewer requires a filename ending .bmp, what part of the OS are you saying relies on the file content starting "BM"? AFAIK, the only "magic numbers the kernel cares about are executable types, including the special case of#!
. Everything else is up to some application's decision.
â IMSoP
Jul 27 '16 at 18:20
@IMSoP I don't know the exact implementation ofeog
and I don't know why they care about the file name at all. This is a bug in my opinion. And of course if the file is named "bmp" but its content format does not match, there will be an error as well, of course. Of course each application decides how to verify files, but in general Linux applications should not rely on the name. Btw, you can use thefile
commend to examine file types by their content.
â Byte Commander
Jul 27 '16 at 19:12
1
The sentence I am challenging is this: "Linux ... determines the file type by examining the first few bytes". What definition of "Linux" are you using in that sentence? The existence of thefile
utility doesn't really prove anything; it's a useful tool, that could exist on any OS. What fundamental part of the OS makes runningfile
any more "correct" than globbing the file name?
â IMSoP
Jul 27 '16 at 21:51
Note that files without an extension can be associated with a program.
â isanae
Jul 28 '16 at 2:09
You slightly contradict yourself here: if the standard image viewer requires a filename ending .bmp, what part of the OS are you saying relies on the file content starting "BM"? AFAIK, the only "magic numbers the kernel cares about are executable types, including the special case of
#!
. Everything else is up to some application's decision.â IMSoP
Jul 27 '16 at 18:20
You slightly contradict yourself here: if the standard image viewer requires a filename ending .bmp, what part of the OS are you saying relies on the file content starting "BM"? AFAIK, the only "magic numbers the kernel cares about are executable types, including the special case of
#!
. Everything else is up to some application's decision.â IMSoP
Jul 27 '16 at 18:20
@IMSoP I don't know the exact implementation of
eog
and I don't know why they care about the file name at all. This is a bug in my opinion. And of course if the file is named "bmp" but its content format does not match, there will be an error as well, of course. Of course each application decides how to verify files, but in general Linux applications should not rely on the name. Btw, you can use the file
commend to examine file types by their content.â Byte Commander
Jul 27 '16 at 19:12
@IMSoP I don't know the exact implementation of
eog
and I don't know why they care about the file name at all. This is a bug in my opinion. And of course if the file is named "bmp" but its content format does not match, there will be an error as well, of course. Of course each application decides how to verify files, but in general Linux applications should not rely on the name. Btw, you can use the file
commend to examine file types by their content.â Byte Commander
Jul 27 '16 at 19:12
1
1
The sentence I am challenging is this: "Linux ... determines the file type by examining the first few bytes". What definition of "Linux" are you using in that sentence? The existence of the
file
utility doesn't really prove anything; it's a useful tool, that could exist on any OS. What fundamental part of the OS makes running file
any more "correct" than globbing the file name?â IMSoP
Jul 27 '16 at 21:51
The sentence I am challenging is this: "Linux ... determines the file type by examining the first few bytes". What definition of "Linux" are you using in that sentence? The existence of the
file
utility doesn't really prove anything; it's a useful tool, that could exist on any OS. What fundamental part of the OS makes running file
any more "correct" than globbing the file name?â IMSoP
Jul 27 '16 at 21:51
Note that files without an extension can be associated with a program.
â isanae
Jul 28 '16 at 2:09
Note that files without an extension can be associated with a program.
â isanae
Jul 28 '16 at 2:09
add a comment |Â
up vote
23
down vote
As mentioned by others, in Linux an interpreter directive method is used (storing some metadata in a file as a header or magic number so the correct interpreter can be told to read it) rather than the filename extension association method used by Windows.
This means you can create a file with almost any name you like... with a few exceptions
However
I would like to add a word of caution.
If you have some files on your system from a system that uses filename association, the files may not have those magic numbers or headers. Filename extensions are used to identify these files by applications that are able to read them, and you may experience some unexpected effects if you rename such files. For example:
If you rename a file My Novel.doc
to My-Novel
, Libreoffice will still be able to open it, but it will open as 'Untitled' and you will have to name it again in order to save it (Libreoffice adds an extension by default, so you would then have two files My-Novel
and My-Novel.odt
, which could be annoying)
More seriously, if you rename a file My Spreadsheet.xlsx to My-Spreadsheet, then try to open it with xdg-open My-Spreadsheet
you will get this (because it's actually a compressed file):
And if you rename a file My Spreadsheet.xls
to My-Spreadsheet
, when you xdg-open My-Spreadsheet
you get an error saying
error opening location: No application is registered as handling this file
(Although in both these cases it works OK if you do soffice My-Spreadsheet
)
If you then rename the extensionless file to My-Spreadsheet.ods
with mv
and try to open it you will get this:
(repair fails)
And you will have to put the original extension back on to open the file correctly (you can then convert the format if you wish)
TL;DR:
If you have non-native files with name extensions, don't remove the extensions assuming everything will be OK!
3
A new-style MS Office document (docx, xlsx, pptx etc) without file extension opens in the archive manager because those file types are actually just ordinary ZIP compressed files which contain all the XML documents and media files necessary to define the document content. The file format of a ZIP compressed directory is pretty common nowadays btw.
â Byte Commander
Jul 27 '16 at 8:21
1
Already many great answers, but just one more specific to libreoffice that I've noticed. You create a file of comma separated values (CSV) and save it as "test.csv", a window will open asking what type of separator are you using (i.e., libreoffice Calc). If you rename this file to "test.cs", for example, then libreoffice's Writer opens it. So, besides the ZIP example above, it does seem like libreoffice does make use of the file extension.
â Ray
Jul 27 '16 at 8:31
2
The linux filesystem doesn't do anything regarding file types. That is all down to the programs running on top of it.
â Peter Green
Jul 27 '16 at 15:29
@PeterGreen Yes, but the fact that the programs do assign it significance means it's not "just for humans" the way, e.g., classic MacOS had it [there were four-byte "file type" and "creator app" fields that weren't part of the file name, so the OS and applications had all the information they needed without looking at file extensions]
â Random832
Jul 27 '16 at 15:36
3
@PeterGreen The Windows filesystem doesn't do anything regarding file types either. The graphical shell (Windows Explorer) uses file extension to choose an action for double-click, but technically that's just a program running on top of the OS, just as Nautilus is. It would be perfectly possible to write a Linux file manager with that behaviour, or a Windows one which examined the file contents.
â IMSoP
Jul 27 '16 at 17:19
add a comment |Â
up vote
23
down vote
As mentioned by others, in Linux an interpreter directive method is used (storing some metadata in a file as a header or magic number so the correct interpreter can be told to read it) rather than the filename extension association method used by Windows.
This means you can create a file with almost any name you like... with a few exceptions
However
I would like to add a word of caution.
If you have some files on your system from a system that uses filename association, the files may not have those magic numbers or headers. Filename extensions are used to identify these files by applications that are able to read them, and you may experience some unexpected effects if you rename such files. For example:
If you rename a file My Novel.doc
to My-Novel
, Libreoffice will still be able to open it, but it will open as 'Untitled' and you will have to name it again in order to save it (Libreoffice adds an extension by default, so you would then have two files My-Novel
and My-Novel.odt
, which could be annoying)
More seriously, if you rename a file My Spreadsheet.xlsx to My-Spreadsheet, then try to open it with xdg-open My-Spreadsheet
you will get this (because it's actually a compressed file):
And if you rename a file My Spreadsheet.xls
to My-Spreadsheet
, when you xdg-open My-Spreadsheet
you get an error saying
error opening location: No application is registered as handling this file
(Although in both these cases it works OK if you do soffice My-Spreadsheet
)
If you then rename the extensionless file to My-Spreadsheet.ods
with mv
and try to open it you will get this:
(repair fails)
And you will have to put the original extension back on to open the file correctly (you can then convert the format if you wish)
TL;DR:
If you have non-native files with name extensions, don't remove the extensions assuming everything will be OK!
3
A new-style MS Office document (docx, xlsx, pptx etc) without file extension opens in the archive manager because those file types are actually just ordinary ZIP compressed files which contain all the XML documents and media files necessary to define the document content. The file format of a ZIP compressed directory is pretty common nowadays btw.
â Byte Commander
Jul 27 '16 at 8:21
1
Already many great answers, but just one more specific to libreoffice that I've noticed. You create a file of comma separated values (CSV) and save it as "test.csv", a window will open asking what type of separator are you using (i.e., libreoffice Calc). If you rename this file to "test.cs", for example, then libreoffice's Writer opens it. So, besides the ZIP example above, it does seem like libreoffice does make use of the file extension.
â Ray
Jul 27 '16 at 8:31
2
The linux filesystem doesn't do anything regarding file types. That is all down to the programs running on top of it.
â Peter Green
Jul 27 '16 at 15:29
@PeterGreen Yes, but the fact that the programs do assign it significance means it's not "just for humans" the way, e.g., classic MacOS had it [there were four-byte "file type" and "creator app" fields that weren't part of the file name, so the OS and applications had all the information they needed without looking at file extensions]
â Random832
Jul 27 '16 at 15:36
3
@PeterGreen The Windows filesystem doesn't do anything regarding file types either. The graphical shell (Windows Explorer) uses file extension to choose an action for double-click, but technically that's just a program running on top of the OS, just as Nautilus is. It would be perfectly possible to write a Linux file manager with that behaviour, or a Windows one which examined the file contents.
â IMSoP
Jul 27 '16 at 17:19
add a comment |Â
up vote
23
down vote
up vote
23
down vote
As mentioned by others, in Linux an interpreter directive method is used (storing some metadata in a file as a header or magic number so the correct interpreter can be told to read it) rather than the filename extension association method used by Windows.
This means you can create a file with almost any name you like... with a few exceptions
However
I would like to add a word of caution.
If you have some files on your system from a system that uses filename association, the files may not have those magic numbers or headers. Filename extensions are used to identify these files by applications that are able to read them, and you may experience some unexpected effects if you rename such files. For example:
If you rename a file My Novel.doc
to My-Novel
, Libreoffice will still be able to open it, but it will open as 'Untitled' and you will have to name it again in order to save it (Libreoffice adds an extension by default, so you would then have two files My-Novel
and My-Novel.odt
, which could be annoying)
More seriously, if you rename a file My Spreadsheet.xlsx to My-Spreadsheet, then try to open it with xdg-open My-Spreadsheet
you will get this (because it's actually a compressed file):
And if you rename a file My Spreadsheet.xls
to My-Spreadsheet
, when you xdg-open My-Spreadsheet
you get an error saying
error opening location: No application is registered as handling this file
(Although in both these cases it works OK if you do soffice My-Spreadsheet
)
If you then rename the extensionless file to My-Spreadsheet.ods
with mv
and try to open it you will get this:
(repair fails)
And you will have to put the original extension back on to open the file correctly (you can then convert the format if you wish)
TL;DR:
If you have non-native files with name extensions, don't remove the extensions assuming everything will be OK!
As mentioned by others, in Linux an interpreter directive method is used (storing some metadata in a file as a header or magic number so the correct interpreter can be told to read it) rather than the filename extension association method used by Windows.
This means you can create a file with almost any name you like... with a few exceptions
However
I would like to add a word of caution.
If you have some files on your system from a system that uses filename association, the files may not have those magic numbers or headers. Filename extensions are used to identify these files by applications that are able to read them, and you may experience some unexpected effects if you rename such files. For example:
If you rename a file My Novel.doc
to My-Novel
, Libreoffice will still be able to open it, but it will open as 'Untitled' and you will have to name it again in order to save it (Libreoffice adds an extension by default, so you would then have two files My-Novel
and My-Novel.odt
, which could be annoying)
More seriously, if you rename a file My Spreadsheet.xlsx to My-Spreadsheet, then try to open it with xdg-open My-Spreadsheet
you will get this (because it's actually a compressed file):
And if you rename a file My Spreadsheet.xls
to My-Spreadsheet
, when you xdg-open My-Spreadsheet
you get an error saying
error opening location: No application is registered as handling this file
(Although in both these cases it works OK if you do soffice My-Spreadsheet
)
If you then rename the extensionless file to My-Spreadsheet.ods
with mv
and try to open it you will get this:
(repair fails)
And you will have to put the original extension back on to open the file correctly (you can then convert the format if you wish)
TL;DR:
If you have non-native files with name extensions, don't remove the extensions assuming everything will be OK!
edited Jul 29 '16 at 6:28
answered Jul 27 '16 at 8:06
![](https://i.stack.imgur.com/8CW8e.png?s=32&g=1)
![](https://i.stack.imgur.com/8CW8e.png?s=32&g=1)
Zanna
48k13119227
48k13119227
3
A new-style MS Office document (docx, xlsx, pptx etc) without file extension opens in the archive manager because those file types are actually just ordinary ZIP compressed files which contain all the XML documents and media files necessary to define the document content. The file format of a ZIP compressed directory is pretty common nowadays btw.
â Byte Commander
Jul 27 '16 at 8:21
1
Already many great answers, but just one more specific to libreoffice that I've noticed. You create a file of comma separated values (CSV) and save it as "test.csv", a window will open asking what type of separator are you using (i.e., libreoffice Calc). If you rename this file to "test.cs", for example, then libreoffice's Writer opens it. So, besides the ZIP example above, it does seem like libreoffice does make use of the file extension.
â Ray
Jul 27 '16 at 8:31
2
The linux filesystem doesn't do anything regarding file types. That is all down to the programs running on top of it.
â Peter Green
Jul 27 '16 at 15:29
@PeterGreen Yes, but the fact that the programs do assign it significance means it's not "just for humans" the way, e.g., classic MacOS had it [there were four-byte "file type" and "creator app" fields that weren't part of the file name, so the OS and applications had all the information they needed without looking at file extensions]
â Random832
Jul 27 '16 at 15:36
3
@PeterGreen The Windows filesystem doesn't do anything regarding file types either. The graphical shell (Windows Explorer) uses file extension to choose an action for double-click, but technically that's just a program running on top of the OS, just as Nautilus is. It would be perfectly possible to write a Linux file manager with that behaviour, or a Windows one which examined the file contents.
â IMSoP
Jul 27 '16 at 17:19
add a comment |Â
3
A new-style MS Office document (docx, xlsx, pptx etc) without file extension opens in the archive manager because those file types are actually just ordinary ZIP compressed files which contain all the XML documents and media files necessary to define the document content. The file format of a ZIP compressed directory is pretty common nowadays btw.
â Byte Commander
Jul 27 '16 at 8:21
1
Already many great answers, but just one more specific to libreoffice that I've noticed. You create a file of comma separated values (CSV) and save it as "test.csv", a window will open asking what type of separator are you using (i.e., libreoffice Calc). If you rename this file to "test.cs", for example, then libreoffice's Writer opens it. So, besides the ZIP example above, it does seem like libreoffice does make use of the file extension.
â Ray
Jul 27 '16 at 8:31
2
The linux filesystem doesn't do anything regarding file types. That is all down to the programs running on top of it.
â Peter Green
Jul 27 '16 at 15:29
@PeterGreen Yes, but the fact that the programs do assign it significance means it's not "just for humans" the way, e.g., classic MacOS had it [there were four-byte "file type" and "creator app" fields that weren't part of the file name, so the OS and applications had all the information they needed without looking at file extensions]
â Random832
Jul 27 '16 at 15:36
3
@PeterGreen The Windows filesystem doesn't do anything regarding file types either. The graphical shell (Windows Explorer) uses file extension to choose an action for double-click, but technically that's just a program running on top of the OS, just as Nautilus is. It would be perfectly possible to write a Linux file manager with that behaviour, or a Windows one which examined the file contents.
â IMSoP
Jul 27 '16 at 17:19
3
3
A new-style MS Office document (docx, xlsx, pptx etc) without file extension opens in the archive manager because those file types are actually just ordinary ZIP compressed files which contain all the XML documents and media files necessary to define the document content. The file format of a ZIP compressed directory is pretty common nowadays btw.
â Byte Commander
Jul 27 '16 at 8:21
A new-style MS Office document (docx, xlsx, pptx etc) without file extension opens in the archive manager because those file types are actually just ordinary ZIP compressed files which contain all the XML documents and media files necessary to define the document content. The file format of a ZIP compressed directory is pretty common nowadays btw.
â Byte Commander
Jul 27 '16 at 8:21
1
1
Already many great answers, but just one more specific to libreoffice that I've noticed. You create a file of comma separated values (CSV) and save it as "test.csv", a window will open asking what type of separator are you using (i.e., libreoffice Calc). If you rename this file to "test.cs", for example, then libreoffice's Writer opens it. So, besides the ZIP example above, it does seem like libreoffice does make use of the file extension.
â Ray
Jul 27 '16 at 8:31
Already many great answers, but just one more specific to libreoffice that I've noticed. You create a file of comma separated values (CSV) and save it as "test.csv", a window will open asking what type of separator are you using (i.e., libreoffice Calc). If you rename this file to "test.cs", for example, then libreoffice's Writer opens it. So, besides the ZIP example above, it does seem like libreoffice does make use of the file extension.
â Ray
Jul 27 '16 at 8:31
2
2
The linux filesystem doesn't do anything regarding file types. That is all down to the programs running on top of it.
â Peter Green
Jul 27 '16 at 15:29
The linux filesystem doesn't do anything regarding file types. That is all down to the programs running on top of it.
â Peter Green
Jul 27 '16 at 15:29
@PeterGreen Yes, but the fact that the programs do assign it significance means it's not "just for humans" the way, e.g., classic MacOS had it [there were four-byte "file type" and "creator app" fields that weren't part of the file name, so the OS and applications had all the information they needed without looking at file extensions]
â Random832
Jul 27 '16 at 15:36
@PeterGreen Yes, but the fact that the programs do assign it significance means it's not "just for humans" the way, e.g., classic MacOS had it [there were four-byte "file type" and "creator app" fields that weren't part of the file name, so the OS and applications had all the information they needed without looking at file extensions]
â Random832
Jul 27 '16 at 15:36
3
3
@PeterGreen The Windows filesystem doesn't do anything regarding file types either. The graphical shell (Windows Explorer) uses file extension to choose an action for double-click, but technically that's just a program running on top of the OS, just as Nautilus is. It would be perfectly possible to write a Linux file manager with that behaviour, or a Windows one which examined the file contents.
â IMSoP
Jul 27 '16 at 17:19
@PeterGreen The Windows filesystem doesn't do anything regarding file types either. The graphical shell (Windows Explorer) uses file extension to choose an action for double-click, but technically that's just a program running on top of the OS, just as Nautilus is. It would be perfectly possible to write a Linux file manager with that behaviour, or a Windows one which examined the file contents.
â IMSoP
Jul 27 '16 at 17:19
add a comment |Â
up vote
19
down vote
I'd like to take a different approach to this from other answers, and challenge the notion that "Linux" or "Windows" have anything to do with this (bear with me).
The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". The other common conventions for identifying the type of a file are comparing its contents against a database of known signatures (the "magic number" approach), and storing it as an extra attribute on the file system (the approach used in the original MacOS).
Since every file on a Windows or a Linux system has both a name and contents, processes which want to know the file type can use either the "extension" or the "magic number" approaches as they see fit. The metadata approach is not generally available, as there is no standard place for this attribute on most file systems.
On Windows, there is a strong tradition of using the file extension as the primary means of identifying a file; most visibly, the graphical file browser (File Manager on Windows 3.1 and Explorer on modern Windows) uses it when you double-click on a file to determine which application to launch. On Linux (and, more generally, Unix-based systems), there is more tradition for inspecting the contents; most notably, the kernel looks at the beginning of a file executed directly to determine how to run it; script files can indicate an interpreter to use by starting with #!
followed by the path to the interpreter.
These traditions influence UI design of programs written for each system, but there are plenty of exceptions, because each approach has pros and cons in different situations. Reasons to use file extensions rather than examining contents include:
- examining file contents is fairly costly compared to examining file names; so for instance "find all files named *.conf" will be a lot quicker than "find all files whose first line matches this signature"
- file contents can be ambiguous; many file formats are actually just text files treated in a special way, many others are specially-structured zip files, and defining accurate signatures for these can be tricky
- a file can genuinely be valid as more than one type; an HTML file may also be valid XML, a zip file and a GIF concatenated together remain valid for both formats
- magic number matching might lead to false positives; a file format that has no header might happen to begin with the bytes "GIF89a" and be misidentified as a GIF image
- renaming a file can be a convenient way to mark it as "disabled"; e.g. changing "foo.conf" to "foo.conf~" to indicate a backup is easier than editing the file to comment out all of its directives, and more convenient than moving it out of an autoloaded directory; similarly, renaming a .php file to .txt will tell Apache to serve its source as plain text, rather than passing it to the PHP engine
Examples of Linux programs which use file names by default (but may have other modes):
- gzip and gunzip have special handling of any file ending ".gz"
- gcc will handle ".c" files as C, and ".cc" or ".C" as C++
Windows also has a strong tradition of hiding the extension if it's "well known" and even DOS allowed a command to omit .COM, .BAT, and .EXE, automatically searching for those to determine what actual program to execute. There is no such tradition in *nix.
â Monty Harder
Jul 28 '16 at 21:59
1
This should be the accepted answer.
â Ave
Jul 31 '16 at 13:29
This is a much better answer but has one factual error... a script cannot be made executable by placing#!
at the beginning. Any file with its executable bit(s) set can executed in one of several ways.#!/bin/bash
and similar signatures just specify which interpreter to use. If no such signature is supplied, the default shell interpreter is assumed. A file containing nothing but the two words 'Hello World', but with its execution bit set, will attempt to find a 'Hello' command when run.
â DocSalvager
Aug 2 '16 at 21:33
1
@DocSalvager Good catch, that was clumsy wording as much as anything. I've reworded it a bit to make clear that the shebang doesn't make the script executable, it just changes how it is executed.
â IMSoP
Aug 2 '16 at 22:08
add a comment |Â
up vote
19
down vote
I'd like to take a different approach to this from other answers, and challenge the notion that "Linux" or "Windows" have anything to do with this (bear with me).
The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". The other common conventions for identifying the type of a file are comparing its contents against a database of known signatures (the "magic number" approach), and storing it as an extra attribute on the file system (the approach used in the original MacOS).
Since every file on a Windows or a Linux system has both a name and contents, processes which want to know the file type can use either the "extension" or the "magic number" approaches as they see fit. The metadata approach is not generally available, as there is no standard place for this attribute on most file systems.
On Windows, there is a strong tradition of using the file extension as the primary means of identifying a file; most visibly, the graphical file browser (File Manager on Windows 3.1 and Explorer on modern Windows) uses it when you double-click on a file to determine which application to launch. On Linux (and, more generally, Unix-based systems), there is more tradition for inspecting the contents; most notably, the kernel looks at the beginning of a file executed directly to determine how to run it; script files can indicate an interpreter to use by starting with #!
followed by the path to the interpreter.
These traditions influence UI design of programs written for each system, but there are plenty of exceptions, because each approach has pros and cons in different situations. Reasons to use file extensions rather than examining contents include:
- examining file contents is fairly costly compared to examining file names; so for instance "find all files named *.conf" will be a lot quicker than "find all files whose first line matches this signature"
- file contents can be ambiguous; many file formats are actually just text files treated in a special way, many others are specially-structured zip files, and defining accurate signatures for these can be tricky
- a file can genuinely be valid as more than one type; an HTML file may also be valid XML, a zip file and a GIF concatenated together remain valid for both formats
- magic number matching might lead to false positives; a file format that has no header might happen to begin with the bytes "GIF89a" and be misidentified as a GIF image
- renaming a file can be a convenient way to mark it as "disabled"; e.g. changing "foo.conf" to "foo.conf~" to indicate a backup is easier than editing the file to comment out all of its directives, and more convenient than moving it out of an autoloaded directory; similarly, renaming a .php file to .txt will tell Apache to serve its source as plain text, rather than passing it to the PHP engine
Examples of Linux programs which use file names by default (but may have other modes):
- gzip and gunzip have special handling of any file ending ".gz"
- gcc will handle ".c" files as C, and ".cc" or ".C" as C++
Windows also has a strong tradition of hiding the extension if it's "well known" and even DOS allowed a command to omit .COM, .BAT, and .EXE, automatically searching for those to determine what actual program to execute. There is no such tradition in *nix.
â Monty Harder
Jul 28 '16 at 21:59
1
This should be the accepted answer.
â Ave
Jul 31 '16 at 13:29
This is a much better answer but has one factual error... a script cannot be made executable by placing#!
at the beginning. Any file with its executable bit(s) set can executed in one of several ways.#!/bin/bash
and similar signatures just specify which interpreter to use. If no such signature is supplied, the default shell interpreter is assumed. A file containing nothing but the two words 'Hello World', but with its execution bit set, will attempt to find a 'Hello' command when run.
â DocSalvager
Aug 2 '16 at 21:33
1
@DocSalvager Good catch, that was clumsy wording as much as anything. I've reworded it a bit to make clear that the shebang doesn't make the script executable, it just changes how it is executed.
â IMSoP
Aug 2 '16 at 22:08
add a comment |Â
up vote
19
down vote
up vote
19
down vote
I'd like to take a different approach to this from other answers, and challenge the notion that "Linux" or "Windows" have anything to do with this (bear with me).
The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". The other common conventions for identifying the type of a file are comparing its contents against a database of known signatures (the "magic number" approach), and storing it as an extra attribute on the file system (the approach used in the original MacOS).
Since every file on a Windows or a Linux system has both a name and contents, processes which want to know the file type can use either the "extension" or the "magic number" approaches as they see fit. The metadata approach is not generally available, as there is no standard place for this attribute on most file systems.
On Windows, there is a strong tradition of using the file extension as the primary means of identifying a file; most visibly, the graphical file browser (File Manager on Windows 3.1 and Explorer on modern Windows) uses it when you double-click on a file to determine which application to launch. On Linux (and, more generally, Unix-based systems), there is more tradition for inspecting the contents; most notably, the kernel looks at the beginning of a file executed directly to determine how to run it; script files can indicate an interpreter to use by starting with #!
followed by the path to the interpreter.
These traditions influence UI design of programs written for each system, but there are plenty of exceptions, because each approach has pros and cons in different situations. Reasons to use file extensions rather than examining contents include:
- examining file contents is fairly costly compared to examining file names; so for instance "find all files named *.conf" will be a lot quicker than "find all files whose first line matches this signature"
- file contents can be ambiguous; many file formats are actually just text files treated in a special way, many others are specially-structured zip files, and defining accurate signatures for these can be tricky
- a file can genuinely be valid as more than one type; an HTML file may also be valid XML, a zip file and a GIF concatenated together remain valid for both formats
- magic number matching might lead to false positives; a file format that has no header might happen to begin with the bytes "GIF89a" and be misidentified as a GIF image
- renaming a file can be a convenient way to mark it as "disabled"; e.g. changing "foo.conf" to "foo.conf~" to indicate a backup is easier than editing the file to comment out all of its directives, and more convenient than moving it out of an autoloaded directory; similarly, renaming a .php file to .txt will tell Apache to serve its source as plain text, rather than passing it to the PHP engine
Examples of Linux programs which use file names by default (but may have other modes):
- gzip and gunzip have special handling of any file ending ".gz"
- gcc will handle ".c" files as C, and ".cc" or ".C" as C++
I'd like to take a different approach to this from other answers, and challenge the notion that "Linux" or "Windows" have anything to do with this (bear with me).
The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". The other common conventions for identifying the type of a file are comparing its contents against a database of known signatures (the "magic number" approach), and storing it as an extra attribute on the file system (the approach used in the original MacOS).
Since every file on a Windows or a Linux system has both a name and contents, processes which want to know the file type can use either the "extension" or the "magic number" approaches as they see fit. The metadata approach is not generally available, as there is no standard place for this attribute on most file systems.
On Windows, there is a strong tradition of using the file extension as the primary means of identifying a file; most visibly, the graphical file browser (File Manager on Windows 3.1 and Explorer on modern Windows) uses it when you double-click on a file to determine which application to launch. On Linux (and, more generally, Unix-based systems), there is more tradition for inspecting the contents; most notably, the kernel looks at the beginning of a file executed directly to determine how to run it; script files can indicate an interpreter to use by starting with #!
followed by the path to the interpreter.
These traditions influence UI design of programs written for each system, but there are plenty of exceptions, because each approach has pros and cons in different situations. Reasons to use file extensions rather than examining contents include:
- examining file contents is fairly costly compared to examining file names; so for instance "find all files named *.conf" will be a lot quicker than "find all files whose first line matches this signature"
- file contents can be ambiguous; many file formats are actually just text files treated in a special way, many others are specially-structured zip files, and defining accurate signatures for these can be tricky
- a file can genuinely be valid as more than one type; an HTML file may also be valid XML, a zip file and a GIF concatenated together remain valid for both formats
- magic number matching might lead to false positives; a file format that has no header might happen to begin with the bytes "GIF89a" and be misidentified as a GIF image
- renaming a file can be a convenient way to mark it as "disabled"; e.g. changing "foo.conf" to "foo.conf~" to indicate a backup is easier than editing the file to comment out all of its directives, and more convenient than moving it out of an autoloaded directory; similarly, renaming a .php file to .txt will tell Apache to serve its source as plain text, rather than passing it to the PHP engine
Examples of Linux programs which use file names by default (but may have other modes):
- gzip and gunzip have special handling of any file ending ".gz"
- gcc will handle ".c" files as C, and ".cc" or ".C" as C++
edited Aug 2 '16 at 22:07
answered Jul 27 '16 at 17:13
![](https://i.stack.imgur.com/MBRE7.jpg?s=32&g=1)
![](https://i.stack.imgur.com/MBRE7.jpg?s=32&g=1)
IMSoP
63758
63758
Windows also has a strong tradition of hiding the extension if it's "well known" and even DOS allowed a command to omit .COM, .BAT, and .EXE, automatically searching for those to determine what actual program to execute. There is no such tradition in *nix.
â Monty Harder
Jul 28 '16 at 21:59
1
This should be the accepted answer.
â Ave
Jul 31 '16 at 13:29
This is a much better answer but has one factual error... a script cannot be made executable by placing#!
at the beginning. Any file with its executable bit(s) set can executed in one of several ways.#!/bin/bash
and similar signatures just specify which interpreter to use. If no such signature is supplied, the default shell interpreter is assumed. A file containing nothing but the two words 'Hello World', but with its execution bit set, will attempt to find a 'Hello' command when run.
â DocSalvager
Aug 2 '16 at 21:33
1
@DocSalvager Good catch, that was clumsy wording as much as anything. I've reworded it a bit to make clear that the shebang doesn't make the script executable, it just changes how it is executed.
â IMSoP
Aug 2 '16 at 22:08
add a comment |Â
Windows also has a strong tradition of hiding the extension if it's "well known" and even DOS allowed a command to omit .COM, .BAT, and .EXE, automatically searching for those to determine what actual program to execute. There is no such tradition in *nix.
â Monty Harder
Jul 28 '16 at 21:59
1
This should be the accepted answer.
â Ave
Jul 31 '16 at 13:29
This is a much better answer but has one factual error... a script cannot be made executable by placing#!
at the beginning. Any file with its executable bit(s) set can executed in one of several ways.#!/bin/bash
and similar signatures just specify which interpreter to use. If no such signature is supplied, the default shell interpreter is assumed. A file containing nothing but the two words 'Hello World', but with its execution bit set, will attempt to find a 'Hello' command when run.
â DocSalvager
Aug 2 '16 at 21:33
1
@DocSalvager Good catch, that was clumsy wording as much as anything. I've reworded it a bit to make clear that the shebang doesn't make the script executable, it just changes how it is executed.
â IMSoP
Aug 2 '16 at 22:08
Windows also has a strong tradition of hiding the extension if it's "well known" and even DOS allowed a command to omit .COM, .BAT, and .EXE, automatically searching for those to determine what actual program to execute. There is no such tradition in *nix.
â Monty Harder
Jul 28 '16 at 21:59
Windows also has a strong tradition of hiding the extension if it's "well known" and even DOS allowed a command to omit .COM, .BAT, and .EXE, automatically searching for those to determine what actual program to execute. There is no such tradition in *nix.
â Monty Harder
Jul 28 '16 at 21:59
1
1
This should be the accepted answer.
â Ave
Jul 31 '16 at 13:29
This should be the accepted answer.
â Ave
Jul 31 '16 at 13:29
This is a much better answer but has one factual error... a script cannot be made executable by placing
#!
at the beginning. Any file with its executable bit(s) set can executed in one of several ways. #!/bin/bash
and similar signatures just specify which interpreter to use. If no such signature is supplied, the default shell interpreter is assumed. A file containing nothing but the two words 'Hello World', but with its execution bit set, will attempt to find a 'Hello' command when run.â DocSalvager
Aug 2 '16 at 21:33
This is a much better answer but has one factual error... a script cannot be made executable by placing
#!
at the beginning. Any file with its executable bit(s) set can executed in one of several ways. #!/bin/bash
and similar signatures just specify which interpreter to use. If no such signature is supplied, the default shell interpreter is assumed. A file containing nothing but the two words 'Hello World', but with its execution bit set, will attempt to find a 'Hello' command when run.â DocSalvager
Aug 2 '16 at 21:33
1
1
@DocSalvager Good catch, that was clumsy wording as much as anything. I've reworded it a bit to make clear that the shebang doesn't make the script executable, it just changes how it is executed.
â IMSoP
Aug 2 '16 at 22:08
@DocSalvager Good catch, that was clumsy wording as much as anything. I've reworded it a bit to make clear that the shebang doesn't make the script executable, it just changes how it is executed.
â IMSoP
Aug 2 '16 at 22:08
add a comment |Â
up vote
14
down vote
Actually, some technologies do rely on file extensions, so if you use those technologies in Ubuntu, you'll have to rely on extensions too. A few examples:
gcc
uses extensions to distinguish between C an C++ files. Without the extension it's pretty much impossible to differentiate them (imagine a C++ file with no classes).- many files (
docx
,jar
,apk
) are just particularly structured ZIP archives. While you can usually infer the type from the content, it may not always be possible (e.g. Java Manifest is optional injar
files).
Not using file extensions in such cases will only be possible with hacky workarounds and is likely to be very error-prone.
Good on you for mentioning programming, but you got most of the details wrong.gcc
is the front-end for C files, for C++ files you need either theg++
front-end or a command-line switch to specify language. More important is themake
program that decides whether to usegcc
org++
to build a particular file -- andmake
is completely dependent on filename patterns (mostly extensions) for its rule-matching.
â Ben Voigt
Jul 29 '16 at 18:50
@BenVoigt When compiling a file with a.cc
extension withgcc
, it really will be compiled as C++, and this is documented inman gcc
: "For any given input file, the file name suffix determines what kind of compilation is done:" followed by a list of extensions and how they are treated.
â hvd
Jul 30 '16 at 10:53
1
@hvd Then maybe it's the default set of libraries that goes horribly wrong if you don't use the right frontend. Anyway make is the prime example because everything it does is based on file extension.
â Ben Voigt
Jul 30 '16 at 13:28
1
@BenVoigtmake
is a good example too, butgcc
relies just as heavily on filenames. Here's an example clearer than.c
vs.cc
: For C,gcc
uses suffixes to tell if its first step is to preprocess (.c
), compile (.i
), assemble (.s
), or link (.o
). Here, I use-E
,-S
and-c
to tellgcc
where to stop, but it uses filenames to know where to start.gcc something.cc
won't link to the right libraries for C++ but it will treat the file as C++, which is why many users are confused by the error messages they get when making that mistake.
â Eliah Kagan
Jan 24 '17 at 13:56
add a comment |Â
up vote
14
down vote
Actually, some technologies do rely on file extensions, so if you use those technologies in Ubuntu, you'll have to rely on extensions too. A few examples:
gcc
uses extensions to distinguish between C an C++ files. Without the extension it's pretty much impossible to differentiate them (imagine a C++ file with no classes).- many files (
docx
,jar
,apk
) are just particularly structured ZIP archives. While you can usually infer the type from the content, it may not always be possible (e.g. Java Manifest is optional injar
files).
Not using file extensions in such cases will only be possible with hacky workarounds and is likely to be very error-prone.
Good on you for mentioning programming, but you got most of the details wrong.gcc
is the front-end for C files, for C++ files you need either theg++
front-end or a command-line switch to specify language. More important is themake
program that decides whether to usegcc
org++
to build a particular file -- andmake
is completely dependent on filename patterns (mostly extensions) for its rule-matching.
â Ben Voigt
Jul 29 '16 at 18:50
@BenVoigt When compiling a file with a.cc
extension withgcc
, it really will be compiled as C++, and this is documented inman gcc
: "For any given input file, the file name suffix determines what kind of compilation is done:" followed by a list of extensions and how they are treated.
â hvd
Jul 30 '16 at 10:53
1
@hvd Then maybe it's the default set of libraries that goes horribly wrong if you don't use the right frontend. Anyway make is the prime example because everything it does is based on file extension.
â Ben Voigt
Jul 30 '16 at 13:28
1
@BenVoigtmake
is a good example too, butgcc
relies just as heavily on filenames. Here's an example clearer than.c
vs.cc
: For C,gcc
uses suffixes to tell if its first step is to preprocess (.c
), compile (.i
), assemble (.s
), or link (.o
). Here, I use-E
,-S
and-c
to tellgcc
where to stop, but it uses filenames to know where to start.gcc something.cc
won't link to the right libraries for C++ but it will treat the file as C++, which is why many users are confused by the error messages they get when making that mistake.
â Eliah Kagan
Jan 24 '17 at 13:56
add a comment |Â
up vote
14
down vote
up vote
14
down vote
Actually, some technologies do rely on file extensions, so if you use those technologies in Ubuntu, you'll have to rely on extensions too. A few examples:
gcc
uses extensions to distinguish between C an C++ files. Without the extension it's pretty much impossible to differentiate them (imagine a C++ file with no classes).- many files (
docx
,jar
,apk
) are just particularly structured ZIP archives. While you can usually infer the type from the content, it may not always be possible (e.g. Java Manifest is optional injar
files).
Not using file extensions in such cases will only be possible with hacky workarounds and is likely to be very error-prone.
Actually, some technologies do rely on file extensions, so if you use those technologies in Ubuntu, you'll have to rely on extensions too. A few examples:
gcc
uses extensions to distinguish between C an C++ files. Without the extension it's pretty much impossible to differentiate them (imagine a C++ file with no classes).- many files (
docx
,jar
,apk
) are just particularly structured ZIP archives. While you can usually infer the type from the content, it may not always be possible (e.g. Java Manifest is optional injar
files).
Not using file extensions in such cases will only be possible with hacky workarounds and is likely to be very error-prone.
answered Jul 27 '16 at 15:52
![](https://i.stack.imgur.com/IvPNx.png?s=32&g=1)
![](https://i.stack.imgur.com/IvPNx.png?s=32&g=1)
Dmitry Grigoryev
1,556619
1,556619
Good on you for mentioning programming, but you got most of the details wrong.gcc
is the front-end for C files, for C++ files you need either theg++
front-end or a command-line switch to specify language. More important is themake
program that decides whether to usegcc
org++
to build a particular file -- andmake
is completely dependent on filename patterns (mostly extensions) for its rule-matching.
â Ben Voigt
Jul 29 '16 at 18:50
@BenVoigt When compiling a file with a.cc
extension withgcc
, it really will be compiled as C++, and this is documented inman gcc
: "For any given input file, the file name suffix determines what kind of compilation is done:" followed by a list of extensions and how they are treated.
â hvd
Jul 30 '16 at 10:53
1
@hvd Then maybe it's the default set of libraries that goes horribly wrong if you don't use the right frontend. Anyway make is the prime example because everything it does is based on file extension.
â Ben Voigt
Jul 30 '16 at 13:28
1
@BenVoigtmake
is a good example too, butgcc
relies just as heavily on filenames. Here's an example clearer than.c
vs.cc
: For C,gcc
uses suffixes to tell if its first step is to preprocess (.c
), compile (.i
), assemble (.s
), or link (.o
). Here, I use-E
,-S
and-c
to tellgcc
where to stop, but it uses filenames to know where to start.gcc something.cc
won't link to the right libraries for C++ but it will treat the file as C++, which is why many users are confused by the error messages they get when making that mistake.
â Eliah Kagan
Jan 24 '17 at 13:56
add a comment |Â
Good on you for mentioning programming, but you got most of the details wrong.gcc
is the front-end for C files, for C++ files you need either theg++
front-end or a command-line switch to specify language. More important is themake
program that decides whether to usegcc
org++
to build a particular file -- andmake
is completely dependent on filename patterns (mostly extensions) for its rule-matching.
â Ben Voigt
Jul 29 '16 at 18:50
@BenVoigt When compiling a file with a.cc
extension withgcc
, it really will be compiled as C++, and this is documented inman gcc
: "For any given input file, the file name suffix determines what kind of compilation is done:" followed by a list of extensions and how they are treated.
â hvd
Jul 30 '16 at 10:53
1
@hvd Then maybe it's the default set of libraries that goes horribly wrong if you don't use the right frontend. Anyway make is the prime example because everything it does is based on file extension.
â Ben Voigt
Jul 30 '16 at 13:28
1
@BenVoigtmake
is a good example too, butgcc
relies just as heavily on filenames. Here's an example clearer than.c
vs.cc
: For C,gcc
uses suffixes to tell if its first step is to preprocess (.c
), compile (.i
), assemble (.s
), or link (.o
). Here, I use-E
,-S
and-c
to tellgcc
where to stop, but it uses filenames to know where to start.gcc something.cc
won't link to the right libraries for C++ but it will treat the file as C++, which is why many users are confused by the error messages they get when making that mistake.
â Eliah Kagan
Jan 24 '17 at 13:56
Good on you for mentioning programming, but you got most of the details wrong.
gcc
is the front-end for C files, for C++ files you need either the g++
front-end or a command-line switch to specify language. More important is the make
program that decides whether to use gcc
or g++
to build a particular file -- and make
is completely dependent on filename patterns (mostly extensions) for its rule-matching.â Ben Voigt
Jul 29 '16 at 18:50
Good on you for mentioning programming, but you got most of the details wrong.
gcc
is the front-end for C files, for C++ files you need either the g++
front-end or a command-line switch to specify language. More important is the make
program that decides whether to use gcc
or g++
to build a particular file -- and make
is completely dependent on filename patterns (mostly extensions) for its rule-matching.â Ben Voigt
Jul 29 '16 at 18:50
@BenVoigt When compiling a file with a
.cc
extension with gcc
, it really will be compiled as C++, and this is documented in man gcc
: "For any given input file, the file name suffix determines what kind of compilation is done:" followed by a list of extensions and how they are treated.â hvd
Jul 30 '16 at 10:53
@BenVoigt When compiling a file with a
.cc
extension with gcc
, it really will be compiled as C++, and this is documented in man gcc
: "For any given input file, the file name suffix determines what kind of compilation is done:" followed by a list of extensions and how they are treated.â hvd
Jul 30 '16 at 10:53
1
1
@hvd Then maybe it's the default set of libraries that goes horribly wrong if you don't use the right frontend. Anyway make is the prime example because everything it does is based on file extension.
â Ben Voigt
Jul 30 '16 at 13:28
@hvd Then maybe it's the default set of libraries that goes horribly wrong if you don't use the right frontend. Anyway make is the prime example because everything it does is based on file extension.
â Ben Voigt
Jul 30 '16 at 13:28
1
1
@BenVoigt
make
is a good example too, but gcc
relies just as heavily on filenames. Here's an example clearer than .c
vs .cc
: For C, gcc
uses suffixes to tell if its first step is to preprocess (.c
), compile (.i
), assemble (.s
), or link (.o
). Here, I use -E
, -S
and -c
to tell gcc
where to stop, but it uses filenames to know where to start. gcc something.cc
won't link to the right libraries for C++ but it will treat the file as C++, which is why many users are confused by the error messages they get when making that mistake.â Eliah Kagan
Jan 24 '17 at 13:56
@BenVoigt
make
is a good example too, but gcc
relies just as heavily on filenames. Here's an example clearer than .c
vs .cc
: For C, gcc
uses suffixes to tell if its first step is to preprocess (.c
), compile (.i
), assemble (.s
), or link (.o
). Here, I use -E
, -S
and -c
to tell gcc
where to stop, but it uses filenames to know where to start. gcc something.cc
won't link to the right libraries for C++ but it will treat the file as C++, which is why many users are confused by the error messages they get when making that mistake.â Eliah Kagan
Jan 24 '17 at 13:56
add a comment |Â
up vote
7
down vote
Your first assumption is correct: the extensions on Linux do not matter and only are useful for humans( and other non-Unix-like OS that care about extensions ). The type of a file is determined by first 32 bits of data in the file , which is known as magic number
This is why shell scripts need #!
line - to tell operating system what interpreter to call. Without it , the shell script is just text file.
As far as file managers go, they do want to know extensions of some files, such as .desktop
files , which basically same as Window's version of shortcuts but with more capabilities. But as far as OS is concerned, it needs to know what's in the file, not what's in its name
3
This isn't quite true. There are programs that expect a specific extension. The most commonly used example is probablygunzip
which won't decompress a file if it isn't calledfoo.gz
.
â terdonâ¦
Jul 27 '16 at 9:27
That's an implementation of specific software. For the most part, utilities on unix-like systems don't expect an extension.
â Sergiy Kolodyazhnyy
Jul 27 '16 at 9:36
7
For the most part they don't, no. Your first sentence, however, claims that they are never used and only matter to humans. That isn't entirely true.gunzip
is one example,eog
is another. Also, many tools won't autocomplete names without the right extension. All I'm saying is that it's a bit more complicated than "extensions are always irrelevant".
â terdonâ¦
Jul 27 '16 at 9:40
1
1 small issue: OP asked about the operating system. 'gunzip' and 'eog' are not the operating system but decided to create their own restrictions (in case of gunzip) or method (eog). "mime types" though.
â Rinzwind
Jul 27 '16 at 17:49
1
@Serg Sure, you can define OS narrowly, and get a trivial answer to the question. It's not a particularly helpful answer, though, because the vast majority of what a user does with a computer involves software you've excluded. Note that the question contrasted "only for humans" against "the operating-system"; I don't think they meant "the kernel".
â IMSoP
Jul 29 '16 at 9:06
 |Â
show 7 more comments
up vote
7
down vote
Your first assumption is correct: the extensions on Linux do not matter and only are useful for humans( and other non-Unix-like OS that care about extensions ). The type of a file is determined by first 32 bits of data in the file , which is known as magic number
This is why shell scripts need #!
line - to tell operating system what interpreter to call. Without it , the shell script is just text file.
As far as file managers go, they do want to know extensions of some files, such as .desktop
files , which basically same as Window's version of shortcuts but with more capabilities. But as far as OS is concerned, it needs to know what's in the file, not what's in its name
3
This isn't quite true. There are programs that expect a specific extension. The most commonly used example is probablygunzip
which won't decompress a file if it isn't calledfoo.gz
.
â terdonâ¦
Jul 27 '16 at 9:27
That's an implementation of specific software. For the most part, utilities on unix-like systems don't expect an extension.
â Sergiy Kolodyazhnyy
Jul 27 '16 at 9:36
7
For the most part they don't, no. Your first sentence, however, claims that they are never used and only matter to humans. That isn't entirely true.gunzip
is one example,eog
is another. Also, many tools won't autocomplete names without the right extension. All I'm saying is that it's a bit more complicated than "extensions are always irrelevant".
â terdonâ¦
Jul 27 '16 at 9:40
1
1 small issue: OP asked about the operating system. 'gunzip' and 'eog' are not the operating system but decided to create their own restrictions (in case of gunzip) or method (eog). "mime types" though.
â Rinzwind
Jul 27 '16 at 17:49
1
@Serg Sure, you can define OS narrowly, and get a trivial answer to the question. It's not a particularly helpful answer, though, because the vast majority of what a user does with a computer involves software you've excluded. Note that the question contrasted "only for humans" against "the operating-system"; I don't think they meant "the kernel".
â IMSoP
Jul 29 '16 at 9:06
 |Â
show 7 more comments
up vote
7
down vote
up vote
7
down vote
Your first assumption is correct: the extensions on Linux do not matter and only are useful for humans( and other non-Unix-like OS that care about extensions ). The type of a file is determined by first 32 bits of data in the file , which is known as magic number
This is why shell scripts need #!
line - to tell operating system what interpreter to call. Without it , the shell script is just text file.
As far as file managers go, they do want to know extensions of some files, such as .desktop
files , which basically same as Window's version of shortcuts but with more capabilities. But as far as OS is concerned, it needs to know what's in the file, not what's in its name
Your first assumption is correct: the extensions on Linux do not matter and only are useful for humans( and other non-Unix-like OS that care about extensions ). The type of a file is determined by first 32 bits of data in the file , which is known as magic number
This is why shell scripts need #!
line - to tell operating system what interpreter to call. Without it , the shell script is just text file.
As far as file managers go, they do want to know extensions of some files, such as .desktop
files , which basically same as Window's version of shortcuts but with more capabilities. But as far as OS is concerned, it needs to know what's in the file, not what's in its name
answered Jul 27 '16 at 7:02
![](https://i.stack.imgur.com/U1Jy6.jpg?s=32&g=1)
![](https://i.stack.imgur.com/U1Jy6.jpg?s=32&g=1)
Sergiy Kolodyazhnyy
64.5k9129280
64.5k9129280
3
This isn't quite true. There are programs that expect a specific extension. The most commonly used example is probablygunzip
which won't decompress a file if it isn't calledfoo.gz
.
â terdonâ¦
Jul 27 '16 at 9:27
That's an implementation of specific software. For the most part, utilities on unix-like systems don't expect an extension.
â Sergiy Kolodyazhnyy
Jul 27 '16 at 9:36
7
For the most part they don't, no. Your first sentence, however, claims that they are never used and only matter to humans. That isn't entirely true.gunzip
is one example,eog
is another. Also, many tools won't autocomplete names without the right extension. All I'm saying is that it's a bit more complicated than "extensions are always irrelevant".
â terdonâ¦
Jul 27 '16 at 9:40
1
1 small issue: OP asked about the operating system. 'gunzip' and 'eog' are not the operating system but decided to create their own restrictions (in case of gunzip) or method (eog). "mime types" though.
â Rinzwind
Jul 27 '16 at 17:49
1
@Serg Sure, you can define OS narrowly, and get a trivial answer to the question. It's not a particularly helpful answer, though, because the vast majority of what a user does with a computer involves software you've excluded. Note that the question contrasted "only for humans" against "the operating-system"; I don't think they meant "the kernel".
â IMSoP
Jul 29 '16 at 9:06
 |Â
show 7 more comments
3
This isn't quite true. There are programs that expect a specific extension. The most commonly used example is probablygunzip
which won't decompress a file if it isn't calledfoo.gz
.
â terdonâ¦
Jul 27 '16 at 9:27
That's an implementation of specific software. For the most part, utilities on unix-like systems don't expect an extension.
â Sergiy Kolodyazhnyy
Jul 27 '16 at 9:36
7
For the most part they don't, no. Your first sentence, however, claims that they are never used and only matter to humans. That isn't entirely true.gunzip
is one example,eog
is another. Also, many tools won't autocomplete names without the right extension. All I'm saying is that it's a bit more complicated than "extensions are always irrelevant".
â terdonâ¦
Jul 27 '16 at 9:40
1
1 small issue: OP asked about the operating system. 'gunzip' and 'eog' are not the operating system but decided to create their own restrictions (in case of gunzip) or method (eog). "mime types" though.
â Rinzwind
Jul 27 '16 at 17:49
1
@Serg Sure, you can define OS narrowly, and get a trivial answer to the question. It's not a particularly helpful answer, though, because the vast majority of what a user does with a computer involves software you've excluded. Note that the question contrasted "only for humans" against "the operating-system"; I don't think they meant "the kernel".
â IMSoP
Jul 29 '16 at 9:06
3
3
This isn't quite true. There are programs that expect a specific extension. The most commonly used example is probably
gunzip
which won't decompress a file if it isn't called foo.gz
.â terdonâ¦
Jul 27 '16 at 9:27
This isn't quite true. There are programs that expect a specific extension. The most commonly used example is probably
gunzip
which won't decompress a file if it isn't called foo.gz
.â terdonâ¦
Jul 27 '16 at 9:27
That's an implementation of specific software. For the most part, utilities on unix-like systems don't expect an extension.
â Sergiy Kolodyazhnyy
Jul 27 '16 at 9:36
That's an implementation of specific software. For the most part, utilities on unix-like systems don't expect an extension.
â Sergiy Kolodyazhnyy
Jul 27 '16 at 9:36
7
7
For the most part they don't, no. Your first sentence, however, claims that they are never used and only matter to humans. That isn't entirely true.
gunzip
is one example, eog
is another. Also, many tools won't autocomplete names without the right extension. All I'm saying is that it's a bit more complicated than "extensions are always irrelevant".â terdonâ¦
Jul 27 '16 at 9:40
For the most part they don't, no. Your first sentence, however, claims that they are never used and only matter to humans. That isn't entirely true.
gunzip
is one example, eog
is another. Also, many tools won't autocomplete names without the right extension. All I'm saying is that it's a bit more complicated than "extensions are always irrelevant".â terdonâ¦
Jul 27 '16 at 9:40
1
1
1 small issue: OP asked about the operating system. 'gunzip' and 'eog' are not the operating system but decided to create their own restrictions (in case of gunzip) or method (eog). "mime types" though.
â Rinzwind
Jul 27 '16 at 17:49
1 small issue: OP asked about the operating system. 'gunzip' and 'eog' are not the operating system but decided to create their own restrictions (in case of gunzip) or method (eog). "mime types" though.
â Rinzwind
Jul 27 '16 at 17:49
1
1
@Serg Sure, you can define OS narrowly, and get a trivial answer to the question. It's not a particularly helpful answer, though, because the vast majority of what a user does with a computer involves software you've excluded. Note that the question contrasted "only for humans" against "the operating-system"; I don't think they meant "the kernel".
â IMSoP
Jul 29 '16 at 9:06
@Serg Sure, you can define OS narrowly, and get a trivial answer to the question. It's not a particularly helpful answer, though, because the vast majority of what a user does with a computer involves software you've excluded. Note that the question contrasted "only for humans" against "the operating-system"; I don't think they meant "the kernel".
â IMSoP
Jul 29 '16 at 9:06
 |Â
show 7 more comments
up vote
5
down vote
This is a too big for a comment answer.
Keep in mind that even "extension" has a lot if different meanings.
What your talking about seems to be the 3 letters after the . DOS made the 8.3 format really popular and windows uses the .3 part to this day.
Linux has a lot of files like .conf or .list or .d or .c that have meaning, but are not really extensions in the 8.3 sense. For example Apache looks at /etc/apache2/sites-enabled/website.conf for it's configuration directive. While the system uses MIME Types and content headers and what not to determine it's a text file, Apache (by default) still isn't going to load it without it ending in .conf.
.c is another great one. Yep it's a text file, but gcc depends on main.c becoming main.o and finally main (after linking). At no time does the system use the .c, .o or no extension to have any meaning as far as content, but the stuff after the . does have some meaning. You would probably setup your SCM to ignore main.o and main.
Point being is this: Extensions are not used the way they are in windows. The kernel will not execute a .txt file because you remove the .txt part of the name. It is also very happy to execute a .txt file if the execute permission is set. That being said, they do have meaning, and are still used on a "computer level" for many things.
1
Windows is also not bound to thex.3
naming scheme any more, you have got longer extensions there as well like.doxc
,.torrent
,.part
, etc. It's just that many file formats and extensions were already defined back in the time when 8.3 naming was still a thing and later formats mostly simply adapted the convention of using up to 3 letters.
â Byte Commander
Jul 27 '16 at 10:07
I don't see how ".conf", ".c", etc, are "a different meaning" from "the 8.3 sense". The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". Not even DOS / Win3.1 required the correct extension (you could call a Word document "STUPIDN.AME" and open it with Ctrl-O in WinWord). It's just that some systems (e.g. double-click on Windows,gzip
, your Makefile, etc) may be written to use this convention to make assumptions about the correct action to take on each file.
â IMSoP
Jul 27 '16 at 16:42
@ByteCommander That's true, but the extension still determines the app used. I am not sure how to edit the answer to reflect that.
â coteyr
Jul 27 '16 at 20:16
1
@coteyr Again, it all depends what we mean by "the OS". The File Manager will certainly look up a registry key for "AME", and will tell me that "foo.txt" is a text file. But runningdir
at a command prompt will tell me no such thing; it simply won't care. Executing files is certainly an exception, on both OSes; if the question was limited to those, the answer would be that DOS/Windows only care about the name, and Unix/Linux only care about the execute permission and the first bytes of the file. Outside of that, there is always some application choosing a convention to follow.
â IMSoP
Jul 27 '16 at 21:07
1
@coteyr You forgot *.scr (screen saver binary) in Windows 3.1 and up. That said, the file extension even in DOS/Windows systems even for executables is still just a convenience. The specifics depend very much on where you draw the line of "operating system", but you can always load a binary into memory and jump into it yourself, doing the work that one normally asks of the OS to do. In MS-DOS, if you look through command.com, I'm pretty sure there's a list like EXE COM that you can edit such that it looks for other extensions if none is specified (not saying it'd be a good idea, mind you).
â Michael Kjörling
Jul 30 '16 at 11:59
 |Â
show 1 more comment
up vote
5
down vote
This is a too big for a comment answer.
Keep in mind that even "extension" has a lot if different meanings.
What your talking about seems to be the 3 letters after the . DOS made the 8.3 format really popular and windows uses the .3 part to this day.
Linux has a lot of files like .conf or .list or .d or .c that have meaning, but are not really extensions in the 8.3 sense. For example Apache looks at /etc/apache2/sites-enabled/website.conf for it's configuration directive. While the system uses MIME Types and content headers and what not to determine it's a text file, Apache (by default) still isn't going to load it without it ending in .conf.
.c is another great one. Yep it's a text file, but gcc depends on main.c becoming main.o and finally main (after linking). At no time does the system use the .c, .o or no extension to have any meaning as far as content, but the stuff after the . does have some meaning. You would probably setup your SCM to ignore main.o and main.
Point being is this: Extensions are not used the way they are in windows. The kernel will not execute a .txt file because you remove the .txt part of the name. It is also very happy to execute a .txt file if the execute permission is set. That being said, they do have meaning, and are still used on a "computer level" for many things.
1
Windows is also not bound to thex.3
naming scheme any more, you have got longer extensions there as well like.doxc
,.torrent
,.part
, etc. It's just that many file formats and extensions were already defined back in the time when 8.3 naming was still a thing and later formats mostly simply adapted the convention of using up to 3 letters.
â Byte Commander
Jul 27 '16 at 10:07
I don't see how ".conf", ".c", etc, are "a different meaning" from "the 8.3 sense". The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". Not even DOS / Win3.1 required the correct extension (you could call a Word document "STUPIDN.AME" and open it with Ctrl-O in WinWord). It's just that some systems (e.g. double-click on Windows,gzip
, your Makefile, etc) may be written to use this convention to make assumptions about the correct action to take on each file.
â IMSoP
Jul 27 '16 at 16:42
@ByteCommander That's true, but the extension still determines the app used. I am not sure how to edit the answer to reflect that.
â coteyr
Jul 27 '16 at 20:16
1
@coteyr Again, it all depends what we mean by "the OS". The File Manager will certainly look up a registry key for "AME", and will tell me that "foo.txt" is a text file. But runningdir
at a command prompt will tell me no such thing; it simply won't care. Executing files is certainly an exception, on both OSes; if the question was limited to those, the answer would be that DOS/Windows only care about the name, and Unix/Linux only care about the execute permission and the first bytes of the file. Outside of that, there is always some application choosing a convention to follow.
â IMSoP
Jul 27 '16 at 21:07
1
@coteyr You forgot *.scr (screen saver binary) in Windows 3.1 and up. That said, the file extension even in DOS/Windows systems even for executables is still just a convenience. The specifics depend very much on where you draw the line of "operating system", but you can always load a binary into memory and jump into it yourself, doing the work that one normally asks of the OS to do. In MS-DOS, if you look through command.com, I'm pretty sure there's a list like EXE COM that you can edit such that it looks for other extensions if none is specified (not saying it'd be a good idea, mind you).
â Michael Kjörling
Jul 30 '16 at 11:59
 |Â
show 1 more comment
up vote
5
down vote
up vote
5
down vote
This is a too big for a comment answer.
Keep in mind that even "extension" has a lot if different meanings.
What your talking about seems to be the 3 letters after the . DOS made the 8.3 format really popular and windows uses the .3 part to this day.
Linux has a lot of files like .conf or .list or .d or .c that have meaning, but are not really extensions in the 8.3 sense. For example Apache looks at /etc/apache2/sites-enabled/website.conf for it's configuration directive. While the system uses MIME Types and content headers and what not to determine it's a text file, Apache (by default) still isn't going to load it without it ending in .conf.
.c is another great one. Yep it's a text file, but gcc depends on main.c becoming main.o and finally main (after linking). At no time does the system use the .c, .o or no extension to have any meaning as far as content, but the stuff after the . does have some meaning. You would probably setup your SCM to ignore main.o and main.
Point being is this: Extensions are not used the way they are in windows. The kernel will not execute a .txt file because you remove the .txt part of the name. It is also very happy to execute a .txt file if the execute permission is set. That being said, they do have meaning, and are still used on a "computer level" for many things.
This is a too big for a comment answer.
Keep in mind that even "extension" has a lot if different meanings.
What your talking about seems to be the 3 letters after the . DOS made the 8.3 format really popular and windows uses the .3 part to this day.
Linux has a lot of files like .conf or .list or .d or .c that have meaning, but are not really extensions in the 8.3 sense. For example Apache looks at /etc/apache2/sites-enabled/website.conf for it's configuration directive. While the system uses MIME Types and content headers and what not to determine it's a text file, Apache (by default) still isn't going to load it without it ending in .conf.
.c is another great one. Yep it's a text file, but gcc depends on main.c becoming main.o and finally main (after linking). At no time does the system use the .c, .o or no extension to have any meaning as far as content, but the stuff after the . does have some meaning. You would probably setup your SCM to ignore main.o and main.
Point being is this: Extensions are not used the way they are in windows. The kernel will not execute a .txt file because you remove the .txt part of the name. It is also very happy to execute a .txt file if the execute permission is set. That being said, they do have meaning, and are still used on a "computer level" for many things.
answered Jul 27 '16 at 9:15
coteyr
11.7k52349
11.7k52349
1
Windows is also not bound to thex.3
naming scheme any more, you have got longer extensions there as well like.doxc
,.torrent
,.part
, etc. It's just that many file formats and extensions were already defined back in the time when 8.3 naming was still a thing and later formats mostly simply adapted the convention of using up to 3 letters.
â Byte Commander
Jul 27 '16 at 10:07
I don't see how ".conf", ".c", etc, are "a different meaning" from "the 8.3 sense". The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". Not even DOS / Win3.1 required the correct extension (you could call a Word document "STUPIDN.AME" and open it with Ctrl-O in WinWord). It's just that some systems (e.g. double-click on Windows,gzip
, your Makefile, etc) may be written to use this convention to make assumptions about the correct action to take on each file.
â IMSoP
Jul 27 '16 at 16:42
@ByteCommander That's true, but the extension still determines the app used. I am not sure how to edit the answer to reflect that.
â coteyr
Jul 27 '16 at 20:16
1
@coteyr Again, it all depends what we mean by "the OS". The File Manager will certainly look up a registry key for "AME", and will tell me that "foo.txt" is a text file. But runningdir
at a command prompt will tell me no such thing; it simply won't care. Executing files is certainly an exception, on both OSes; if the question was limited to those, the answer would be that DOS/Windows only care about the name, and Unix/Linux only care about the execute permission and the first bytes of the file. Outside of that, there is always some application choosing a convention to follow.
â IMSoP
Jul 27 '16 at 21:07
1
@coteyr You forgot *.scr (screen saver binary) in Windows 3.1 and up. That said, the file extension even in DOS/Windows systems even for executables is still just a convenience. The specifics depend very much on where you draw the line of "operating system", but you can always load a binary into memory and jump into it yourself, doing the work that one normally asks of the OS to do. In MS-DOS, if you look through command.com, I'm pretty sure there's a list like EXE COM that you can edit such that it looks for other extensions if none is specified (not saying it'd be a good idea, mind you).
â Michael Kjörling
Jul 30 '16 at 11:59
 |Â
show 1 more comment
1
Windows is also not bound to thex.3
naming scheme any more, you have got longer extensions there as well like.doxc
,.torrent
,.part
, etc. It's just that many file formats and extensions were already defined back in the time when 8.3 naming was still a thing and later formats mostly simply adapted the convention of using up to 3 letters.
â Byte Commander
Jul 27 '16 at 10:07
I don't see how ".conf", ".c", etc, are "a different meaning" from "the 8.3 sense". The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". Not even DOS / Win3.1 required the correct extension (you could call a Word document "STUPIDN.AME" and open it with Ctrl-O in WinWord). It's just that some systems (e.g. double-click on Windows,gzip
, your Makefile, etc) may be written to use this convention to make assumptions about the correct action to take on each file.
â IMSoP
Jul 27 '16 at 16:42
@ByteCommander That's true, but the extension still determines the app used. I am not sure how to edit the answer to reflect that.
â coteyr
Jul 27 '16 at 20:16
1
@coteyr Again, it all depends what we mean by "the OS". The File Manager will certainly look up a registry key for "AME", and will tell me that "foo.txt" is a text file. But runningdir
at a command prompt will tell me no such thing; it simply won't care. Executing files is certainly an exception, on both OSes; if the question was limited to those, the answer would be that DOS/Windows only care about the name, and Unix/Linux only care about the execute permission and the first bytes of the file. Outside of that, there is always some application choosing a convention to follow.
â IMSoP
Jul 27 '16 at 21:07
1
@coteyr You forgot *.scr (screen saver binary) in Windows 3.1 and up. That said, the file extension even in DOS/Windows systems even for executables is still just a convenience. The specifics depend very much on where you draw the line of "operating system", but you can always load a binary into memory and jump into it yourself, doing the work that one normally asks of the OS to do. In MS-DOS, if you look through command.com, I'm pretty sure there's a list like EXE COM that you can edit such that it looks for other extensions if none is specified (not saying it'd be a good idea, mind you).
â Michael Kjörling
Jul 30 '16 at 11:59
1
1
Windows is also not bound to the
x.3
naming scheme any more, you have got longer extensions there as well like .doxc
, .torrent
, .part
, etc. It's just that many file formats and extensions were already defined back in the time when 8.3 naming was still a thing and later formats mostly simply adapted the convention of using up to 3 letters.â Byte Commander
Jul 27 '16 at 10:07
Windows is also not bound to the
x.3
naming scheme any more, you have got longer extensions there as well like .doxc
, .torrent
, .part
, etc. It's just that many file formats and extensions were already defined back in the time when 8.3 naming was still a thing and later formats mostly simply adapted the convention of using up to 3 letters.â Byte Commander
Jul 27 '16 at 10:07
I don't see how ".conf", ".c", etc, are "a different meaning" from "the 8.3 sense". The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". Not even DOS / Win3.1 required the correct extension (you could call a Word document "STUPIDN.AME" and open it with Ctrl-O in WinWord). It's just that some systems (e.g. double-click on Windows,
gzip
, your Makefile, etc) may be written to use this convention to make assumptions about the correct action to take on each file.â IMSoP
Jul 27 '16 at 16:42
I don't see how ".conf", ".c", etc, are "a different meaning" from "the 8.3 sense". The concept of a file extension can be simply expressed as "a convention for identifying the type of a file based on part of its name". Not even DOS / Win3.1 required the correct extension (you could call a Word document "STUPIDN.AME" and open it with Ctrl-O in WinWord). It's just that some systems (e.g. double-click on Windows,
gzip
, your Makefile, etc) may be written to use this convention to make assumptions about the correct action to take on each file.â IMSoP
Jul 27 '16 at 16:42
@ByteCommander That's true, but the extension still determines the app used. I am not sure how to edit the answer to reflect that.
â coteyr
Jul 27 '16 at 20:16
@ByteCommander That's true, but the extension still determines the app used. I am not sure how to edit the answer to reflect that.
â coteyr
Jul 27 '16 at 20:16
1
1
@coteyr Again, it all depends what we mean by "the OS". The File Manager will certainly look up a registry key for "AME", and will tell me that "foo.txt" is a text file. But running
dir
at a command prompt will tell me no such thing; it simply won't care. Executing files is certainly an exception, on both OSes; if the question was limited to those, the answer would be that DOS/Windows only care about the name, and Unix/Linux only care about the execute permission and the first bytes of the file. Outside of that, there is always some application choosing a convention to follow.â IMSoP
Jul 27 '16 at 21:07
@coteyr Again, it all depends what we mean by "the OS". The File Manager will certainly look up a registry key for "AME", and will tell me that "foo.txt" is a text file. But running
dir
at a command prompt will tell me no such thing; it simply won't care. Executing files is certainly an exception, on both OSes; if the question was limited to those, the answer would be that DOS/Windows only care about the name, and Unix/Linux only care about the execute permission and the first bytes of the file. Outside of that, there is always some application choosing a convention to follow.â IMSoP
Jul 27 '16 at 21:07
1
1
@coteyr You forgot *.scr (screen saver binary) in Windows 3.1 and up. That said, the file extension even in DOS/Windows systems even for executables is still just a convenience. The specifics depend very much on where you draw the line of "operating system", but you can always load a binary into memory and jump into it yourself, doing the work that one normally asks of the OS to do. In MS-DOS, if you look through command.com, I'm pretty sure there's a list like EXE COM that you can edit such that it looks for other extensions if none is specified (not saying it'd be a good idea, mind you).
â Michael Kjörling
Jul 30 '16 at 11:59
@coteyr You forgot *.scr (screen saver binary) in Windows 3.1 and up. That said, the file extension even in DOS/Windows systems even for executables is still just a convenience. The specifics depend very much on where you draw the line of "operating system", but you can always load a binary into memory and jump into it yourself, doing the work that one normally asks of the OS to do. In MS-DOS, if you look through command.com, I'm pretty sure there's a list like EXE COM that you can edit such that it looks for other extensions if none is specified (not saying it'd be a good idea, mind you).
â Michael Kjörling
Jul 30 '16 at 11:59
 |Â
show 1 more comment
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
var $window = $(window),
onScroll = function(e)
var $elem = $('.new-login-left'),
docViewTop = $window.scrollTop(),
docViewBottom = docViewTop + $window.height(),
elemTop = $elem.offset().top,
elemBottom = elemTop + $elem.height();
if ((docViewTop elemBottom))
StackExchange.using('gps', function() StackExchange.gps.track('embedded_signup_form.view', location: 'question_page' ); );
$window.unbind('scroll', onScroll);
;
$window.on('scroll', onScroll);
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2faskubuntu.com%2fquestions%2f803434%2fdo-file-extensions-have-any-purpose-for-the-operating-system%23new-answer', 'question_page');
);
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
var $window = $(window),
onScroll = function(e)
var $elem = $('.new-login-left'),
docViewTop = $window.scrollTop(),
docViewBottom = docViewTop + $window.height(),
elemTop = $elem.offset().top,
elemBottom = elemTop + $elem.height();
if ((docViewTop elemBottom))
StackExchange.using('gps', function() StackExchange.gps.track('embedded_signup_form.view', location: 'question_page' ); );
$window.unbind('scroll', onScroll);
;
$window.on('scroll', onScroll);
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
var $window = $(window),
onScroll = function(e)
var $elem = $('.new-login-left'),
docViewTop = $window.scrollTop(),
docViewBottom = docViewTop + $window.height(),
elemTop = $elem.offset().top,
elemBottom = elemTop + $elem.height();
if ((docViewTop elemBottom))
StackExchange.using('gps', function() StackExchange.gps.track('embedded_signup_form.view', location: 'question_page' ); );
$window.unbind('scroll', onScroll);
;
$window.on('scroll', onScroll);
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
var $window = $(window),
onScroll = function(e)
var $elem = $('.new-login-left'),
docViewTop = $window.scrollTop(),
docViewBottom = docViewTop + $window.height(),
elemTop = $elem.offset().top,
elemBottom = elemTop + $elem.height();
if ((docViewTop elemBottom))
StackExchange.using('gps', function() StackExchange.gps.track('embedded_signup_form.view', location: 'question_page' ); );
$window.unbind('scroll', onScroll);
;
$window.on('scroll', onScroll);
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
5
If you don't get a good response here, remember there is also unix.stackexchange.com
â mchid
Jul 27 '16 at 6:51
@mchid Thanks a lot. That's very kind of you.
â mizech
Jul 27 '16 at 6:52
Related, almost duplicate: askubuntu.com/questions/390015/â¦
â Zzzach...
Jul 27 '16 at 7:01
5
In Windows they do, in Linux/Unix they mostly don't. The main exception are the compression-programs -
gzip
,bzip2
,xz
- and so on. These programs uses suffixes to separate the compressed version of a file from the uncompressed one they replace. Compression-programs will often complain about incorrect suffix, even though the file actually is a compressed file of the type it should handle.â Baard Kopperud
Jul 27 '16 at 10:34
5
I think part of the problem with this question is that "the operating-system" is not a well-defined concept. What is part of the operating system, and what is an application on top of it? Not many parts of the OS (whichever OS we're talking about) care what type a file is - they just do what they're told. So distinctions about how they know are irrelevant; they do neither. Applcations, on the other hand, may well do one or both things.
â IMSoP
Jul 27 '16 at 17:23