GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:16:15 Sequence gi568815592f:35705671_35914780 : 209110 bp : 44.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7043 7052 10 1 1 103 91 2 0.374 2.56 1.02 Intr + 25818 25915 98 1 2 75 73 49 0.000 1.83 1.03 Intr + 29793 29939 147 2 0 68 96 30 0.002 2.23 1.04 Intr + 31424 31601 178 1 1 24 86 223 0.006 15.19 1.05 Intr + 32357 32502 146 0 2 18 94 142 0.962 7.80 1.06 Intr + 32714 32848 135 1 0 85 99 175 0.997 19.06 1.07 Intr + 41591 41764 174 1 0 96 51 144 0.992 11.64 1.08 Intr + 41906 41977 72 1 0 111 94 123 0.999 14.80 1.09 Term + 42868 43200 333 0 0 87 38 322 0.916 21.71 1.10 PlyA + 43215 43220 6 1.05 2.00 Prom + 55940 55979 40 -5.26 2.01 Sngl + 60157 60519 363 0 0 57 41 231 0.843 9.97 2.02 PlyA + 61203 61208 6 1.05 3.00 Prom + 67070 67109 40 -6.36 3.01 Init + 68028 68108 81 2 0 80 65 101 0.550 6.01 3.02 Intr + 70826 71032 207 1 0 137 105 101 0.851 15.97 3.03 Intr + 71789 71911 123 1 0 80 94 40 0.947 4.58 3.04 Intr + 73450 73542 93 0 0 65 42 68 0.190 0.06 3.05 Intr + 81328 81450 123 0 0 84 94 156 0.565 16.58 3.06 Term + 82924 82977 54 0 0 108 41 17 0.443 -3.44 3.07 PlyA + 83609 83614 6 1.05 4.04 PlyA - 84831 84826 6 1.05 4.03 Term - 89607 89476 132 0 0 104 53 221 0.893 18.29 4.02 Intr - 90183 90061 123 0 0 55 98 93 0.994 7.78 4.01 Init - 91618 91535 84 1 0 103 99 162 0.999 17.67 4.00 Prom - 93441 93402 40 -1.66 5.00 Prom + 99145 99184 40 -7.16 5.01 Init + 100001 100412 412 1 1 97 102 607 0.972 59.68 5.02 Intr + 108876 109112 237 1 0 116 46 414 0.916 37.59 5.03 Term + 109948 110006 59 2 2 111 41 22 0.865 -2.35 5.04 PlyA + 111164 111169 6 1.05 6.02 PlyA - 111194 111189 6 1.05 6.01 Sngl - 123223 121895 1329 1 0 39 47 194 0.372 6.52 6.00 Prom - 125740 125701 40 -5.66 7.14 PlyA - 126586 126581 6 1.05 7.13 Term - 129818 129634 185 0 2 113 46 167 0.999 12.71 7.12 Intr - 132759 132667 93 1 0 32 98 72 0.827 2.54 7.11 Intr - 136934 136865 70 2 1 121 113 56 0.998 10.15 7.10 Intr - 144688 144629 60 1 0 84 92 21 0.452 1.03 7.09 Intr - 151698 151591 108 0 0 70 50 57 0.309 0.58 7.08 Intr - 163440 163340 101 1 2 77 103 26 0.526 2.83 7.07 Intr - 164231 163812 420 0 0 88 77 237 0.558 16.52 7.06 Intr - 164824 164611 214 1 1 57 3 241 0.795 10.79 7.05 Intr - 165289 165264 26 2 2 93 100 1 0.776 -0.46 7.04 Intr - 167058 166893 166 0 1 81 45 111 0.719 5.63 7.03 Intr - 182441 182354 88 1 1 86 70 106 0.962 8.57 7.02 Intr - 183253 183145 109 2 1 96 89 -31 0.869 -2.86 7.01 Intr - 185343 185225 119 2 2 42 115 117 0.764 9.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 21912 21813 100 2 1 84 50 66 0.805 -0.00 S.002 Intr - 22590 22535 56 0 2 95 93 87 0.882 7.68 S.003 Init + 31439 31601 163 1 1 88 86 191 0.887 18.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:35705671_35914780|GENSCAN_predicted_peptide_1|430_aa MQGVALTSVTSRGAWPLSIVSWCPGPLRIRTWGEDEGVLGQGSEWAGLAGRSIIGDAVSL PPSILGAAQDPEPSVPVLPKRDQGQGNTEDMGKSIPQYLGQLDIRKSVVSLATGAGAIYL LYKAIKAGIKCKPPLCSNSPICIARLAVERERHGRDSGELRRLLNSLECKQDEYAKSMIL HSITRCVYLLEAEASACTTDDIVLLGYMLDDKDNSVKTQALNTLKAFSGIRKFRLKIQEH SIKVLELISTIWDTELHIAGLRLLNNLPLPDYVHPQLRRVMPALMEILQSDYILAQVQAV RLLSYLAQKNDLLYDILNCQVHSNFLNLFQPTQSGSLLYEVLVFAERLSEGRNAPHYHVV KWHYNEQSLHESLFGEESRLADRLLALVIHPEEDVQIQACKVIVSLQYPQDLRARPSSCQ PSRSYFKNTE >gi568815592f:35705671_35914780|GENSCAN_predicted_CDS_1|1293_bp atgcagggtgttgccttgacatcagtgacgtcgcgaggggcgtggcctctctccatcgtc tcctggtgccctgggcccctccgcatccgaacctggggggaggatgagggtgtactgggc caaggctctgagtgggcaggcctggctggccgcagcatcattggagatgctgtttcccta cctccttccatcctgggtgcagcccaggacccagagccttctgtcccagttctcccaaaa agggatcagggccagggcaacactgaagacatgggcaagagcatcccccaatacctgggg caactggacatccgcaaaagcgtagtcagcctggccacaggcgccggggcgatctacctg ctctacaaggccatcaaggctggcataaaatgcaaaccacccctctgtagcaactcaccc atctgcatcgcccgcctggcagtcgagcgagagcggcacgggcgggactcaggtgagctc cggaggctcctcaactctttggagtgcaaacaggatgagtatgccaagagcatgatcctg cacagtatcactcgctgtgtgtacttgctggaggctgaggcctctgcttgtactacggat gacatcgtgttgctgggctacatgctggatgacaaggacaacagtgtcaaaacccaagct ctgaatacacttaaagctttctctggcatcagaaaattcaggctcaaaatccaggaacac tccatcaaagtactcgaactgatctccaccatctgggacacggaactgcacattgcgggc ctcagactcctcaacaaccttccactgcccgactatgtgcatccacagctgcgacgggtg atgcctgccttgatggagatcctgcagtcagactacatcctggcacaggtgcaagccgta cgactgctgagctacctggcacagaagaatgaccttctctatgacattctcaactgccag gttcactccaacttcctaaacctgttccagcccacacagtcagggagtctcctgtatgag gtactggtgtttgctgagcggctgagtgagggccggaacgcaccccactaccacgtggtg aaatggcattacaacgaacagtccctgcatgaatccctctttggggaagagtcccgactg gcagaccgactacttgccctggtcatccaccctgaggaagatgttcagatccaggcctgc aaggtcattgtcagcctgcagtatccccaggacttgagagcccggccctcctcctgccag cccagtcgttcctactttaaaaacacggaataa >gi568815592f:35705671_35914780|GENSCAN_predicted_peptide_2|120_aa MPWQPAPFPHRLPGTRRTCQPSELRGAVQAEPLPEPLLHVLGLSFPLQTCRPILRCPPGL MKPLVVFVHGGPGALASLRKMATHTFLQESYFMMKGRAQIHSMVNSLKITLKKETWDQWR >gi568815592f:35705671_35914780|GENSCAN_predicted_CDS_2|363_bp atgccctggcagcccgcccctttcccacaccgcctccccggcacacgccgcacctgtcag ccctctgagctccgaggtgcggtgcaggctgagccactgcctgagccgctgctccacgtc ctgggccttagcttcccgctgcagacctgccggccgattcttcgctgccctcccggtctc atgaagccgctggtcgtgtttgtccacggcggtcccggcgcgctggcatccttgagaaag atggctacacacacctttctgcaggagagttacttcatgatgaaaggaagagcccagatt cacagtatggtgaactcactgaaaattacattaaagaaggaaacatgggaccagtggaga taa >gi568815592f:35705671_35914780|GENSCAN_predicted_peptide_3|226_aa MLTGQAAQGFPALETLVILFLLPRNAEASSPSTSLLRRGGLGRRVTWPGKRFYPAAPRNP AAALPPRPMAAALALVAGVLSGAVLPLWSALPQYKKKITDRCFHHSECYSGCCLMDLDSG GAFCAPRARITMICLPQSSQQPYEGGYYYCPHLTDEETELERFIDLLKELKESCIRNQDC ETGCCQRAPDNCESHCAEKGSEGSLCQTQVGKRVEQLVQGHTVGKC >gi568815592f:35705671_35914780|GENSCAN_predicted_CDS_3|681_bp atgctcacaggtcaggcagctcagggattcccggcgctggagaccctggtgatcttgttc ctactcccccgcaatgcagaggcgtcctcgccctccacgtccctcctccgccggggtggc ctggggcgccgggtcacgtggccggggaagaggttttatcccgcggcccctcggaacccc gccgctgctctgccgcccaggcccatggccgcagccctggcgctcgtggcgggggtcctg tcgggggcggtgctgcccctctggagcgcgcttccgcaatataaaaagaaaatcacagac aggtgcttccaccactctgagtgctacagtggctgctgcctcatggacttggactccggt ggagccttctgtgcccccagggccagaataaccatgatctgcttgccccagtcctcacaa caaccctatgaggggggttactattactgtccccatttgacagatgaggaaaccgagctg gagaggttcattgacttgctgaaggagctcaaggagtcttgcatccggaaccaggactgc gagactggctgctgccaacgtgctccagacaattgcgagtcgcactgcgcggagaagggg tccgagggcagtctgtgtcaaacgcaggtagggaaacgggttgagcaacttgttcaaggt cacacagttggaaagtgctga >gi568815592f:35705671_35914780|GENSCAN_predicted_peptide_4|112_aa MEKILILLLVALSVAYAAPGPRGIIINLENGELCMNSAQCKSNCCQHSSALGLARCTSMA SENSECSVKTLYGIYYKCPCERGLTCEGDKTIVGSITNTNFGICHDAGRSKQ >gi568815592f:35705671_35914780|GENSCAN_predicted_CDS_4|339_bp atggagaagatcctgatcctcctgcttgtcgccctctctgtggcctatgcagctcctggc ccccgggggatcattatcaacctggagaacggtgagctctgcatgaatagtgcccagtgt aagagcaattgctgccagcattcaagtgcgctgggcctggcccgctgcacatccatggcc agcgagaacagcgagtgctctgtcaagacgctctatgggatttactacaagtgtccctgt gagcgtggcctgacctgtgagggagacaagaccatcgtgggctccatcaccaacaccaac tttggcatctgccatgacgctggacgctccaagcagtga >gi568815592f:35705671_35914780|GENSCAN_predicted_peptide_5|235_aa MVKLLPAQEAAKIYHTNYVRNSRAVGVMWGTLTICFSVLVMALFIQPYWIGDSVNTPQAG YFGLFSYCVGNVLSSELICKGGPLDFSSIPSRAFKTAMFFVALGMFLIIGSIICFSLFFI CNTATVYKICAWMQLAAATGLMIGCLVYPDGWDSSEVRRMCGEQTGKYTLGHCTIRWAFM LAILSIGDALILSFLAFVLGYRQDKLLPDDYKADGTAVFTLALGLEAAAKLLCAL >gi568815592f:35705671_35914780|GENSCAN_predicted_CDS_5|708_bp atggtgaaattgctgccggcccaggaggcagccaagatctaccataccaactatgtgcgg aactcgcgagccgtgggcgtgatgtggggtaccctcaccatctgcttctccgtactggtc atggccctcttcatccagccctactggatcggcgacagcgtcaacacaccgcaggcaggc tacttcggccttttctcctactgcgtgggtaacgtgctgtcctccgagctcatctgcaag ggcggccccctagacttctcctccatcccctctagagccttcaagactgccatgttcttt gtggccttgggcatgttcctcatcattggctccatcatctgcttcagcctgttcttcatc tgcaacacggccacagtctataagatctgtgcatggatgcagctggctgcggccacaggc ctaatgattggctgcctggtctaccctgatggttgggactcaagtgaggtgcggcgcatg tgtggggagcagacgggcaagtacacgctgggccactgcaccatccgctgggccttcatg ctggccatcctcagcattggcgacgccctcatcctctccttcctggccttcgtgttgggc taccggcaggacaagctcctccctgacgactacaaggcagatggaaccgctgtcttcaca ttagcccttggattagaagcagcagccaaactcctctgtgcactataa >gi568815592f:35705671_35914780|GENSCAN_predicted_peptide_6|442_aa MTFFTELEKTTLKFIWNQKRARIAKSIVSQKNKAGGITLPDFKLYYKATVTKTAWYWYQN RDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFL TPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKWDL IKLKSFCTAKETTIRVNRQPTTWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWA KDMNRHLSKEDISAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRG CGEIGTLLHCWWDCKLVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPNDYKSCCYKDTC TRMFIAALFTIAKTWNQPKCPTMIDWIKKMWHIYTMEYYAAIKNDEFMSFVGTWMKLETI ILSKLSQEQKTKHRIFSLIGGN >gi568815592f:35705671_35914780|GENSCAN_predicted_CDS_6|1329_bp atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcatcgccaagtcaatcgtaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaac tatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcgtgggcaaggac ttcatgtccaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acaacatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcg aaggacatgaacagacacctctcaaaagaagacatttctgcagccaaaaaacacatgaaa aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccactatgagatatcat ctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaggtgctggagagga tgtggagaaataggaacacttttacactgttggtgggactgtaaactagttcaaccattg tggaagtcagtgtggcgattcctcagggatctagaactagaaataccatttgacccagcc atcccattactgggtatatacccaaatgactataaatcatgctgctataaagacacatgc acacgtatgtttattgcggcattattcacaatagcaaagacttggaaccaacccaaatgt ccaacaatgatagactggattaagaaaatgtggcacatatacaccatggaatactatgca gccataaaaaatgatgagttcatgtcctttgtagggacatggatgaaattggaaaccatc attctcagtaaactatcgcaagaacaaaaaaccaaacaccgcatattctcactcataggt gggaattga >gi568815592f:35705671_35914780|GENSCAN_predicted_peptide_7|586_aa XSETQHRGSAPHSESDLPEQEEEILGSDDDEQEDPNDYCKGGYHLVKIGDLFNGRYHVIR KLGWGHFSTVWLSWDIQGKKFVAMKVVKSAEHYTETALDEIRLLKSVLQGLDYLHTKCRI IHTDIKPENILLSVNEQYIRRLAAEATEWQRSGAPPPSGSAVSTAPQPKPADKMSKNKKK KLKKKQKRQAELLEKRMQEIEEMEKESGPGQKRPNKQEESESPVERPLKENPPNKMTQEK LEESSTIGQDQTLMERDTEGGAAEINCNGVIEVINYTQNSNNETLRHKEDLHNANDCDVQ NLNQESSFLSSQNGDSSTSQETDSCTPITSEVSDTMVCQSSSTVGQSFSEQHISQLQESI RAEIPCEDEQEQEHNGPLDNKGKSTAGNFLVNPLEPKNAEKLKVKIADLGNACWVHKHFT EDIQTRQYRSLEVLIGSGYNTPADIWSTACMSTKHQDWLLNYVTSPYLRRAAFELATGDY LFEPHSGEEYTRDEDHIALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITKLKPWGLF EVLVEKYEWSQEEAAGFTDFLLPMLELIPEKRATAAECLRHPWLNS >gi568815592f:35705671_35914780|GENSCAN_predicted_CDS_7|1761_bp nnatctgaaactcagcaccgaggctctgctccccactctgagagtgatctaccagagcag gaagaggagattctgggatctgatgatgatgagcaagaagatcctaatgattattgtaaa ggaggttatcatcttgtgaaaattggagatctattcaatgggagataccatgtgatccga aagttaggctggggacacttttcaacagtatggttatcatgggatattcaggggaagaaa tttgtggcaatgaaagtagttaaaagtgctgaacattacactgaaacagcactagatgaa atccggttgctgaagtcagtgttacagggtcttgattatttacataccaagtgccgtatc atccacactgacattaaaccagagaacatcttattgtcagtgaatgagcagtacattcgg aggctggctgcagaagcaacagaatggcagcgatctggagctcctccgccttccggatct gcagtcagtactgctccccagcctaaaccagctgacaaaatgtcaaagaataagaagaag aaattgaagaagaagcagaagcgccaggcagaattactagagaagcgaatgcaggaaatt gaggaaatggagaaagagtcgggccctgggcaaaaaagaccaaacaagcaagaagaatca gagagtcctgttgaaagacccttgaaagagaacccacctaataaaatgacccaagaaaaa cttgaagagtcaagtaccattggccaggatcaaacgcttatggaacgtgatacagagggt ggtgcagcagaaattaattgcaatggagtgattgaagtcattaattatactcagaacagt aataatgaaacattgagacataaagaggatctacataatgctaatgactgtgatgtccaa aatttgaatcaggaatctagtttcctaagctcccaaaatggagacagcagcacatctcaa gaaacagactcttgtacacctataacatctgaggtgtcagacaccatggtgtgccagtct tcctcaactgtaggtcagtcattcagtgaacaacacattagccaacttcaagaaagcatt cgggcagagataccctgtgaagatgaacaagagcaagaacataacggaccactggacaac aaaggaaaatccacggctggaaattttcttgttaatccccttgagccaaaaaatgcagaa aagctcaaggtgaagattgctgaccttggaaatgcttgttgggtgcacaaacatttcact gaagatattcaaacaaggcaatatcgttccttggaagttctaatcggatctggctataat acccctgctgacatttggagcacggcatgcatgagcactaaacaccaggactggttactt aattatgttacttctccatatctaagacgagcagcctttgaactggccacaggtgactat ttgtttgaacctcattcaggggaagagtacactcgagatgaagatcacattgcattgatc atagaacttctggggaaggtgcctcgcaagctcattgtggcaggaaaatattccaaggaa tttttcaccaaaaaaggtgacctgaaacatatcacgaagctgaaaccttggggccttttt gaggttctagtggagaagtatgagtggtcgcaggaagaggcagctggcttcacagatttc ttactgcccatgttggagctgatccctgagaagagagccactgccgccgagtgtctccgg cacccttggcttaactcctaa