GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:10:46 Sequence gi568815592r:35695149_35897288 : 202140 bp : 45.19% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 17565 17574 10 2 1 103 91 2 0.415 2.56 1.02 Intr + 36340 36437 98 2 2 75 73 49 0.000 1.83 1.03 Intr + 40315 40461 147 0 0 68 96 30 0.002 2.23 1.04 Intr + 41946 42123 178 2 1 24 86 223 0.006 15.19 1.05 Intr + 42879 43024 146 1 2 18 94 142 0.962 7.80 1.06 Intr + 43236 43370 135 2 0 85 99 175 0.997 19.06 1.07 Intr + 52113 52286 174 2 0 96 51 144 0.992 11.64 1.08 Intr + 52428 52499 72 2 0 111 94 123 0.999 14.80 1.09 Term + 53390 53722 333 1 0 87 38 322 0.916 21.71 1.10 PlyA + 53737 53742 6 1.05 2.00 Prom + 66462 66501 40 -5.26 2.01 Sngl + 70679 71041 363 1 0 57 41 231 0.843 9.97 2.02 PlyA + 71725 71730 6 1.05 3.00 Prom + 77592 77631 40 -6.36 3.01 Init + 78550 78630 81 0 0 80 65 101 0.550 6.01 3.02 Intr + 81348 81554 207 2 0 137 105 101 0.851 15.97 3.03 Intr + 82311 82433 123 2 0 80 94 40 0.947 4.58 3.04 Intr + 83972 84064 93 1 0 65 42 68 0.190 0.06 3.05 Intr + 91850 91972 123 1 0 84 94 156 0.565 16.58 3.06 Term + 93446 93499 54 1 0 108 41 17 0.443 -3.44 3.07 PlyA + 94131 94136 6 1.05 4.04 PlyA - 95353 95348 6 1.05 4.03 Term - 100129 99998 132 1 0 104 53 221 0.893 18.29 4.02 Intr - 100705 100583 123 1 0 55 98 93 0.994 7.78 4.01 Init - 102140 102057 84 2 0 103 99 162 0.999 17.67 4.00 Prom - 103963 103924 40 -1.66 5.00 Prom + 109667 109706 40 -7.16 5.01 Init + 110523 110934 412 2 1 97 102 607 0.972 59.68 5.02 Intr + 119398 119634 237 2 0 116 46 414 0.916 37.59 5.03 Term + 120470 120528 59 0 2 111 41 22 0.865 -2.35 5.04 PlyA + 121686 121691 6 1.05 6.02 PlyA - 121716 121711 6 1.05 6.01 Sngl - 133745 132417 1329 2 0 39 47 194 0.372 6.52 6.00 Prom - 136262 136223 40 -5.66 7.14 PlyA - 137108 137103 6 1.05 7.13 Term - 140340 140156 185 1 2 113 46 167 0.999 12.71 7.12 Intr - 143281 143189 93 2 0 32 98 72 0.827 2.54 7.11 Intr - 147456 147387 70 0 1 121 113 56 0.998 10.15 7.10 Intr - 155210 155151 60 2 0 84 92 21 0.452 1.03 7.09 Intr - 162220 162113 108 1 0 70 50 57 0.309 0.58 7.08 Intr - 173962 173862 101 2 2 77 103 26 0.526 2.83 7.07 Intr - 174753 174334 420 1 0 88 77 237 0.558 16.52 7.06 Intr - 175346 175133 214 2 1 57 3 241 0.795 10.79 7.05 Intr - 175811 175786 26 0 2 93 100 1 0.776 -0.46 7.04 Intr - 177580 177415 166 1 1 81 45 111 0.719 5.63 7.03 Intr - 192963 192876 88 2 1 86 70 106 0.963 8.57 7.02 Intr - 193775 193667 109 0 1 96 89 -31 0.869 -2.86 7.01 Intr - 195865 195747 119 0 2 42 115 117 0.821 9.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 32434 32335 100 0 1 84 50 66 0.801 -0.00 S.002 Intr - 33112 33057 56 1 2 95 93 87 0.882 7.68 S.003 Init + 41961 42123 163 2 1 88 86 191 0.887 18.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:35695149_35897288|GENSCAN_predicted_peptide_1|430_aa MQGVALTSVTSRGAWPLSIVSWCPGPLRIRTWGEDEGVLGQGSEWAGLAGRSIIGDAVSL PPSILGAAQDPEPSVPVLPKRDQGQGNTEDMGKSIPQYLGQLDIRKSVVSLATGAGAIYL LYKAIKAGIKCKPPLCSNSPICIARLAVERERHGRDSGELRRLLNSLECKQDEYAKSMIL HSITRCVYLLEAEASACTTDDIVLLGYMLDDKDNSVKTQALNTLKAFSGIRKFRLKIQEH SIKVLELISTIWDTELHIAGLRLLNNLPLPDYVHPQLRRVMPALMEILQSDYILAQVQAV RLLSYLAQKNDLLYDILNCQVHSNFLNLFQPTQSGSLLYEVLVFAERLSEGRNAPHYHVV KWHYNEQSLHESLFGEESRLADRLLALVIHPEEDVQIQACKVIVSLQYPQDLRARPSSCQ PSRSYFKNTE >gi568815592r:35695149_35897288|GENSCAN_predicted_CDS_1|1293_bp atgcagggtgttgccttgacatcagtgacgtcgcgaggggcgtggcctctctccatcgtc tcctggtgccctgggcccctccgcatccgaacctggggggaggatgagggtgtactgggc caaggctctgagtgggcaggcctggctggccgcagcatcattggagatgctgtttcccta cctccttccatcctgggtgcagcccaggacccagagccttctgtcccagttctcccaaaa agggatcagggccagggcaacactgaagacatgggcaagagcatcccccaatacctgggg caactggacatccgcaaaagcgtagtcagcctggccacaggcgccggggcgatctacctg ctctacaaggccatcaaggctggcataaaatgcaaaccacccctctgtagcaactcaccc atctgcatcgcccgcctggcagtcgagcgagagcggcacgggcgggactcaggtgagctc cggaggctcctcaactctttggagtgcaaacaggatgagtatgccaagagcatgatcctg cacagtatcactcgctgtgtgtacttgctggaggctgaggcctctgcttgtactacggat gacatcgtgttgctgggctacatgctggatgacaaggacaacagtgtcaaaacccaagct ctgaatacacttaaagctttctctggcatcagaaaattcaggctcaaaatccaggaacac tccatcaaagtactcgaactgatctccaccatctgggacacggaactgcacattgcgggc ctcagactcctcaacaaccttccactgcccgactatgtgcatccacagctgcgacgggtg atgcctgccttgatggagatcctgcagtcagactacatcctggcacaggtgcaagccgta cgactgctgagctacctggcacagaagaatgaccttctctatgacattctcaactgccag gttcactccaacttcctaaacctgttccagcccacacagtcagggagtctcctgtatgag gtactggtgtttgctgagcggctgagtgagggccggaacgcaccccactaccacgtggtg aaatggcattacaacgaacagtccctgcatgaatccctctttggggaagagtcccgactg gcagaccgactacttgccctggtcatccaccctgaggaagatgttcagatccaggcctgc aaggtcattgtcagcctgcagtatccccaggacttgagagcccggccctcctcctgccag cccagtcgttcctactttaaaaacacggaataa >gi568815592r:35695149_35897288|GENSCAN_predicted_peptide_2|120_aa MPWQPAPFPHRLPGTRRTCQPSELRGAVQAEPLPEPLLHVLGLSFPLQTCRPILRCPPGL MKPLVVFVHGGPGALASLRKMATHTFLQESYFMMKGRAQIHSMVNSLKITLKKETWDQWR >gi568815592r:35695149_35897288|GENSCAN_predicted_CDS_2|363_bp atgccctggcagcccgcccctttcccacaccgcctccccggcacacgccgcacctgtcag ccctctgagctccgaggtgcggtgcaggctgagccactgcctgagccgctgctccacgtc ctgggccttagcttcccgctgcagacctgccggccgattcttcgctgccctcccggtctc atgaagccgctggtcgtgtttgtccacggcggtcccggcgcgctggcatccttgagaaag atggctacacacacctttctgcaggagagttacttcatgatgaaaggaagagcccagatt cacagtatggtgaactcactgaaaattacattaaagaaggaaacatgggaccagtggaga taa >gi568815592r:35695149_35897288|GENSCAN_predicted_peptide_3|226_aa MLTGQAAQGFPALETLVILFLLPRNAEASSPSTSLLRRGGLGRRVTWPGKRFYPAAPRNP AAALPPRPMAAALALVAGVLSGAVLPLWSALPQYKKKITDRCFHHSECYSGCCLMDLDSG GAFCAPRARITMICLPQSSQQPYEGGYYYCPHLTDEETELERFIDLLKELKESCIRNQDC ETGCCQRAPDNCESHCAEKGSEGSLCQTQVGKRVEQLVQGHTVGKC >gi568815592r:35695149_35897288|GENSCAN_predicted_CDS_3|681_bp atgctcacaggtcaggcagctcagggattcccggcgctggagaccctggtgatcttgttc ctactcccccgcaatgcagaggcgtcctcgccctccacgtccctcctccgccggggtggc ctggggcgccgggtcacgtggccggggaagaggttttatcccgcggcccctcggaacccc gccgctgctctgccgcccaggcccatggccgcagccctggcgctcgtggcgggggtcctg tcgggggcggtgctgcccctctggagcgcgcttccgcaatataaaaagaaaatcacagac aggtgcttccaccactctgagtgctacagtggctgctgcctcatggacttggactccggt ggagccttctgtgcccccagggccagaataaccatgatctgcttgccccagtcctcacaa caaccctatgaggggggttactattactgtccccatttgacagatgaggaaaccgagctg gagaggttcattgacttgctgaaggagctcaaggagtcttgcatccggaaccaggactgc gagactggctgctgccaacgtgctccagacaattgcgagtcgcactgcgcggagaagggg tccgagggcagtctgtgtcaaacgcaggtagggaaacgggttgagcaacttgttcaaggt cacacagttggaaagtgctga >gi568815592r:35695149_35897288|GENSCAN_predicted_peptide_4|112_aa MEKILILLLVALSVAYAAPGPRGIIINLENGELCMNSAQCKSNCCQHSSALGLARCTSMA SENSECSVKTLYGIYYKCPCERGLTCEGDKTIVGSITNTNFGICHDAGRSKQ >gi568815592r:35695149_35897288|GENSCAN_predicted_CDS_4|339_bp atggagaagatcctgatcctcctgcttgtcgccctctctgtggcctatgcagctcctggc ccccgggggatcattatcaacctggagaacggtgagctctgcatgaatagtgcccagtgt aagagcaattgctgccagcattcaagtgcgctgggcctggcccgctgcacatccatggcc agcgagaacagcgagtgctctgtcaagacgctctatgggatttactacaagtgtccctgt gagcgtggcctgacctgtgagggagacaagaccatcgtgggctccatcaccaacaccaac tttggcatctgccatgacgctggacgctccaagcagtga >gi568815592r:35695149_35897288|GENSCAN_predicted_peptide_5|235_aa MVKLLPAQEAAKIYHTNYVRNSRAVGVMWGTLTICFSVLVMALFIQPYWIGDSVNTPQAG YFGLFSYCVGNVLSSELICKGGPLDFSSIPSRAFKTAMFFVALGMFLIIGSIICFSLFFI CNTATVYKICAWMQLAAATGLMIGCLVYPDGWDSSEVRRMCGEQTGKYTLGHCTIRWAFM LAILSIGDALILSFLAFVLGYRQDKLLPDDYKADGTAVFTLALGLEAAAKLLCAL >gi568815592r:35695149_35897288|GENSCAN_predicted_CDS_5|708_bp atggtgaaattgctgccggcccaggaggcagccaagatctaccataccaactatgtgcgg aactcgcgagccgtgggcgtgatgtggggtaccctcaccatctgcttctccgtactggtc atggccctcttcatccagccctactggatcggcgacagcgtcaacacaccgcaggcaggc tacttcggccttttctcctactgcgtgggtaacgtgctgtcctccgagctcatctgcaag ggcggccccctagacttctcctccatcccctctagagccttcaagactgccatgttcttt gtggccttgggcatgttcctcatcattggctccatcatctgcttcagcctgttcttcatc tgcaacacggccacagtctataagatctgtgcatggatgcagctggctgcggccacaggc ctaatgattggctgcctggtctaccctgatggttgggactcaagtgaggtgcggcgcatg tgtggggagcagacgggcaagtacacgctgggccactgcaccatccgctgggccttcatg ctggccatcctcagcattggcgacgccctcatcctctccttcctggccttcgtgttgggc taccggcaggacaagctcctccctgacgactacaaggcagatggaaccgctgtcttcaca ttagcccttggattagaagcagcagccaaactcctctgtgcactataa >gi568815592r:35695149_35897288|GENSCAN_predicted_peptide_6|442_aa MTFFTELEKTTLKFIWNQKRARIAKSIVSQKNKAGGITLPDFKLYYKATVTKTAWYWYQN RDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFL TPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKWDL IKLKSFCTAKETTIRVNRQPTTWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWA KDMNRHLSKEDISAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRG CGEIGTLLHCWWDCKLVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPNDYKSCCYKDTC TRMFIAALFTIAKTWNQPKCPTMIDWIKKMWHIYTMEYYAAIKNDEFMSFVGTWMKLETI ILSKLSQEQKTKHRIFSLIGGN >gi568815592r:35695149_35897288|GENSCAN_predicted_CDS_6|1329_bp atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcatcgccaagtcaatcgtaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaac tatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcgtgggcaaggac ttcatgtccaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acaacatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcg aaggacatgaacagacacctctcaaaagaagacatttctgcagccaaaaaacacatgaaa aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccactatgagatatcat ctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaggtgctggagagga tgtggagaaataggaacacttttacactgttggtgggactgtaaactagttcaaccattg tggaagtcagtgtggcgattcctcagggatctagaactagaaataccatttgacccagcc atcccattactgggtatatacccaaatgactataaatcatgctgctataaagacacatgc acacgtatgtttattgcggcattattcacaatagcaaagacttggaaccaacccaaatgt ccaacaatgatagactggattaagaaaatgtggcacatatacaccatggaatactatgca gccataaaaaatgatgagttcatgtcctttgtagggacatggatgaaattggaaaccatc attctcagtaaactatcgcaagaacaaaaaaccaaacaccgcatattctcactcataggt gggaattga >gi568815592r:35695149_35897288|GENSCAN_predicted_peptide_7|586_aa XSETQHRGSAPHSESDLPEQEEEILGSDDDEQEDPNDYCKGGYHLVKIGDLFNGRYHVIR KLGWGHFSTVWLSWDIQGKKFVAMKVVKSAEHYTETALDEIRLLKSVLQGLDYLHTKCRI IHTDIKPENILLSVNEQYIRRLAAEATEWQRSGAPPPSGSAVSTAPQPKPADKMSKNKKK KLKKKQKRQAELLEKRMQEIEEMEKESGPGQKRPNKQEESESPVERPLKENPPNKMTQEK LEESSTIGQDQTLMERDTEGGAAEINCNGVIEVINYTQNSNNETLRHKEDLHNANDCDVQ NLNQESSFLSSQNGDSSTSQETDSCTPITSEVSDTMVCQSSSTVGQSFSEQHISQLQESI RAEIPCEDEQEQEHNGPLDNKGKSTAGNFLVNPLEPKNAEKLKVKIADLGNACWVHKHFT EDIQTRQYRSLEVLIGSGYNTPADIWSTACMSTKHQDWLLNYVTSPYLRRAAFELATGDY LFEPHSGEEYTRDEDHIALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITKLKPWGLF EVLVEKYEWSQEEAAGFTDFLLPMLELIPEKRATAAECLRHPWLNS >gi568815592r:35695149_35897288|GENSCAN_predicted_CDS_7|1761_bp nnatctgaaactcagcaccgaggctctgctccccactctgagagtgatctaccagagcag gaagaggagattctgggatctgatgatgatgagcaagaagatcctaatgattattgtaaa ggaggttatcatcttgtgaaaattggagatctattcaatgggagataccatgtgatccga aagttaggctggggacacttttcaacagtatggttatcatgggatattcaggggaagaaa tttgtggcaatgaaagtagttaaaagtgctgaacattacactgaaacagcactagatgaa atccggttgctgaagtcagtgttacagggtcttgattatttacataccaagtgccgtatc atccacactgacattaaaccagagaacatcttattgtcagtgaatgagcagtacattcgg aggctggctgcagaagcaacagaatggcagcgatctggagctcctccgccttccggatct gcagtcagtactgctccccagcctaaaccagctgacaaaatgtcaaagaataagaagaag aaattgaagaagaagcagaagcgccaggcagaattactagagaagcgaatgcaggaaatt gaggaaatggagaaagagtcgggccctgggcaaaaaagaccaaacaagcaagaagaatca gagagtcctgttgaaagacccttgaaagagaacccacctaataaaatgacccaagaaaaa cttgaagagtcaagtaccattggccaggatcaaacgcttatggaacgtgatacagagggt ggtgcagcagaaattaattgcaatggagtgattgaagtcattaattatactcagaacagt aataatgaaacattgagacataaagaggatctacataatgctaatgactgtgatgtccaa aatttgaatcaggaatctagtttcctaagctcccaaaatggagacagcagcacatctcaa gaaacagactcttgtacacctataacatctgaggtgtcagacaccatggtgtgccagtct tcctcaactgtaggtcagtcattcagtgaacaacacattagccaacttcaagaaagcatt cgggcagagataccctgtgaagatgaacaagagcaagaacataacggaccactggacaac aaaggaaaatccacggctggaaattttcttgttaatccccttgagccaaaaaatgcagaa aagctcaaggtgaagattgctgaccttggaaatgcttgttgggtgcacaaacatttcact gaagatattcaaacaaggcaatatcgttccttggaagttctaatcggatctggctataat acccctgctgacatttggagcacggcatgcatgagcactaaacaccaggactggttactt aattatgttacttctccatatctaagacgagcagcctttgaactggccacaggtgactat ttgtttgaacctcattcaggggaagagtacactcgagatgaagatcacattgcattgatc atagaacttctggggaaggtgcctcgcaagctcattgtggcaggaaaatattccaaggaa tttttcaccaaaaaaggtgacctgaaacatatcacgaagctgaaaccttggggccttttt gaggttctagtggagaagtatgagtggtcgcaggaagaggcagctggcttcacagatttc ttactgcccatgttggagctgatccctgagaagagagccactgccgccgagtgtctccgg cacccttggcttaactcctaa