GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:42:07

Sequence gi568815581f:37412273_37612866 : 200594 bp : 42.23% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  11237  11343  107  0  2   95   93   106 0.770  10.81
 1.02 Intr +  14678  14737   60  1  0   95   50    86 0.866   3.51
 1.03 Intr +  25466  25557   92  1  2   87   91   122 0.966  10.27
 1.04 Intr +  28233  28390  158  0  2   72   93   166 0.999  14.23
 1.05 Intr +  30292  30380   89  2  2   62   75   115 0.566   6.37
 1.06 Intr +  32424  32496   73  2  1   62   47    75 0.115  -1.14
 1.07 Intr +  53159  53269  111  0  0   80   53   188 0.336  13.93
 1.08 Intr +  58128  58260  133  1  1   98   84   148 0.993  14.18
 1.09 Intr +  58822  58865   44  1  2  115   91     2 0.789   0.17
 1.10 Intr +  60886  60911   26  2  2   77   77    11 0.373  -4.27
 1.11 Intr +  61460  61529   70  1  1  120   97    26 0.614   4.54
 1.12 Intr +  62284  62357   74  2  2  121   77    19 0.982   2.31
 1.13 Term +  64525  64710  186  0  0   88   48   152 0.996   7.71
 1.14 PlyA +  65131  65136    6                               1.05

 2.07 PlyA -  66988  66983    6                               1.05
 2.06 Term -  74578  74414  165  1  0   -3   42   137 0.573  -3.07
 2.05 Intr -  74869  74776   94  0  1   56   98    81 0.329   4.95
 2.04 Intr -  77161  77079   83  1  2  105   96    69 0.976   6.92
 2.03 Intr -  77379  77221  159  0  0   75   64    76 0.881   3.16
 2.02 Intr -  78381  77839  543  0  0   77   75   205 0.741  10.26
 2.01 Init -  89232  89053  180  0  0   47   58   120 0.628   4.13
 2.00 Prom -  92637  92598   40                              -5.05

 3.00 Prom +  96931  96970   40                              -6.15
 3.01 Init +  98266  98344   79  0  1   12   85    79 0.843   1.27
 3.02 Intr +  98405  98492   88  0  1  120   81    -7 0.846  -0.09
 3.03 Intr +  99909 100013  105  0  0   89   27    77 0.462   0.11
 3.04 Term + 100114 100597  484  1  1   89   39   630 0.891  51.43
 3.05 PlyA + 100868 100873    6                               1.05

 4.23 PlyA - 102071 102066    6                               1.05
 4.22 Term - 106799 106668  132  2  0   89   47   135 0.995   6.61
 4.21 Intr - 108376 108266  111  1  0  100  114    84 0.998  11.86
 4.20 Intr - 117579 117511   69  0  0   96   64    42 0.343   1.06
 4.19 Intr - 123855 123707  149  1  2   99   72   135 0.598  12.03
 4.18 Intr - 126148 126052   97  1  1  100   77    41 0.669   2.86
 4.17 Intr - 128271 128108  164  1  2  110   78   191 0.909  19.07
 4.16 Intr - 130293 129700  594  1  0  102  113   445 0.981  40.01
 4.15 Intr - 141787 140843  945  2  0   86   94   992 0.982  89.78
 4.14 Intr - 148985 148923   63  0  0  114   97    44 0.980   5.67
 4.13 Intr - 149317 149199  119  0  2   90   83   102 0.987   9.09
 4.12 Intr - 156652 156519  134  1  2   57   75   127 0.985   6.82
 4.11 Intr - 158613 158365  249  0  0   82   99   178 0.981  15.01
 4.10 Intr - 159715 159519  197  2  2   81   97   196 0.992  18.01
 4.09 Intr - 164146 164069   78  2  0   80   98    23 0.663   1.10
 4.08 Intr - 165341 165108  234  0  0   67   77   122 0.927   5.74
 4.07 Intr - 167162 166860  303  0  0   81   93    97 0.936   5.14
 4.06 Intr - 172487 172376  112  2  1   45   68    72 0.945  -0.07
 4.05 Intr - 173158 173053  106  0  1   95   62   144 0.995  11.80
 4.04 Intr - 173636 173499  138  1  0   76   19   111 0.737   1.66
 4.03 Intr - 174277 174147  131  1  2  116   55   146 0.988  12.67
 4.02 Intr - 184072 183951  122  2  2   73   80    86 0.464   5.59
 4.01 Intr - 194043 193861  183  1  0   26   14   205 0.286   5.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:37412273_37612866|GENSCAN_predicted_peptide_1|407_aa XDPSDKPPCRGCSSYLMEPYIKCAECGPPPFFLCLQCFTRGFEYKKHQSDHTYEIMTSDF PVLDPSWTAQEEMALLEAVMDCGFGNWQDVANQMCTKTKEECEKHYMKHFINNPLFASTL LNLKQAEEAKTADTAIPFHSTDDPPRPTFDSLLSRDMAGYMPARADFIEEFDNYAEWDLR DIDFVEDDSDILHVMERRYPKEVQDLYETMRRFARIVGPVEHDKFIESHACARTYDHLKK TREEERLKRTMLSEVLQYIQDSSACQQWLRRQADIDSGLSPSIPMASNSETRSYYITQVC ELTSAPALPYNGSLFLFVDLAGRRSAPPLNLTGLPGTEKLNEKEKELCQMVRLVPGAYLE YKSALLNECNKQGGLRLAQARALIKIDVNKTRKIYDFLIREGYITKG >gi568815581f:37412273_37612866|GENSCAN_predicted_CDS_1|1224_bp natgatccctctgataagccaccttgccgaggctgctcctcctacctcatggagccttat atcaagtgtgctgaatgtgggccacctccttttttcctctgcttgcagtgtttcactcga ggctttgagtacaagaaacatcaaagcgatcatacttatgaaataatgacttcagatttt cctgtccttgatcccagctggactgctcaagaagaaatggcccttttagaagctgtgatg gactgtggctttggaaattggcaggatgtagccaatcaaatgtgcaccaagaccaaggag gagtgtgagaagcactatatgaagcatttcatcaataaccctctgtttgcatctaccctg ctgaacctgaaacaagcagaggaagcaaaaactgctgacacagccattccatttcactct acagatgaccctccccgacctacctttgactccttgctttctcgggacatggccgggtac atgccagctcgagcagatttcattgaggaatttgacaattatgcagaatgggacttgaga gacattgattttgttgaagatgactcggacattttacatgtaatggaacggcggtatccc aaggaggtccaggacctgtatgaaacaatgaggcgatttgcaagaattgtggggccagtg gaacatgacaaattcattgaaagccatgcatgtgccagaacctacgatcacctcaagaag acacgggaggaagagcgccttaaacgcactatgctctcagaagttctccagtatatccag gacagtagtgcttgccagcagtggctccgccggcaagctgacattgattccggcctgagt ccttccattccaatggcttcgaattcagaaacaaggtcttactatatcacccaggtgtgt gagttgacctcagcccctgctttgccatataatggctcattgttcctgtttgtggatctt gcaggtagacggagtgcaccacccttgaacctcactggcctccctggcacagagaagctg aatgaaaaagaaaaggagctctgtcagatggtgaggttggtccctggagcctatttagaa tacaaatctgctctattgaacgaatgtaacaagcaaggaggcttaagactggcgcaggca agagcactcatcaagatagatgtgaacaaaacccggaaaatctatgatttcctcatcaga gaaggatacatcactaaaggctaa >gi568815581f:37412273_37612866|GENSCAN_predicted_peptide_2|407_aa 
MGAILHWNGEYSPFLQPSPFSKCDPSPSSINTTWRLERNAERRLYSKTYRIGSSRLTRSQ PLGCPVTITLAGTTGNIHLNKIQLYNPLKAATARTPSLQTTTQLRSGARVLRRNKKTDVS TAHTHLPPQRKTQVGHLASLKVKLLLSMHQKPRGDAKSGGALQSGWMPARVGGLSGVGRR RRRRGRGCWSPGAQGQGRGGGAARRGSGRGRTLRAGEGALRKVSGLSSAGRPEESRTARE QVPWSLTLPHTPATPRFPSQRLRMSRARGGLRGRSAGVLSGLERIKGEKANPIQPGEPGG WDPRAKGKSVARQFLADACWNREGLGSPDSQLNNQCFRAGQHLRRVRADMINTNWGTAVN KAPSLQKFDFRVPSGLVAHRDSEGLLACASNSKLYAIDGYKNATPLR >gi568815581f:37412273_37612866|GENSCAN_predicted_CDS_2|1224_bp atgggggctattctccattggaatggagaatacagcccttttctacagccctcaccattt agcaagtgtgatccaagccccagcagcatcaataccacctggaggcttgaaagaaatgca gaacgcaggctctattccaagacctaccgaatcggctccagcaggttaacaagatcccag cctctgggatgcccagttaccataacactggctggtaccactggcaacatccacctaaat aaaatacagttatacaatccacttaaggcggccacagcacgcaccccctctttacaaaca acaactcaacttcgctccggagcaagggtacttaggagaaataaaaagacagatgttagc accgcgcacacccatctaccaccgcagagaaagactcaagttggacacttggcatcgctg aaagtcaagttgcttctatccatgcaccagaaacctcggggagatgcaaaaagcggcggc gcgctccagagcgggtggatgccagcgagagtaggcggcctcagcggggtcggaaggcga cggaggaggcgcggccgaggctgctggagccccggggcacagggacagggtcgcgggggc ggcgcggcgcggcgggggagcggcagggggcgcacgctcagggccggcgagggtgcgctg cggaaggtctcgggcctctcctccgcgggccgccccgaagagtcaaggaccgccagggag caggtcccctggtctcttaccctgcctcacacaccggcgacccctcgcttccccagtcag cgtctgcgtatgagccgggctcgaggcgggctgcgggggcgctctgctggggtgctttcg ggattggagaggattaaaggcgagaaagcaaatccgatacagcccggagagcccgggggt tgggacccaagagcgaagggaaaatcagtagcgcggcagttcctggctgacgcttgctgg aacagggagggtttgggttctcctgactcccagctaaataaccagtgctttcgagcaggg cagcatctacgaagagtgagagctgatatgattaatacaaattggggaacagcggtcaac aaggctcccagtctacagaagtttgatttcagggtgccatcgggtcttgtggctcacaga gattcggaaggcctgcttgcctgtgcctctaattccaaattatatgccattgatggctat aaaaatgcaactcccctgagataa >gi568815581f:37412273_37612866|GENSCAN_predicted_peptide_3|251_aa MHTCECSVHLPLPGSISVKPEVVVKVDLICIHCHQHCSLRTFWIRTQAAHTGLLRKKETL ILDSLVEENKTLWSCRQRCKCDCWRLHELQRGSVASNRHLLQARGITCIVNATIEIPNFN WPQFEYVKVPLADMPHAPIGLYFDTVADKIHSVSRKHGATLVHCAAGVSRSATLCIAYLM 
KFHNVCLLEAYNWVKARRPVIRPNVGFWRQLIDYERQLFGKSTVKMVQTPYGIVPDVYEK ESRHLMPYWGI >gi568815581f:37412273_37612866|GENSCAN_predicted_CDS_3|756_bp atgcacacctgcgaatgttcagtgcaccttccgcttcctggctctatttcagtcaaacct gaggtcgtagtgaaagtcgatttgatttgtatccactgtcaccagcactgctcacttagg actttctggatccggacccaggcagcgcacactggactcttgaggaagaaggagactcta attttggattccttggtggaggaaaataaaacactctggtcttgccgccaacgatgcaag tgtgactgctggcgtcttcatgagctccagagaggcagtgtggcctccaatcggcacctc ctccaggctcgtggcatcacctgcattgttaatgctaccattgagatccctaatttcaac tggccccaatttgagtatgttaaagtgcctctggctgacatgccgcatgcccccattgga ctgtactttgacaccgtggctgacaagatccacagtgtgagcaggaagcacggggccacc ttggtgcactgtgctgcaggggtgagccgctcagccacgctgtgtatcgcgtacctgatg aaattccacaacgtgtgcctgctggaggcgtacaactgggtgaaagcccggcgacctgtc atcaggcccaacgtaggcttctggaggcaactgatagactacgagcgccagctctttggg aagtcgacagttaaaatggtacagacaccttatggcatagttcccgacgtctatgagaag gagtcccgacacctgatgccttactgggggatttag >gi568815581f:37412273_37612866|GENSCAN_predicted_peptide_4|1476_aa XISKAKKWYFIWPTELEEEEAVKRSSQRWEGLVNTKETMYLQCQMTVALSGREEFCEIKL QTGLMPMQQQGFPMVSVMQPNMQGIMGMNYSSQMSQGPIAMQAGIPMGPMPAAGMPYLGQ APFLGMRPPGPQYTPDMQKQFAEEQHESEVQENETTCQSRLHSYQQSLDPKAGFLIPKSL LFLLYDTAFPSRKRFEQQQKLLEEERKRRQFEEQKQKLRLLSSVKPKTGEKSRDDALEAI KGNLDGFSRDAKMHPTPASHPKKPDCPTSSHSTKTVSPSPAFLDEEEFSDFMQGPVEVPP CGPSSTSQPFQSFHPSTPLGQLHTQKAGTQPLPPSQSPVPFALHGVPGQIPYFSTASASH SVPEAGPSLEEKFLVSCDISTSGQEQIKLNTSEVGHKALGPGSSKKYPSLMASNGVAVDG CVSGTTTAEAENTSDQNLSIEESGVGVFPSQDPAQPRMPPWIYNESLVPDAYKKILETTM TPTGIDTAKLYPILMSSGLPRETLGQIWALANRTTPGKLTKEELYTVLAMIAVTQRGVPA MSPDALNQFPAAPIPTLSGFSMTLPTPVSQPTVIPSGPAGSMPLSLGQPVMGINLVGPVG GAAAQASSGFIPTYPANQVVKPEEDDFQDFQDASKSGSLDDSFSDFQELPASSKTSNSQH GNSAPSLLMPLPGTKALPSMDKYAVFKGIAADKSSENTVPPGDPGDKYSAFRELEQTAEN KPLGESFAEFRSAGTDDGFTDFKTADSVSPLEPPTKDKTFPPSFPSGTIQQKQQTQVKNP LNLADLDMFSSVNCSSEKPLSFSAVFSTSKSVSTPQSTGSAATMTALAATKTSSLADDFG EFSLFGEYSGLAPVGEQDDFADFMAFSNSSISSEQKPDDKYDALKEEASPVPLTSNVGST VKGGQNSTAASTKYDVFRQLSLEGSGLGVEDLKDNTPSGKSDDDFADFHSSKFSSINSDK 
SLGEKAVAFRHTKEDSASVKSLDLPSIGGSSVGKEDSEDALSVQFDMKLADVGGDLKHVM SDSSLDLPTVSGQHPPAADIEDLKYAAFGSYSSNFAVSTLTSYDWSDRDDATQGRKLSPF VLSAGSGSPSATSILQKKETSFGSSENITMTSLSKVTTFVSEDALPETTFPALASFKDTI PQTSEQKEYENRDYKDFTKQDLPTAERSQEATCPSPASSGASQETPNECSDDFGEFQSEK PKISKFDFLVATSQSKMKSSEEMIKSELATFDLSVQGSHKRSLSLGDKEISRSSPSPALE QPFRDRSNTLNEKPALPVIRDKYKDLTGEVEVIKKANDTLNGISSSSVCTEVIQSAQGME YLLGVVEVYRVTKRVELGIKATAVCSEKLQQLLKDIDKVWNNLIGFMSLATLTCCWEKMT VITKHLSPYHELLEEKPDENSLDFSSCMLRPGIKNAQELACGVCLLNVDSRSRAFNSETD SFKLAYGGHQYHASCANFWINCVEPKPPGLVLPDLL >gi568815581f:37412273_37612866|GENSCAN_predicted_CDS_4|4431_bp nctatatcaaaggcaaagaagtggtatttcatttggcccacagagttagaggaagaggaa gcggtcaagagatcgtcacagagatgggaaggcctagtcaacactaaggaaacgatgtac ctgcagtgccagatgactgtggctctgagtggcagagaagagttttgtgagataaaactg caaacaggcctgatgccgatgcagcaacaaggatttcctatggtctctgtcatgcagcct aatatgcaaggcattatgggaatgaattacagctctcagatgtcccaaggacctattgct atgcaggcaggaataccaatgggaccaatgccagcagcgggaatgccttacctaggacaa gcacccttcctgggcatgcgtcctccaggcccacagtacactccagacatgcagaagcag tttgccgaagagcagcatgaaagtgaagtccaggaaaatgaaacaacttgtcagtccagg ttacacagttaccagcaaagcctggacccaaaggcaggcttcctgattcccaaatcatta ctctttctgctgtatgacactgccttcccaagcaggaaacgatttgaacagcagcaaaaa ctcttagaagaagaaagaaaaagacgccagtttgaagagcagaagcaaaagctcagactt ttgagcagtgtgaaacccaagacaggagagaagagtagagatgatgctttggaagccata aaaggaaatttagatgggttttccagagatgcaaaaatgcaccctactccagcatcgcac cccaagaaaccagattgccccacatcatctcattctactaaaactgtctccccatcacct gcctttcttgatgaagaagaattcagtgactttatgcaggggcctgttgaagttccccca tgtgggccttcttccacgtcccagccctttcagtctttccacccctccaccccacttggc cagttgcatacacagaaggctgggacacagcctctccctccaagtcagagtccagtgcct tttgccttacatggcgtccctgggcaaattccttacttctctactgcttcagcctcacac agtgtaccagaagcaggcccttccttggaggagaagttcctagtatcttgtgatataagt acatctgggcaggaacaaattaaattaaatacttctgaagttggccacaaagccctaggc ccaggttccagtaagaagtatcccagtttaatggccagtaacggggttgctgtagatgga tgtgtaagtggtaccaccactgcagaggcagaaaatacttcagatcaaaacctgtcaatt gaagagagtggtgtgggagtatttccctcacaggatcctgctcagcccagaatgcctcct 
tggatttacaatgagagtttggttccagatgcctataagaaaatcttagaaaccacaatg actccaactggaatagatactgccaaactgtatcccattctgatgtcatctgggcttccc agggaaactcttggacagatatgggccttagctaatcgaactacacctggcaaacttaca aaagaagaactttataccgttctagccatgatagcggtaacacagaggggcgttcctgca atgagtcctgatgctttaaaccagttcccagcagctcctattccaactttaagtggcttt tctatgactctgcctacaccggtgagtcagccaactgtgataccttcaggtcctgcgggc tccatgcccctcagccttggacagccagtcatgggcattaaccttgttggaccagtgggt ggagctgcagcccaggcttctagtggtttcataccaacctaccctgcaaatcaggtagta aagccagaagaagatgacttccaggattttcaagatgcttctaagtcaggatcccttgat gactcattcagtgatttccaagagttgcctgcttcttcaaaaacaagtaactcccagcat ggaaacagtgccccttctttgttgatgccacttcctggaactaaagcattgccttcaatg gacaaatatgctgtgtttaaaggaattgcagctgacaagtcctctgaaaatactgttcca cctggagatcctggtgataaatatagtgctttcagagaacttgaacagacagcagagaat aaacctttaggagaaagctttgcagaattcagatctgcaggaactgatgatggtttcacc gattttaaaacagccgatagtgtatcaccactagagccaccaacaaaagacaaaactttt ccaccatccttcccctcaggaactatacaacagaaacaacaaacacaagtgaaaaaccct ctgaacttagcagacctagatatgttttcctcagttaattgcagcagcgagaaaccattg tctttttcagctgtgtttagcacatcaaaatcagtttctacaccacagtcaacaggttct gctgctactatgacagcattggcagcaacaaaaacttctagtttggctgatgattttgga gaattcagcctttttggggaatattctggtctagcacctgttggggagcaggatgacttt gcagattttatggctttcagtaatagctctatttcatctgagcaaaagccggatgacaaa tatgatgcccttaaagaggaagccagtcctgttcctctaaccagcaacgtgggcagcaca gtgaagggtggacaaaactcgactgctgcgtctaccaagtacgatgtcttcagacaactt tctctggaagggtctggactaggtgttgaagacctgaaagataacactccttcaggaaaa agtgatgatgattttgctgacttccactccagtaaattttcttccataaactcggacaaa tccctgggagagaaagcagtggctttcagacacaccaaagaagactctgcatcagtgaag tccttagatctcccttccattggtggcagcagtgttggcaaggaggactctgaagatgca ctctctgttcagtttgacatgaaattggctgatgtgggaggagatcttaagcatgtcatg tctgatagctctttggatttaccaacagttagtggccagcatcctcctgctgcagatata gaggacttaaaatatgctgcttttggaagctacagtagcaattttgcagtgagcacactt acaagctatgactggtcagacagggatgatgcaactcagggcagaaaactctctccattt gtcctctcagcaggaagtggatccccctcagccacctcaattcttcaaaagaaagagact 
tcatttggcagttctgaaaacatcaccatgacatctctctccaaagtaacgacctttgta agtgaagatgctcttccagagaccaccttcccagctcttgccagttttaaagacacgatt cctcagaccagtgagcaaaaggaatatgaaaacagagactataaagatttcacaaaacag gacctgcctacggctgaacggagccaggaggccacgtgtcccagcccagcgtccagtggt gcctctcaagaaaccccgaacgaatgttcggatgactttggagagtttcaaagtgaaaag cccaaaatcagcaaatttgacttcttagtagccacttcacaaagcaaaatgaaatccagt gaagaaatgatcaaaagtgagctggcaacctttgacctttctgttcaaggatcacacaag aggagtttgagccttggtgataaagaaataagccgttcttctccttctccagctttggag cagcctttcagagaccgttccaatactctgaatgagaagcccgccctgcccgtcatccga gacaagtacaaagacctgacgggagaggtggaggtcattaagaaggcaaatgatacctta aatggaatcagtagtagttctgtttgcacagaagtaattcagtcagctcaaggcatggaa tatttattaggtgttgttgaagtgtacagggtaaccaagcgtgtggagctggggataaaa gccactgcagtgtgcagtgagaaactccagcagttgctgaaggacatcgataaagtatgg aataacctaatcggcttcatgtcactcgccacactcacatgctgctgggaaaagatgact gtgattacaaagcatctctctccttaccatgagctcttagaagaaaagccagatgaaaac tcgctggatttttcctcctgtatgttacggcctgggattaaaaatgctcaggagcttgcc tgtggagtgtgcctcttgaatgtggactcgaggagccgggcattcaactcagaaacagac agtttcaagctggcctatggagggcaccagtatcacgccagctgtgccaacttctggatc aactgtgtcgaaccaaagcctcctggcctcgtcctgcctgacctgctctga