GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:07:48 Sequence gi568815587f:123806432_124007373 : 200942 bp : 37.39% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 919 914 6 1.05 1.02 Term - 21327 20882 446 1 2 42 36 162 0.383 0.81 1.01 Init - 22596 22170 427 0 1 81 53 246 0.758 16.91 1.00 Prom - 30308 30269 40 -3.65 2.02 PlyA - 31464 31459 6 1.05 2.01 Sngl - 35117 34524 594 2 0 69 41 375 0.355 26.84 2.00 Prom - 36699 36660 40 -3.45 3.07 PlyA - 37071 37066 6 1.05 3.06 Term - 48630 48220 411 0 0 94 41 212 0.611 11.36 3.05 Intr - 48772 48685 88 0 1 19 95 53 0.892 -1.85 3.04 Intr - 51081 50970 112 1 1 100 90 58 0.884 5.72 3.03 Intr - 52359 52244 116 2 2 45 67 75 0.619 0.27 3.02 Intr - 55865 55694 172 0 1 105 7 111 0.124 2.78 3.01 Init - 62386 62254 133 1 1 92 40 109 0.736 6.85 3.00 Prom - 63007 62968 40 -3.95 4.00 Prom + 70284 70323 40 -4.55 4.01 Init + 75803 75990 188 1 2 70 16 81 0.274 -2.32 4.02 Intr + 76752 76875 124 0 1 78 92 108 0.324 9.87 4.03 Term + 79189 79248 60 0 0 97 46 64 0.352 -0.07 4.04 PlyA + 79613 79618 6 1.05 5.00 Prom + 89970 90009 40 -5.45 5.01 Init + 100352 100701 350 1 2 114 56 169 0.094 13.22 5.02 Intr + 118208 118233 26 2 2 86 84 -2 0.000 -4.05 5.03 Term + 133418 134142 725 0 2 71 43 551 0.575 41.55 5.04 PlyA + 135086 135091 6 -0.45 6.02 PlyA - 135443 135438 6 1.05 6.01 Sngl - 137407 136436 972 1 0 67 39 627 0.910 52.18 6.00 Prom - 142575 142536 40 -6.25 7.00 Prom + 142929 142968 40 -4.65 7.01 Init + 148916 148991 76 1 1 94 89 57 0.873 7.70 7.02 Term + 166708 166898 191 2 2 41 45 203 0.629 7.93 7.03 PlyA + 167310 167315 6 1.05 8.02 PlyA - 167354 167349 6 1.05 8.01 Sngl - 171323 170265 1059 2 0 47 34 920 0.438 79.39 8.00 Prom - 173433 173394 40 -6.35 9.04 PlyA - 174443 174438 6 1.05 9.03 Term - 177739 177381 359 2 2 58 48 289 0.994 15.59 9.02 Intr - 178173 178027 147 1 0 73 80 122 0.957 9.19 9.01 Init - 184620 184611 10 0 1 57 111 5 0.298 -0.28 9.00 Prom - 185141 185102 40 -6.75 10.03 PlyA - 185663 185658 6 1.05 10.02 Term - 188684 187732 953 0 2 84 38 453 0.562 31.14 10.01 Init - 192062 192005 58 2 1 90 84 64 0.813 7.72 10.00 Prom - 198445 198406 40 -6.25 11.03 PlyA - 198517 198512 6 1.05 11.02 Term - 198808 198691 118 0 1 68 48 183 0.807 9.33 11.01 Init - 199381 199287 95 1 2 76 84 46 0.505 2.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_1|290_aa MARELRDACTSFNSRCNQVEEKVSVIEHQINEIKQEDKVREKRVKRNEQSLQEIWDCVKR PNLCLIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSRREIPR HITVRFTKVEIKEKMLRAAREKDRSMRQKVNKDIQDLNSALHQADLIDIYRTLHPKSSEY TFFSAPHCTYSKIGHIIGSEALLSKCKRTEITTNRLSDHSAIILELRTKKLTQNRTTTWK LNNLLLNDYWVNNEMKAGIKIFFETNENKDTTYQNLWDTFKAVCRGKLIY >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_1|873_bp atggcacgagaacttcgtgacgcatgcacaagcttcaatagccgatgcaatcaagtggaa gaaaaggtatcagtgattgaacatcaaattaatgaaataaagcaagaagataaagttaga gaaaaaagagtaaaaagaaatgaacaaagcctccaagaaatatgggactgtgtgaaaaga ccaaatctatgtttgattggtgtacctgaaagtgatggggagaatggaaccaagttggaa aacactcttcaggatattatccaggagaacttccccaacctagcaaggcaggccaacatt caaattcaggaaatacagagaacaccacaaagatactcctcgagaagagaaatcccaaga cacataactgtcagattcaccaaggttgaaatcaaggaaaaaatgttaagggcagccaga gagaaagacagatcgatgagacagaaggttaataaggatatccaggacttgaactcagct ctgcaccaagcagacctaatagacatctatagaactctccaccccaaatcatcagaatat acattcttctcagcaccacattgcacctattctaaaattggccacataattggaagtgaa gcactcctcagcaaatgtaaaagaacagaaatcacaacaaaccgtctctcagaccacagt gcaatcatattagaactcaggactaagaaactcactcaaaaccgcacaactacatggaaa ctgaacaacctgctcctgaatgactactgggtgaataatgaaatgaaggcaggaataaag attttctttgaaaccaatgaaaacaaagacacaacataccagaatctctgggacacattt aaagcagtgtgtagagggaaacttatatactaa >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_2|197_aa MSFDCYVAICDPLHYTIIMNSRACLLLVLGCWVGAFLSVLCPTIVVSRLPFCYKEISHFF CDITPLLHVSCIDTHFIEMINFLLSSLILLTSLVLTTVSYIYIISTILHIPSAQGRRKAF STCASHITVISIAYISNIFRYVRPSQSHSMGFDKVTAVPTMVTPLLNPFTYSLRNEKVKA VLKEAVSKIMSSWHRRT >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_2|594_bp atgtcctttgactgctacgtggccatctgtgaccccctgcactacaccattatcatgaac agcagggcctgcctcctactagttctgggctgctgggttggagccttcctgtctgtgttg tgcccaaccattgtggtgtccagattgcctttctgttacaaggaaattagtcacttcttc tgtgacatcacccctctgctacatgtgtcctgtatagacactcatttcatcgagatgata aacttcctcttatcttccctcatcctcctgacctcactggtgctcaccactgtgtcctac atctacatcatttctaccatcctgcacatcccctcagcccaaggacgtcggaaggccttt tccacgtgcgcttcccacatcaccgtcatttccatcgcttatataagcaacatcttcagg tatgtgaggcccagccagagtcattcaatgggttttgacaaggtgacagctgtccccaca atggtgacccctcttctgaatcccttcacttatagtctaagaaatgaaaaggtaaaggca gtcttgaaagaagcagtcagcaaaattatgtcctcatggcacaggagaacttaa >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_3|343_aa MDAELRIALTVETKSVPFLINTEATHSTLPSFQESVSLASITVVGTVAFIPLAVTSFKHC MATCDPLCSTIIAKSRACLLLALGCWMGTFLAVLRLTIVVSRPVSQPVCTAVVWIGGEVM FKCKEEVAFTWIQQKAYRPIGPRGSEARIQCAQQLGELPEIPVSLTSLKALRGLRMIGAE DPPSKEKWVGVPVKEVVWLKKQSVHHLAGMAESTEPQIWWSSLLPGTPCQGEIGVLSVEQ RTHAVRRTMSGPRLKKQSGPDLARPLCCTVGAPIHLDCMRSPQPASWNGCFPLNHKDGGR PSLQEQGLVSSRLNPLPLAGQILTQWVLTHEVLWKWGLQNDAA >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_3|1032_bp atggatgccgagcttcgaatagctctcacagtggaaactaagtccgtccccttcttaatc aatacggaggctacccactccacattaccttcttttcaagagtctgtttcccttgcctcc ataactgttgtagggacagtggcgtttatccccttggcagtgacatccttcaaacactgc atggcaacctgtgaccccctgtgcagcaccatcattgcaaaaagcagggcctgcctcctg ctggctctgggatgctggatgggaaccttcctggctgtgttgcgcctgactattgtggtg tccagacccgtatcccaaccagtttgtacagcggtggtctggattggaggtgaagttatg ttcaagtgcaaagaggaagttgctttcacttggatacaacaaaaggcctataggcctata ggaccaagagggagtgaagctcgcatccaatgtgcccagcagttaggggagcttcctgag atacctgtttctctcacttccctcaaagcattgagaggacttagaatgataggagctgag gacccacccagtaaggagaagtgggtcggggtcccggttaaagaagtagtctggctaaag aagcagtctgtccaccatctggctgggatggctgagtctacagaaccacagatatggtgg tcatccctcctaccaggaactccctgccagggagagatcggagttctgtctgtggaacaa aggacccacgcagtaaggaggactatgtcagggcccaggttaaagaagcagtctggcccc gatctggcaaggccattgtgctgcaccgtgggggcccctattcatctggactgtatgcgt tctccacagccagcaagctggaatggctgtttccctctgaaccataaagatggtggccgt ccttcccttcaggaacaaggtcttgtctccagccgacttaaccctctgcccttggctggc cagattctaacccagtgggtcttaacccatgaggtactatggaagtggggcctacagaac gatgctgcttga >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_4|123_aa MFLGIYPKKLKAYVHTKTCTQVFITVLFITAQTWKQPRYPSVGEIDKPTVVQSYDGILLS PTKEFTVCARTMLRGIAVHSGNEISSIEFSDSLHSLSDLWMFRQGQPAVRTANCHGIIYW FDP >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_4|372_bp atgttccttggtatttacccaaaaaagttgaaagcttatgttcacacaaaaacctgcaca caggtgtttataacagttttattcataactgcccaaacttggaaacaaccaagatatcct tcagtaggtgaaatagataaaccaactgtggtacagtcttacgatggaatattacttagc cccacaaaggaattcacagtgtgtgcacggacaatgctacgaggcattgcagtgcattct ggtaatgaaatatcttcgatagaattctcagattccttacattcgttgtcagatttatgg atgttcaggcagggccagcctgctgtcaggactgccaactgccatggaatcatctattgg tttgacccttga >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_5|366_aa MACDRYVAICSPLLYRVIMSPRVCSLLVAAVFSVGFTDAVIHGGCILRLSFCGSNIIKHY FCDIVPLIKLSCSSTYIDELLIFVIGGFNMVATSLTIIISYAFILTSILRIHSKKGSHFR VSGLKSPRMLVDLLSGNPTISFGGCLTQLFFFHFIGGIKIFLLTVMAYDRYIAISQPLHY TLIMNQTVCALLMAASWVGGFIHSIVQIALTIQLPFCGPDKLDNFYCDVPQLIKLACTDT FVLELLMVSNNGLVTLMCFLVLLGSYTALLVMLRSHSREGRSKALSTCASHIAVVTLIFV PCIYVYTRPFRTFPMDKAVSVLYTIVTPMLNPAIYTLRNKEVIMAMKKLWRRKKDPIGPL EHRPLH >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_5|1101_bp atggcctgcgatcgctacgtggccatctgcagcccactgctctacagggtcatcatgtcc cctagggtctgttctctgctggtggctgctgtcttctcagtaggtttcactgatgctgtg atccatggaggttgtatactcaggttgtctttctgtggatcaaacatcattaaacattat ttctgtgacattgtccctcttattaaactctcctgctccagcacttatattgatgagctt ttgatttttgtcattggtggatttaacatggtggccacaagcctaacaatcattatttca tatgcttttatcctcaccagcatcctgcgcatccactctaaaaagggcagtcattttaga gtatctggtctcaagtcacctaggatgctggttgacttgctctcaggcaaccctaccatt tcctttggtggatgcctgactcaactcttcttcttccacttcattggaggcatcaagatc ttcctgctgactgtcatggcgtatgaccgctacattgccatttcccagcccctgcactac acgctcattatgaatcagactgtctgtgcactccttatggcagcctcctgggtggggggc ttcatccactccatagtacagattgcattgactatccagctgccattctgtgggcctgac aagctggacaacttttattgtgatgtgcctcagctgatcaaattggcctgcacagatacc tttgtcttagagcttttaatggtgtctaacaatggcctggtgaccctgatgtgttttctg gtgcttctgggatcgtacacagcactgctagtcatgctccgaagccactcacgggagggc cgcagcaaggccctgtctacctgtgcctctcacattgctgtggtgaccttaatctttgtg ccttgcatctacgtctatacaaggccttttcggacattccccatggacaaggccgtctct gtgctatacacaattgtcacccccatgctgaatcctgccatctataccctgagaaacaag gaagtgatcatggccatgaagaagctgtggaggaggaaaaaggaccctattggtcccctg gagcacagacccttacattag >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_6|323_aa MNPENWTQVTSFVLLGFPSSHLIQFLVFLGLMVTYIVTATGKLLIIVLSWIDQRLHIQMY FFLRNFSFLELLLVTVVVPKMLVVILTGDHTISFVSCIIQSYLYFFLGTTDFFLLAVMSL DRYLAICRPLRYETLMNGHVCSQLVLASWLAGFLWVLCPTVLMASLPFCGPNGIDHFFRD SWPLLRLSCGDTHLLKLVAFMLSTLVLLGSLALTSVSYACILATVLRAPTAAERRKAFST CASHLTVVVIIYGSSIFLYIRMSEAQSKLLNKGASVLSCIITPLLNPFIFTLRNDKVQQA LREALGWPRLTAVMKLRVTSQRK >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_6|972_bp atgaaccctgaaaactggactcaggtaacaagctttgtccttctgggtttccccagtagc cacctcatacagttcctggtgttcctggggttaatggtgacctacattgtaacagccaca ggcaagctgctaattattgtgctcagctggatagaccaacgcctgcacatacagatgtac ttcttcctgcggaatttctccttcctggagctgttgctggtaactgttgtggttcccaag atgcttgtcgtcatcctcacgggggatcacaccatctcatttgtcagctgcatcatccag tcctacctctacttctttctaggcaccactgacttcttcctcttggccgtcatgtctctg gatcgttacctggcaatctgccgaccactccgctatgagaccctgatgaatggccatgtc tgttcccaactagtgctggcctcctggctagctggattcctctgggtcctttgccccact gtcctcatggccagcctgcctttctgtggccccaatggtattgaccacttctttcgtgac agttggcccttgctcaggctttcttgtggggacacccacctgctgaaactggtggctttc atgctctctacgttggtgttactgggctcactggctctgacctcagtttcctatgcctgc attcttgccactgttctcagggcccctacagctgctgagcgaaggaaagcgttttccact tgcgcctcgcatcttacagtggtggtcatcatctatggcagttccatctttctctacatt cgtatgtcagaggctcagtccaaactgctcaacaaaggtgcctccgtcctgagctgcatc atcacacccctcttgaacccattcatcttcactctccgcaatgacaaggtgcagcaagca ctgagagaagccttggggtggcccaggctcactgctgtgatgaaactgagggtcacaagt caaaggaaatga >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_7|88_aa MEEAGNHYPQKTNAGTVNQNPHVLTYLKKNYMTISADGEKALDNIRHPLIKTLNNLEVKG DFLNLMKDVYENPTPNLSKYPEKLNAFP >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_7|267_bp atggaggaagctggaaaccattatcctcagaaaacgaatgcaggaacagtaaaccaaaac ccgcatgttcttacttacttaaagaaaaattatatgaccatctcagcagatggagaaaaa gcactggacaatatccggcatccattaataaaaactctcaataacttagaagtaaaaggg gacttcctcaacctgatgaaggatgtctatgaaaatcctacacctaacctaagcaaatat cctgaaaaactaaatgctttcccctag >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_8|352_aa MTSQERDTAIYSINVSFVAKGMTSRSVCEKMTMTTENPNQTVVSHFFLEGLRYTAKHSSL FFLLFLLIYSITVAGNLLILLTVGSDSHLSLPMYHFLGHLSFLDACLSTVTVPKVMAGLL TLDGKVISFEGCAVQLYCFHFLASTECFLYTVMAYDRYLAICQPLHYPVAMNRRMCAEMA GITWAIGATHAAIHTSLTFRLLYCGPCHIAYFFCDIPPVLKLACTDTTINELVMLASIGI VAAGCLILIVISYIFIVAAVLRIRTAQGRQRAFSPCTAQLTGVLLYYVPPVCIYLQPRSS EAGAGAPAVFYTIVTPMLNPFIYTLRNKEVKHALQRLLCSSFRESTAGSPPP >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_8|1059_bp atgacatctcaggaaagggatacagctatttattccattaatgtcagttttgttgcaaag gggatgactagccgctctgtgtgtgagaagatgaccatgacaacggagaaccccaaccag actgtggtgagccacttcttcctggagggtttgaggtacaccgctaaacattctagcctc ttcttcctcctcttcctcctcatctacagcatcactgtggctgggaatctcctcatcctc ctaactgtgggctctgactctcacctcagcttacccatgtaccacttcctggggcacctc tccttcctggatgcctgtttgtctacagtgacagtgcccaaggtcatggcaggcctgctg actctggatgggaaggtgatctcctttgagggctgtgccgtacagctttattgcttccac tttctggccagcactgagtgcttcctgtacacagtcatggcctatgaccgctatctggct atctgtcaacccctgcactacccagtggccatgaacagaaggatgtgtgcagaaatggct ggaatcacctgggccataggtgccacgcacgctgcaatccacacctccctcaccttccgc ctgctctactgtgggccttgccacattgcctacttcttctgcgacataccccctgtccta aagctcgcctgtacagacaccaccattaatgagctagtcatgcttgccagcattggcatc gtggctgcaggctgcctcatcctcatcgttatttcctacatcttcatcgtggcagctgtg ttgcgcatccgcacagcccagggccggcagcgggccttctccccctgcactgcccagctc actggggtgctcctgtactacgtgccacctgtctgtatctacctgcagcctcgctccagt gaggcaggagctggggcccctgctgtcttctacacaatcgtaactccaatgctcaaccca ttcatttacactttgcggaacaaggaggtgaagcatgctctgcaaaggcttttgtgcagc agcttccgagagtctacagcaggcagcccacccccatag >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_9|171_aa MMFGKTPTEVVRHSIQEHSYWHQISAPKGQRSQRKEEASIFAVLQPPQVTSPARVQNWME DEMDDLTEASFSRWVITNFTEVKEHVLTQCKEAKNLEKEELLTRIISLERKVNNLMELKN TARELREAYTSINSQMDYAEERISEMEDYLAEIRQTDKIREKRMKKHKQNL >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_9|516_bp atgatgtttggtaagacaccaacagaggttgtcagacactctatacaggagcattcctac tggcatcagatcagtgcccctaaaggtcagagatcccagaggaaggaggaggcatccatc tttgctgtccttcagcctcctcaagtgacatctccagcaagggtgcagaactggatggag gatgagatggatgatttgacagaagcaagcttcagtagatgggtaataacaaacttcact gaggtaaaggagcatgttctaacccaatgcaaagaagctaagaaccttgaaaaagaggag ctgctaactagaataatcagtttagagaggaaagtaaacaacctgatggagctgaaaaac acagcacgagaacttcgtgaagcatacacaagtatcaatagccaaatggattacgcagaa gaaagaatatcagagatggaagactatcttgctgaaataaggcagacagacaagattaga gaaaaaagaatgaaaaagcacaaacaaaacctctaa >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_10|336_aa MEGTLDPPNGIMLYKYLERDVNSKELQSGNQTSVSHFILVGLHHPPQLGAPLFLAFLVIY LLTVSGNGLIILTVLVDIRLHRPMCLFLCHLSFLDMTISCAIVPKMLAGFLLGSRIISFG GCVIQLFSFHFLGCTECFLYTLMAYDRFLAICKPLHYATIMTHRVCNSLALGTWLGGTIH SLFQTSFVFRLPFCGPNRVDYIFCDIPAMLRLACADTAINELVTFADIGFLALTCFMLIL TSYGYIVAAILRIPSADGRRNAFSTCAAHLTVVIVYYVPCTFIYLRPCSQEPLDGVVAVF YTVITPLLNSIIYTLCNKEMKAALQRLGGHKEVQPH >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_10|1011_bp atggaaggcacattagatcctcctaatggcattatgctttacaaatacctggagagggat gtgaacagcaaggaactgcaaagtggaaaccagacttctgtgtctcacttcattttggtg ggcctgcaccacccaccacagctgggagcgccactcttcttagctttccttgtcatctat ctcctcactgtttctggaaatgggctcatcatcctcactgtcttagtggacatccggctc catcgtcccatgtgcttgttcctgtgtcacctctccttcttggacatgaccatttcttgt gctattgtccccaagatgctggctggctttctcttgggtagtaggattatctcctttggg ggctgtgtaatccaactattttctttccatttcctgggctgtactgagtgcttcctttac acactcatggcttatgaccgtttccttgccatttgtaagcccttacactatgctaccatc atgacccacagagtctgtaactccctggctttaggcacctggctgggagggactatccat tcacttttccaaacaagttttgtattccggctgcccttctgtggccccaatcgggtcgac tacatcttctgtgacattcctgccatgctgcgtctagcctgcgccgatacggccatcaac gagctggtcacctttgcagacattggcttcctggccctcacctgcttcatgctcatcctc acttcctatggctatattgtagctgccatcctgcgaattccgtcagcagatgggcgccgc aatgccttctccacttgtgctgcccacctcactgttgtcattgtttactatgtgccctgc accttcatttacctgcggccttgttcacaggagcccctggatggggtggtagctgtcttt tacactgtcatcactcccttgcttaactccatcatctacacactgtgcaacaaagaaatg aaggcagcattacagaggctagggggccacaaggaagtgcagcctcactga >gi568815587f:123806432_124007373|GENSCAN_predicted_peptide_11|70_aa MNGKLVTKKFGEEVCGWTSLSGQKMWGYLYPIVLPLTKALTLRLKKYINGLVIMEFTGLT TFLIILKQLD >gi568815587f:123806432_124007373|GENSCAN_predicted_CDS_11|213_bp atgaatggaaaattggtgacaaagaagtttggagaagaggtatgtggatggacctctctg agtggtcaaaaaatgtggggatatttgtatcccatagtattgcctctgaccaaggcactc actttacggctaaagaagtacatcaatgggcttgtgatcatggaattcactggtcttacc acgttcctcatcattctgaagcagctggattga