GENSCAN 1.0 Date run: 3-Jul-120 Time: 20:43:09

Sequence gi568815587f:89077954_89395363 : 317410 bp : 35.10% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  21794  21833   40                              -3.55
 1.01 Init +  29738  29894  157  1  1   61   80   182 0.722  14.82
 1.02 Intr +  64344  64437   94  1  1   14   87   118 0.229   2.40
 1.03 Term +  90512  90641  130  2  1   66   38   113 0.042   0.77
 1.04 PlyA +  90791  90796    6                               1.05

 2.00 Prom +  98630  98669   40                              -6.65
 2.01 Init + 100001 100819  819  1  0   59  110   559 0.726  47.95
 2.02 Intr + 113249 113465  217  1  1   35   84   171 0.782   8.55
 2.03 Term + 131638 131948  311  2  2   34   42   249 0.023   9.14
 2.04 PlyA + 132431 132436    6                               1.05

 3.00 Prom + 138219 138258   40                              -4.95
 3.01 Init + 154403 154476   74  1  2   78   23   102 0.429   3.29
 3.02 Intr + 168803 168963  161  2  2   35   87   179 0.583  11.21
 3.03 Intr + 206820 207001  182  1  2   81  103   106 0.356  10.07
 3.04 Term + 210485 210568   84  1  0   94   43    32 0.103  -4.03
 3.05 PlyA + 211696 211701    6                               1.05

 4.04 PlyA - 212502 212497    6                               1.05
 4.03 Term - 216140 215788  353  0  2   37   32   299 0.668  12.86
 4.02 Intr - 237332 235761 1572  0  0   72   60   337 0.385  17.36
 4.01 Init - 238917 237483 1435  0  1   44   40   569 0.420  39.02
 4.00 Prom - 239010 238971   40                              -6.15

 5.02 PlyA - 239179 239174    6                               1.05
 5.01 Sngl - 240423 239830  594  0  0   88   49   581 0.999  50.14
 5.00 Prom - 244984 244945   40                              -6.35

 6.00 Prom + 246670 246709   40                              -6.45
 6.01 Sngl + 250003 250254  252  0  0   64   38   268 0.996  14.25
 6.02 PlyA + 250593 250598    6                               1.05

 7.08 PlyA - 252037 252032    6                               1.05
 7.07 Term - 252667 252601   67  0  1  109   43    20 0.031  -3.97
 7.06 Intr - 257992 257892  101  1  2   52  110    38 0.099   0.39
 7.05 Intr - 259562 259494   69  2  0   97   91    62 0.689   5.86
 7.04 Intr - 263923 263830   94  0  1   52  115    16 0.411  -0.25
 7.03 Intr - 264240 264121  120  2  0   83   28   130 0.778   5.19
 7.02 Intr - 287120 287022   99  1  0   97   74    58 0.243   3.61
 7.01 Init - 289540 289464   77  1  2   46  100    45 0.546   2.11
 7.00 Prom - 294035 293996   40                              -6.15

 8.00 Prom + 304952 304991   40                              -2.75
 8.01 Init + 307260 307406  147  2  0   50   65    93 0.865   3.34
 8.02 Intr + 307728 307901  174  2  0   60   86   130 0.926   9.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:89077954_89395363|GENSCAN_predicted_peptide_1|126_aa
MAVHECDELIKTKERNTTKRREEEAVKLTNGQKQFVFPLNRVKNVIIIHEASANNEIQSE
NGRVTGIDREGEMERWCMPVSVPRQWLPASFCKATDPASRLSRYSDLLCVPVQLFIALAL
HSQCTA

>gi568815587f:89077954_89395363|GENSCAN_predicted_CDS_1|381_bp
atggctgtgcatgagtgtgatgaattaatcaagaccaaggaaagaaacaccacaaagagg
cgagaggaagaagctgtgaagctcacaaatggccagaaacagtttgtgttcccactaaac
agggtgaaaaacgtcatcataattcatgaagcatcagcaaataacgagattcagtcagaa
aatggtcgtgtgactggaattgatagagaaggagaaatggagaggtggtgtatgcctgtg
agtgtgccgagacagtggttgcctgcatccttctgtaaggccacagatcctgccagcagg
ctcagtcgatacagtgatcttctctgtgttccagttcaattgttcattgcccttgccctt
cattcccagtgtactgcgtga

>gi568815587f:89077954_89395363|GENSCAN_predicted_peptide_2|448_aa
MLLAVLYCLLWSFQTSAGHFPRACVSSKNLMEKECCPPWSGDRSPCGQLSGRGSCQNILL
SNAPLGPQFPFTGVDDRESWPSVFYNRTCQCSGNFMGFNCGNCKFGFWGPNCTERRLLVR
RNIFDLSAPEKDKFFAYLTLAKHTISSDYVIPIGTYGQMKNGSTPMFNDINIYDLFVWMH
YYVSMDALLGGSEIWRDIDFAHEAPAFLPWHRLFLLRWEQEIQKLTGDENFTIPYWDWRD
AEKCDICTDEYMGGQHPTNPNLLSPASFFSSWQIVCSRLEEYNSHQSLCNGTPEGPLRRN
PGNHDKSRTPRLPSSADVEFCLSLTQYESGSMDKAANFSFRNTLEALKRAVALGAQRSSS
DNGQTASSSGSLTPCSLTGRHYPVEADSCLIQAGAPLGRSFQRKDRAAIFAVLQAPLVIP
RQTGSGADLKQTPTDLQLRGLSVTRKTN

>gi568815587f:89077954_89395363|GENSCAN_predicted_CDS_2|1347_bp
atgctcctggctgttttgtactgcctgctgtggagtttccagacctccgctggccatttc
cctagagcctgtgtctcctctaagaacctgatggagaaggaatgctgtccaccgtggagc
ggggacaggagtccctgtggccagctttcaggcagaggttcctgtcagaatatccttctg
tccaatgcaccacttgggcctcaatttcccttcacaggggtggatgaccgggagtcgtgg
ccttccgtcttttataataggacctgccagtgctctggcaacttcatgggattcaactgt
ggaaactgcaagtttggcttttggggaccaaactgcacagagagacgactcttggtgaga
agaaacatcttcgatttgagtgccccagagaaggacaaattttttgcctacctcacttta gcaaagcataccatcagctcagactatgtcatccccatagggacctatggccaaatgaaa aatggatcaacacccatgtttaacgacatcaatatttatgacctctttgtctggatgcat tattatgtgtcaatggatgcactgcttgggggatctgaaatctggagagacattgatttt gcccatgaagcaccagcttttctgccttggcatagactcttcttgttgcggtgggaacaa gaaatccagaagctgacaggagatgaaaacttcactattccatattgggactggcgggat gcagaaaagtgtgacatttgcacagatgagtacatgggaggtcagcaccccacaaatcct aacttactcagcccagcatcattcttctcctcttggcagattgtctgtagccgattggag gagtacaacagccatcagtctttatgcaatggaacgcccgagggacctttacggcgtaat cctggaaaccatgacaaatccagaaccccaaggctcccctcttcagctgatgtagaattt tgcctgagtttgacccaatatgaatctggttccatggataaagctgccaatttcagcttt agaaatacactggaagctctgaagagagcagtggctcttggagcacagcgttcgagctcc gataatggacagacggcctcctcaagtgggtccctgaccccgtgtagcctgactgggaga cactacccagtagaggccgacagctgcctcatacaggcaggtgcccctctgggacgaagc ttccagaggaaggatcgggcagcaatatttgctgttctgcaggctccattggtgataccc aggcaaacagggtctggagcggacctcaagcaaactccaacagacctgcagctcaggggc ctgtctgttacaaggaaaactaactaa >gi568815587f:89077954_89395363|GENSCAN_predicted_peptide_3|166_aa MEYYKKEQNDVLRSIVDVAVGHNLKRLANLAAQKQSNQKGGSKKGPWGFRNEQQIASLLV TARLKLAQKGRNLRVMGVVFLSSGSEGTVLFKKFIQKPMHPLDITGNPTWFLLYHCTEMV ISLFHPKIWAMTIAIYKIQCPNGYFLLQRQQNFLDEPGEKRLNESY >gi568815587f:89077954_89395363|GENSCAN_predicted_CDS_3|501_bp atggaatactataagaaagaacaaaatgatgtccttcgcagcatcgtggatgtagctgta ggccataatctgaaacgccttgccaacttggcagcccagaaacagtccaaccagaagggc ggcagtaagaaaggtccctggggcttcaggaatgaacagcagattgcttccctgctggtg actgcaaggctgaagcttgctcagaaaggcagaaatctcagggttatgggggtagtattt ttgagcagtggctccgaaggcaccgtcctcttcaagaagtttatccagaagccaatgcac ccattggacataaccgggaatcctacatggttccttttataccactgtacagaaatggtg atttctttatttcatccaaagatctgggctatgactatagctatctacaagattcagtgt cccaatggatactttttactacaaaggcaacaaaatttcctggatgagccaggggagaag aggctcaatgagtcatattag >gi568815587f:89077954_89395363|GENSCAN_predicted_peptide_4|1119_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFISAPHHTYS 
KIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDCWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKGKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAEVKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQTDTIKNDKGDITTDPTETQTTIREYYKHLYANKLENVEEMDKFLDTYTLPRLN QEEVESLNRPITGVEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHD QVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLVL EVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYK INVQKPQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEI KEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQK RARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHTY NYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWINDLNVRPK TIKTLEENLGITIQDIGMGKGFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQ PTTWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWARDMNRHFSKEDIYAAKKHM KKCSSSLAIREMQIKTTMRYHLTPLRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKVVQP LWKSVWRFLRDLELEIPFDPAIPLLGIYPNDYKSCCYKDTCTQNLDCPELIETFLNSKKP GKKKLQKERKSLSDNESNDSKSKKRDAADKPRGFTRDLDPERIIGATGSSGEWMFLLKWE YSHQADLLLAKAANMKCLEIVIVFYKERLSWHSCPDDEA >gi568815587f:89077954_89395363|GENSCAN_predicted_CDS_4|3360_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttatttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaagccgctcaactacatggaaactgaacaacctgctcctgaatgactgctgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggatgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagggaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaagtgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccactagcaagactaataaag aaaaaaagagagaagaatcaaacagacacaataaaaaatgataaaggggatatcaccacc gatcccacagaaacacaaactaccatcagagaatactacaaacacctctacgcaaataaa 
ctagaaaatgtagaagaaatggataaattcctggacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggagttgaaattgtggcaataatc aatagtttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccgggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattagtgttg gaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaa gaggaagtcaaattgtccctgtttgcagacgacatgattgtttatctagaaaaccccatt gtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa atcaatgtacaaaaaccacaagcattcctatacaccaacaacagacaaacagagagccaa atcatgagtgaactcccattcacaattgcttccaagagaataaaatacctaggaatccaa cttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaata aaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatata gtgaaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagcta ccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa agagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacacta cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa aacagagatatagatcaatggaacagaacagagccctcagaaataatgccgcatacctac aactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctattt aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaatcaattcaagatggattaatgatttaaacgttagacctaaa accataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaag ggcttcatgtccaaaacaccaaaagcaatggcaaccaaagacaaaattgacaaatgggat ctcattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa cctacaacatgggagaaaattttcacaacctactcatctgacaaagggctaatatccaga atctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgg gcaagggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatg aaaaaatgctcatcatcgctggccatcagagaaatgcaaatcaaaaccactatgagatat 
catctcacaccacttagaatggcaatcattaaaaagtcaggaaacaacaggtgctggaga ggatgtggagaaataggaacacttttacactgttggtgggactgtaaagtagttcaacca ttgtggaagtcagtgtggcgatttctcagggatctagaactagaaataccatttgaccca gccatcccattactgggtatatacccaaatgactataaatcatgctgctataaagacaca tgcacacaaaatttagattgtccagagttaattgaaacattccttaattctaaaaaacct ggtaaaaaaaaattacaaaaagaaagaaagtctttatctgacaatgaatctaatgacagc aaatcaaagaaaagagatgctgccgacaaaccaagaggcttcaccagagatcttgatcct gaaagaataattggtgccacaggcagcagtggagaatggatgtttctcctgaaatgggaa tattcacatcaggccgacttgctgctggcaaaagcggcaaatatgaagtgtcttgaaatt gtaattgttttttacaaagagagactaagttggcattcttgtccagatgatgaagcttaa >gi568815587f:89077954_89395363|GENSCAN_predicted_peptide_5|197_aa MGKKQNRKTANSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDLQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCYQLEERVSA MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDAENGTKLENPLQD IIQENFPNLARQANVQI >gi568815587f:89077954_89395363|GENSCAN_predicted_CDS_5|594_bp atggggaaaaaacagaacagaaaaactgcaaactctaaaacgcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgattttgat gagctgagagaggaaggcttcagacgatcaaattactctgagctacgggaggaccttcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgctatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatgcggagaatggaaccaagttggaaaaccctctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaatgttcagatttag >gi568815587f:89077954_89395363|GENSCAN_predicted_peptide_6|83_aa MIPEKREENKLSTVTLFAYCLEAICWLQQTGETPAKIGDYTEETKNGVTLRPKQLEFVEQ NMEGKGDPQKDASEIGRGESSSL >gi568815587f:89077954_89395363|GENSCAN_predicted_CDS_6|252_bp atgatccctgagaaaagggaagaaaacaagctaagcaccgtcactctctttgcttattgc ctggaggcaatctgctggctgcagcagacaggggaaactccagcaaagattggtgactac actgaggagacaaagaacggagttacattaaggcccaaacagctggaatttgtggagcag 
aatatggaagggaagggagatccacagaaagacgcctcggagattggtagaggagaatcc tcaagtctttga >gi568815587f:89077954_89395363|GENSCAN_predicted_peptide_7|208_aa MSLGRCKGKVWRWYILLRITEGIGDQQEESKRREGRKGSTVYSRERIRQYDGSGKNPGLL YIDGPFGSPFEESLNYEVSLCVAGGIGVTPFASILNTLLKMSYHSYALLCPFPLLKVIRL HPSFSMSSSAFWQENRPDYVNIQLYLSQTDGIQKIIGEKYHALNSRLFIGRPRWKLLFDE IAKYNRGNCFRLLVPELDSLDQHQLESC >gi568815587f:89077954_89395363|GENSCAN_predicted_CDS_7|627_bp atgtcattaggaagatgcaaaggaaaggtctggcgatggtacatattgctgcgaattact gagggcattggagaccagcaagaagaaagtaaaaggagggaggggagaaaagggagtact gtatattctcgggagagaataaggcagtatgacggcagtggaaagaaccctggactgctg tatattgatggtccttttggaagtccatttgaggaatcactgaactatgaggtcagcctc tgcgtggctggaggcattggagtaactccatttgcatcaatactcaacaccctgttaaag atgtcatatcacagttatgcactcctctgtccttttcctctgttaaaagtcattcgcctc catccttcattttctatgtcctcatcagcgttttggcaagagaacagacctgactatgtc aacatccagctgtacctcagtcaaacagatgggatacagaagataattggagaaaaatat catgcactgaattcaagactgtttataggacgtcctcggtggaaacttttgtttgatgaa atagcaaaatataacagaggtaattgctttaggttgttggttcctgaactggattccttg gaccagcatcaattggaatcttgttag >gi568815587f:89077954_89395363|GENSCAN_predicted_peptide_8|107_aa MQGSSIINASLIKTLLKATLLPKEAGVIHCKGHQKASDPIPQGNAYAVKGFFRPPPFPTS QAREFAPDQDWQIDFTHMTRVRKLKYLLVWVDTFIGWIEAFPTGSEK >gi568815587f:89077954_89395363|GENSCAN_predicted_CDS_8|321_bp atgcaaggttcctccatcattaatgcctctttaataaaaacacttctcaaggccacttta cttccaaaggaagctggagtcattcactgcaagggtcatcaaaaggcatcagatcccatc cctcagggtaatgcttatgctgttaagggatttttcaggccccctcccttccctacaagt caagctcgagaatttgcccctgaccaggactggcaaattgactttactcacatgacccga gtcaggaaactaaaatacctcttggtctgggtagacactttcattggatggatagaggcc tttcccacagggtctgagaag