GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:42:16 Sequence gi568815594f:82935225_83175427 : 240203 bp : 41.04% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 Intr - 894 757 138 0 0 85 78 80 0.837 6.44 1.06 Intr - 4512 4315 198 0 0 92 91 109 0.942 10.23 1.05 Intr - 4738 4665 74 2 2 55 108 76 0.993 4.51 1.04 Intr - 11250 11034 217 0 1 83 96 191 0.992 16.55 1.03 Intr - 35245 35103 143 2 2 120 93 62 0.968 9.05 1.02 Intr - 43782 43659 124 0 1 45 29 107 0.585 -0.26 1.01 Init - 49620 48937 684 0 0 91 98 446 0.834 41.04 1.00 Prom - 52641 52602 40 -3.65 2.00 Prom + 63679 63718 40 -3.95 2.01 Init + 64960 65037 78 0 0 114 36 22 0.817 0.71 2.02 Term + 75367 75864 498 0 0 51 49 240 0.862 9.93 2.03 PlyA + 75869 75874 6 1.05 3.00 Prom + 88414 88453 40 -3.65 3.01 Init + 100001 100074 74 1 2 91 89 102 0.028 11.19 3.02 Intr + 110402 110481 80 2 2 67 108 147 0.856 12.88 3.03 Intr + 113942 114093 152 0 2 33 98 63 0.719 0.76 3.04 Intr + 114657 114760 104 2 2 81 14 64 0.474 -3.65 3.05 Intr + 121702 121855 154 1 1 67 87 149 0.978 11.85 3.06 Intr + 122034 122184 151 2 1 87 97 110 0.993 10.61 3.07 Intr + 127852 128022 171 2 0 80 50 189 0.750 13.29 3.08 Intr + 131214 131329 116 1 2 58 65 77 0.695 1.65 3.09 Intr + 133214 133298 85 1 1 63 69 56 0.677 -0.33 3.10 Term + 140073 140206 134 1 2 98 41 113 0.967 4.87 3.11 PlyA + 140546 140551 6 1.05 4.06 PlyA - 143555 143550 6 1.05 4.05 Term - 149131 149011 121 0 1 117 48 88 0.010 4.67 4.04 Intr - 172711 172540 172 2 1 51 53 138 0.001 4.68 4.03 Intr - 182747 182660 88 2 1 87 101 130 0.984 12.82 4.02 Intr - 200431 199864 568 0 1 2 25 389 0.120 15.85 4.01 Init - 201132 200588 545 0 2 49 45 322 0.602 18.41 4.00 Prom - 210716 210677 40 -5.45 5.00 Prom + 220651 220690 40 -4.95 5.01 Init + 228845 228903 59 1 2 95 109 79 0.911 11.53 5.02 Intr + 229598 229839 242 2 2 29 71 161 0.760 4.77 5.03 Term + 229925 230856 932 0 2 51 42 641 0.856 47.11 5.04 PlyA + 231111 231116 6 1.05 6.00 Prom + 231847 231886 40 -10.65 6.01 Sngl + 232082 232903 822 1 0 33 48 321 0.986 18.09 6.02 PlyA + 232995 233000 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 67385 67510 126 1 0 104 38 109 0.948 4.80 S.002 Init + 144186 144284 99 2 0 40 100 106 0.816 7.31 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:82935225_83175427|GENSCAN_predicted_peptide_1|526_aa MEVVPAEVNSLLPEEIMDTGITLVDDDSIEAVIVSSPIPMETELEEIVNINSTGDSTATP ISTEPITVYSNHTNQVAVNTTITKADSNTTVKPAFPSGLQKLGAQTPVTISANQIILNKV SQTSDLKLGNQTLKPDGQKLILTTLGKSGSPIVLALPHSQLPQAQKVTTQAQSGDAKLPP QQIKVVTIGGRPEVKPVIGVSALTPGSQLINTTTQPSVLQTQQLKTVQIAKKPRTPTSGP VITKLIFAKPINSKAVTGQTTQVSPPVIAGRVLSQSTPGTPSKTITISESGVIGSTLNST TQTPNKIAISPLKSPNKAVKSTVQTITVGGVSTSQFKTIIPLATAPNVQQIQVPGSKFHY VRLVTATSASSSTQPVSQNPSTNTQPLQQAKPVVVNTTPVRMSVPIVSAQAVKQVVPKPI NPTSQIVTTSQPQQRLIMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVGYAVLPAQYVTQ ACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYE >gi568815594f:82935225_83175427|GENSCAN_predicted_CDS_1|1578_bp atggaggtggtgccagctgaggtgaatagtttgcttccagaggaaataatggacactggt ataactttagtggatgatgatagtattgaggctgttattgtttcatccccaattcccatg gagacagaactggaagaaattgtcaacataaattctactggtgactctacagccacgccc atttccacggaaccaatcacagtgtacagtaaccacactaaccaagttgcagtgaatacc acaattactaaagcagattctaataccacagtgaaaccagcttttccaagtggccttcaa aaacttggtgctcagactcctgtgactatatcagccaatcagattattttaaacaaagta tcacagacatctgatcttaaacttggcaatcagacccttaaaccagatggacagaagtta attttaacaactttgggcaagtctggttcaccaattgttttagcactaccccatagccaa ctaccccaggctcagaaagttacaactcaggcccagtcaggagatgctaagttaccaccg cagcaaattaaagtagttaccattggagggaggccagaggtgaaacctgtcattggtgtc tcagcattgaccccaggaagtcaactgattaatactacaactcagccctctgtgttacag acccaacagttaaaaacagtacagattgctaagaagcctcgaacgccaacctctggtcca gtaatcacgaagctgatctttgcaaaaccaattaatagtaaagcagttacaggacagaca actcaagtttcaccaccagttattgcaggtagggttctttcacagtctactcccggaact ccatcaaagaccataacaatatctgaaagtggtgttattggatcaactttaaattctaca acacagacaccaaataaaatagccatctcacctttgaaatcgccaaataaggcagtgaaa tcaactgtgcagaccatcactgttggaggagtgagcacatcacagtttaagacaattatt cctctggcaactgctcccaatgtccagcagattcaagtgcctggaagcaagtttcattat gtccgacttgttactgccacatcagccagtagctcaacccagccagttagtcagaatccc agtacaaacactcagcctcttcagcaagcaaagccagtggttgttaatacaaccccagtg cggatgtcagttccaattgtctcagctcaggctgtcaaacaagttgttccaaaaccaatc aatccaacttcacaaatagtaactactagccagccacagcaacggcttatcatgcctgcc acaccactgccacagatccagcccaacctcactaacctgccaccaggcactgtcctggca ccagctccgggaacagggaatgtgggttatgcagtgcttccagctcagtatgttactcag gcatgccttgacagaaatccagaagcctttaagcctaagatagggaaaggaaaggaggga gaatctgatcgacgtcatagcaaagggtgtaattgcaaacgatcaggatgtcttaaaaac tactgtgaatgctatgag >gi568815594f:82935225_83175427|GENSCAN_predicted_peptide_2|191_aa MPSPNRLSMQMKVPYSGKECHKRHLLLWPAASFPSLSKLERSRVPGDREGSRGGDVAVAA ASGMSGAGIVFRKIRHFHLVITITAQQLPRQPEPGPPPPPPPPVTSPFLLLPSPPAPPGR RTPTYPGKPPAAPPTFKRTRAVQVHTGYAETFHPRAEHGPPQGRQRKSRHLRGPKSGIFL FARPKAMRRVA >gi568815594f:82935225_83175427|GENSCAN_predicted_CDS_2|576_bp atgcccagcccaaacaggctttcaatgcagatgaaagtgccttattctggaaaagaatgc cataaaagacatttattactctggccagcagcttcctttcccagtttgtccaaactggag cgctccagggtacccggggaccgagaagggagccggggtggcgacgtcgccgtcgccgcc gcctctggtatgtcaggggccgggattgtatttcgaaagatccgccattttcacctcgtc atcaccatcacagctcagcagcttccccgacagccggagcccgggccgccgccgccgccg ccaccaccagtaacctcccccttcctcctcctcccctccccgcccgccccacccgggagg cgcacacccacatatccggggaagcctcccgccgccccgccaaccttcaaacgcacaagg gccgtgcaagtgcacacgggctacgccgagacctttcacccccgggcggagcacgggccc ccgcagggacgccaaaggaagtcgcgccacctgcggggtcccaagtccggcatatttttg tttgcaagaccaaaagccatgcggagggtggcataa >gi568815594f:82935225_83175427|GENSCAN_predicted_peptide_3|406_aa MAAAVRQDLAQLMNSSGSHKDLAGKYRQILEKAIQLSGAEQLEALKAFVEAMVNENVSLV ISRQLLTDFCTHLPNLPDSTAKEIYHFTLEKIQPRVISFEEQVASIRQHLASIYEKEEDW RNAAQVLVGIPLETGQKQYNVDYKLETYLKIARLYLEDDDPVQAEAYINRASLLQNESTN EQLQIHYKVCYARVLDYRRKFIEAAQRYNELSYKTIVHESERLEALKHALHCTILASAGQ QRSRMLATLFKDERCQQLAAYGILEKMYLDRIIRGNQLQEFAAMLMPHQKATTADGSSIL DRAVIEHNLLSASKLYNNITFEELGALLEIPAAKAEKIASQMITEGRMNGFIDQIDGIVH FETREALPTWDKQIQSLCFQVNNLLEKISQTAPEWTAQAMEAQMAQ >gi568815594f:82935225_83175427|GENSCAN_predicted_CDS_3|1221_bp atggcggcagccgtgcgacaggatttggcccagctcatgaattcgagcggctctcataaa gatctggctggcaagtatcgtcagatcctggaaaaagccattcagttatctggagcagaa caactagaagctttgaaagcttttgtggaagcaatggtaaatgagaatgtcagtctcgtg atctcgcggcagttgctgactgatttttgcacacatcttcctaacttgcctgatagcaca gccaaagaaatctatcacttcaccttggaaaagatccagcctagagtcatttcatttgag gagcaggttgcttccataagacagcatcttgcatctatatatgagaaagaagaagattgg agaaatgcagcccaagtgttggtgggaattcctttggaaacaggacaaaaacagtacaat gtagattataaactggagacttacttgaagattgctaggctatatctggaggatgatgat ccagtccaggcagaggcttacataaatcgagcatcgttgcttcagaatgaatcaaccaat gaacaattacagatacattataaggtatgctatgcacgtgttcttgattatagaagaaaa ttcattgaagctgcacaaaggtacaatgagctctcttacaagacaatagtccacgaaagt gaaagactagaggccttaaaacatgctttgcactgtacgatcttagcatcagcagggcag cagcgttctcggatgctagctactctttttaaggatgaaaggtgccagcaacttgctgcc tatgggatcctagagaaaatgtatctagataggatcatcagaggaaatcaacttcaagaa tttgctgccatgctgatgcctcaccaaaaagcaactacagctgatggttccagcatcttg gacagagctgttattgaacacaatttgttgtctgcaagcaaattatataataatattacc ttcgaagaacttggagctcttttagagatccctgcagctaaggcggaaaagatagcatct caaatgataaccgaaggacgtatgaatggatttattgaccagattgatggaatagttcat tttgaaacacgagaagccctgccaacgtgggataagcagatccaatcactttgtttccaa gtgaataaccttttggagaaaattagtcaaacagcaccagaatggacagcacaagccatg gaagcccagatggctcagtga >gi568815594f:82935225_83175427|GENSCAN_predicted_peptide_4|497_aa MWKQLWNGVTGRGWNSLEGSEEDKKMWEILELPRDLLNGFDKNANSDMNNKVQAEMVSDG DEELVGNWSKGDSSYVLAKRLVAFCPCPRDLWNFELERDDLGYLAEEISKQQSIQEVTWV LLKAFSFIRETEHKSLENLQPDNAIEKKNPFSEEKFKPAAEICISNEASDVNTQDNRENV SRGQHRAQAVASEGASLKLWQLPRGVESVSTQMSRIEVWEPPPRFQKMYGNAWMSKQRSA ARAGLLWRTLLLVWKGNVGLELPHRVPTWTLPSEAVRRGPPSSGAQNGRSTDSLHCPLGK AANTQHQPVKASGREAKASRREAVTGAELPKTMGTPVLHQCDLDMRHEVKGDHFGALRFA CPAGFRTCMAPDQDQGEANKVPTGTEFKKALTLDETGGPKCPALKMQAQAPVVVVTQPGV GPGPAPQNSNWQTGMCDCFSDCGVCKCVGKEPPYLIPSCVVVLRISLRAREGEHSNCDAL NSVLSCYGRKENHTQLS >gi568815594f:82935225_83175427|GENSCAN_predicted_CDS_4|1494_bp atgtggaagcaactttggaatggggtaacaggcagaggttggaacagtttggagggctca gaagaagacaagaaaatgtgggaaattttggaacttcctagagatttgttgaatggcttt gacaaaaatgccaatagtgatatgaacaataaagtccaggctgagatggtctcagatgga gatgaggaacttgttgggaactggagcaaaggtgactcttcttatgttttagcaaagaga ctggtggcattttgcccctgccctagagatttgtggaattttgaacttgagagagatgat ttagggtatctagcggaagaaatttctaagcagcaaagcattcaagaggtgacttgggtg ctgttaaaggcatttagttttataagggaaacagagcataaaagtttggaaaatttgcag cctgacaatgcaatagaaaagaaaaaccctttttctgaggagaaattcaagccagctgca gaaatttgcataagtaatgaggcgtcggatgttaatacccaagacaacagggaaaatgtc tccaggggccaacacagagctcaggccgtggcttcagagggtgcaagcctcaagctttgg cagcttccacgtggtgttgagtctgtgagtacacagatgtcaagaattgaggtttgggaa cctccacctagatttcagaagatgtatggaaatgcctggatgtccaagcagaggtctgct gcaagggcagggctcttgtggagaacccttctgctagtgtggaagggaaatgtgggattg gagctcccacacagagtccctacttggacactgcctagtgaagctgtgagaagagggcca ccatcctccggagcccagaatggtagatccactgacagcttacactgtccacttggaaaa gctgcaaacactcaacaccagcccgtgaaagcatctgggagggaggcgaaagcatctagg agggaggctgtaacaggggctgaactgcccaagaccatgggaacccccgtcttgcatcag tgtgacctggatatgagacatgaagtcaaaggagatcattttggagctttaaggtttgcc tgccctgctggatttcggacttgtatggccccggaccaggaccagggtgaggcaaacaag gtccctacgggcacagaatttaagaaggcactcactcttgatgaaactggaggaccaaaa tgccctgcactgaaaatgcaagctcaggcgccggtggtcgttgtgacccaacctggagtc ggtcccggtccggccccccagaactccaactggcagacaggcatgtgtgactgtttcagc gactgcggagtctgtaagtgtgtggggaaagaacccccttatttgataccgagctgtgtt gtggtactgaggatatctctgcgtgcacgggagggagaacacagcaactgtgatgcactg aactcagtgctgtcctgttacggcaggaaggaaaaccacacccaactcagctga >gi568815594f:82935225_83175427|GENSCAN_predicted_peptide_5|410_aa MAESEQLQSTALSVSDGEDRCPSEMKLPDEQSGSNICHSAIFAVLQHLLVIPRQTGSGVD LQQTPKDLQLRVLTVRRKTNKQKGHPHQNPICTSPSSKTKERSSLPAMEQSWMENDFENL REEGFRRSVITNFSELKEDVRIHCKEAKNLEKTLDEWLTRKNSIEKTLNDQMEVKTMARE LRDACTRLSSRFDQLEERVSVIEDQMNEMKQEEKFREKRVKRNEQSLQEIWDYVKRPNLH LIGVLESDRENGTKLENTLQDIIQENFPNLARQANIQIQEIQRMPQRYSSRRATPRHIIV SFTEVEMKEKMLRAAREKGWVTHKGKPIRLTVDLLAKTLQARRQWRPIFNILKEKNYQPR ISYPAKLGFVSEGEIKSFTEKQTLRDFVTTRPALQELLKEALNMERNNRY >gi568815594f:82935225_83175427|GENSCAN_predicted_CDS_5|1233_bp atggccgaatcggaacagctccagtctacagctctcagcgtgagcgacggagaagacagg tgcccctctgagatgaagcttccagacgaacaatcaggcagcaacatttgccattctgca atatttgcggttctgcagcatctgctggtgatacccaggcaaacagggtctggagtggac ctccagcaaactccaaaagacctgcagctgagggtcctgactgttagaaggaaaactaac aaacagaaaggacatccacatcaaaaccccatctgtacatcaccatcatcaaagaccaaa gaacgcagctccttgccagcaatggaacaaagctggatggagaatgactttgaaaatttg agagaagaaggcttcagaagatcggtaataacaaacttctctgagctaaaggaagatgtt cgaatccattgcaaagaagctaaaaaccttgagaaaacattggatgaatggctaactaga aaaaatagcatagagaaaaccttaaatgaccagatggaggtgaaaaccatggcacgagaa ctacgtgacgcatgcacaaggttaagtagccgttttgatcaacttgaagaaagggtatca gtgattgaagatcaaatgaatgaaatgaagcaagaagagaagtttagagagaaaagagta aaaagaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacat ctgattggtgtacttgaaagtgacagggagaatgggaccaagttggaaaacactcttcag gatattatccaggagaacttccccaacctagcaaggcaggccaacattcaaattcaggaa atacagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtc agcttcactgaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggttgg gttacccacaaagggaaacccatcagactaacagtggatctcttggcaaaaactctacaa gccagaagacagtggaggccaatattcaacattcttaaagaaaagaattatcaacccaga atttcatatccagccaaactaggcttcgtaagtgaaggagaaataaaatcctttacagaa aagcaaacgctgagagattttgtcaccaccaggcctgccttacaagagctcctgaaggaa gcactaaacatggaaaggaacaaccggtactag >gi568815594f:82935225_83175427|GENSCAN_predicted_peptide_6|273_aa MKNKREKNQIDAIKNDKGDITTNPTEIQTTIGEYLYKHLYTNKLEDLEEMDKFLDTYTLP RLNQEEVESLSRPKTGSEIEAIINSLPTKRSPGPDGFTAEFYQRYKEEVVPFLLKPFQSI EKEGILPNSFYEASIILIPKPGRDTTKKEDFRPISLMNIDAKILNKILANRIQQHIKKLI HHVQVGFIPGMQGWFNIRKSINVIHHINGTKDKNHLIISIDAEKAFDKIQQPFMINTLNK LGIDGTYLKIIRAIYDKPTANIILNGQNWKHSL >gi568815594f:82935225_83175427|GENSCAN_predicted_CDS_6|822_bp atgaagaataaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatatc accaccaatcccacagaaatacaaactaccattggagaatacttatataaacacctctac acaaataaactagaagatctagaagaaatggataaattcctggacacatacaccctccca agactaaaccaggaagaagttgaatccctgagtagaccaaaaacaggctctgaaattgag gcaataattaatagtctaccaaccaaaagaagtccaggaccagatggattcacagccgaa ttctaccagaggtacaaagaggaggtggtaccattccttctgaaaccattccaatcaata gaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctgataccaaag cctggcagggacacaacaaaaaaagaggattttagaccaatatccctgatgaacattgat gcgaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaaacttatc caccatgttcaagtgggcttcatccctgggatgcaaggctggttcaacatacgcaaatca ataaatgtaatccatcatataaatggaaccaaagacaaaaaccacttgattatctcaata gatgcagaaaaggcctttgacaaaattcaacagcccttcatgataaatactctcaataaa ttaggtattgatgggacatatctcaaaataataagagctatctatgacaaacccacagcc aatatcatactgaatgggcaaaactggaagcattccctttga