GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:53:57 Sequence gi568815578r:36898762_37195928 : 297167 bp : 44.56% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 PlyA - 1500 1495 6 -0.45 1.12 Term - 3378 3293 86 1 2 96 47 50 0.370 -0.48 1.11 Intr - 12572 12457 116 1 2 42 110 100 0.963 7.69 1.10 Intr - 13791 13700 92 0 2 103 64 82 0.988 6.09 1.09 Intr - 18069 17961 109 2 1 76 121 64 0.998 8.79 1.08 Intr - 18288 18188 101 0 2 82 85 21 0.979 0.11 1.07 Intr - 20758 20603 156 1 0 64 111 107 0.907 10.81 1.06 Intr - 28491 28421 71 1 2 60 92 23 0.833 -1.20 1.05 Intr - 32114 31999 116 1 2 99 111 53 0.983 8.79 1.04 Intr - 36428 36268 161 2 2 43 90 128 0.993 7.29 1.03 Intr - 42350 42278 73 1 1 37 116 26 0.132 -0.29 1.02 Intr - 48043 47977 67 2 1 93 116 8 0.160 2.06 1.01 Init - 52882 52675 208 1 1 91 59 119 0.245 8.28 1.00 Prom - 59664 59625 40 -3.46 2.00 Prom + 60688 60727 40 -6.86 2.01 Init + 61315 61378 64 0 1 94 53 46 0.451 3.01 2.02 Intr + 64252 64285 34 2 1 81 110 59 0.589 4.68 2.03 Intr + 85129 85198 70 1 1 94 100 24 0.294 3.38 2.04 Intr + 94385 94762 378 1 0 4 47 199 0.114 2.56 2.05 Intr + 95689 95805 117 0 0 69 54 114 0.744 6.76 2.06 Term + 95939 96109 171 1 0 11 48 61 0.185 -7.77 2.07 PlyA + 97021 97026 6 1.05 3.39 PlyA - 97621 97616 6 1.05 3.38 Term - 100168 99998 171 1 0 86 35 204 0.992 12.73 3.37 Intr - 105030 104941 90 0 0 43 116 66 0.805 5.09 3.36 Intr - 108798 108650 149 1 2 68 80 62 0.760 3.35 3.35 Intr - 136747 136481 267 2 0 92 52 105 0.826 4.80 3.34 Intr - 141524 141392 133 2 1 87 92 66 0.993 7.12 3.33 Intr - 145489 145325 165 1 0 121 81 67 0.986 9.46 3.32 Intr - 148429 148292 138 1 0 128 100 21 0.708 7.96 3.31 Intr - 162508 162413 96 1 0 96 39 61 0.249 2.21 3.30 Intr - 163509 163323 187 2 1 58 24 183 0.472 8.59 3.29 Intr - 166712 166663 50 2 2 73 67 23 0.949 -3.82 3.28 Intr - 168123 167963 161 1 2 65 79 142 0.992 10.71 3.27 Intr - 168360 168232 129 1 0 69 116 9 0.820 2.47 3.26 Intr - 173982 173835 148 0 1 64 19 73 0.001 -2.19 3.25 Intr - 196478 196340 139 1 1 98 27 79 0.630 3.27 3.24 Intr - 197264 196998 267 1 0 24 75 353 0.140 24.25 3.23 Intr - 204074 203939 136 0 1 67 28 112 0.167 2.73 3.22 Intr - 209903 209838 66 0 0 91 127 38 0.970 6.88 3.21 Intr - 213685 213559 127 1 1 88 47 64 0.792 2.55 3.20 Intr - 215384 215229 156 2 0 95 94 140 0.999 15.51 3.19 Intr - 216576 216412 165 0 0 38 58 132 0.469 5.36 3.18 Intr - 221857 221743 115 0 1 92 94 30 0.730 4.45 3.17 Intr - 225038 224848 191 2 2 104 29 156 0.476 9.68 3.16 Intr - 230406 230271 136 2 1 58 61 82 0.874 3.07 3.15 Intr - 239209 239049 161 1 2 111 92 12 0.539 2.69 3.14 Intr - 242553 242401 153 0 0 62 92 50 0.842 3.07 3.13 Intr - 245080 244956 125 2 2 84 90 96 0.996 9.70 3.12 Intr - 249148 249013 136 1 1 84 56 36 0.931 0.14 3.11 Intr - 256406 256302 105 2 0 98 105 83 0.910 11.41 3.10 Intr - 259194 259099 96 0 0 81 92 15 0.709 1.41 3.09 Intr - 260275 260131 145 0 1 52 78 83 0.956 3.98 3.08 Intr - 261415 261336 80 1 2 136 64 78 0.978 8.55 3.07 Intr - 262905 262868 38 1 2 83 100 29 0.698 1.58 3.06 Intr - 268779 268696 84 1 0 50 70 77 0.568 1.89 3.05 Intr - 273296 273162 135 0 0 28 89 124 0.843 7.04 3.04 Intr - 275355 275234 122 2 2 112 98 88 0.994 12.34 3.03 Intr - 280702 280462 241 2 1 102 82 67 0.327 4.11 3.02 Intr - 285374 285164 211 2 1 52 84 93 0.001 3.79 3.01 Init - 293749 293705 45 1 0 63 96 43 0.068 3.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 42283 42278 6 1 0 63 116 10 0.812 1.61 S.002 Intr + 285419 285612 194 0 2 47 116 216 0.857 19.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:36898762_37195928|GENSCAN_predicted_peptide_1|451_aa MQRADSEQPSKRPRCDDSPRTPSNTPSAEADWSPGLELHPDYKTWGPEQVCSFLRRGGFE EPVLLKNIRENEITGALLPCLDESRFENLGVSSLGERKKLLSYIQRLVQIHVDTMKVIND PIHGHIELHPLLVRIIDTPQFQRLRYIKQLGGGYYVFPGASHNRFEHSLGVGYLAGCLVH ALGEKQPELQISERDVLCVQIAGLCHDLGHGPFSHMFDGRFIPLARPEVKWTHEQGSVMM FEHLINSNGIKPVMEQYGLIPEEDICFIKEQIVGPLESPVEDSLWPYKGRPENKSFLYEI VSNKRNGIDVDKWDYFARDCHHLGIQNNFDYKRFIKFARVCEVDNELRICARDKEVGNLY DMFHTRNSLHRRAYQHKVGNIIDTMITDAFLKADDYIEITGAGGKKYRISTAIDDMEAYT KLTELQPCFPPPAVFILMSNPTFGSYFQDKL >gi568815578r:36898762_37195928|GENSCAN_predicted_CDS_1|1356_bp atgcagcgagccgattccgagcagccctccaagcgtccccgttgcgatgacagcccgaga accccctcaaacaccccttccgcagaggcagactggtccccgggcctggaactccatccc gactacaagacatggggtccggagcaggtgtgctccttcctcaggcgcggtggctttgaa gagccggtgctgctgaagaacatccgagaaaatgaaatcacaggcgcattactgccttgt cttgatgagtctcgttttgaaaatcttggagtaagttccttgggggagaggaagaagctg cttagttatatccagcgattggttcaaatccacgttgatacaatgaaggtaattaatgat cctatccatggccacattgagctccaccctctcctcgtccgaatcattgatacacctcaa tttcaacgtcttcgatacatcaaacagctgggaggtggttactatgtttttccaggagct tcacacaatcgatttgagcatagtctaggggtggggtatctagcaggatgtctagttcac gcactgggtgaaaaacaaccagagctgcagataagtgaacgagatgttctctgtgttcag attgctggactttgtcatgatctcggtcatgggccattttctcacatgtttgatggacga tttattccacttgctcgcccggaggtgaaatggacgcatgaacaaggctcagttatgatg tttgagcaccttattaattctaatggaattaagcctgtcatggaacaatatggtctcatc cctgaagaagatatttgctttataaaggaacaaattgtaggaccacttgaatcacctgtc gaagattcattgtggccatataaagggcgtcctgaaaacaaaagcttcctttatgagata gtatctaataaaagaaatggcattgatgtggacaaatgggattattttgccagggactgc catcatcttggaatccaaaataattttgattacaagcgctttattaagtttgcccgtgtc tgtgaagtagacaatgagttgcgtatttgtgctagagataaggaagttggaaatctgtat gacatgttccacactcgcaactctttacaccgtagagcttatcaacacaaagttggcaac attattgatacaatgattacagatgctttcctcaaagcagatgactacatagagattaca ggtgctggaggaaaaaagtatcgcatttctacagcaattgacgacatggaagcctatact aagctgacagaactccaaccatgcttcccacctccagctgtattcatcctcatgagtaac ccaacctttggctcctatttccaagataaactatag >gi568815578r:36898762_37195928|GENSCAN_predicted_peptide_2|277_aa MRPIHTSAITGNILSIRRMPSHSLQQLQDFDHRPLESALPYTKPVSSPSRLCLLEEKWGA AEKIPEYVEVTLELGNRQRLEQFGGSEEDRKMWDSFEFPRNLLNGFDQNADNNMDNEIQH EVLLDGDKKLVVNWSKGDSCYVLADLLVAFCPCPRDLWNFELQRDDLRYLVEEISKQQSI QEPGNLVPYVLAAPAVTEGVNVELGPWLQSMYASSLGSFHMCGKEMWGWSPHKESLLGHC LVELREEGHCPPDPRMIDPLIACTVCLEKSQTLNTIL >gi568815578r:36898762_37195928|GENSCAN_predicted_CDS_2|834_bp atgaggccgattcacacttcagctatcacaggaaatatcctctccatacggcgtatgccg agtcatagcttgcagcagctgcaagattttgaccacagacctttggaatcagccctgccc tacacaaagcccgttagttctccctccaggctctgcttgttagaggaaaagtggggtgct gctgaaaagatacctgaatatgtggaagtgactttggaactgggaaataggcagaggttg gaacaatttggagggtcagaagaagacaggaaaatgtgggacagttttgaatttcctaga aacttgctgaatggctttgatcaaaatgctgataataatatggacaatgaaattcagcat gaagtgctcttagatggagataagaaacttgttgtgaactggagcaaaggtgactcttgt tatgttttagcagatctactggtggcattttgcccctgccctagagatctgtggaacttt gaacttcagagagatgatttaaggtatctggtggaagaaatttctaagcagcaaagcatt caagagcctgggaacttggtgccctacgtcctagctgctccagccgtgactgaaggggtc aatgtagagcttgggccatggcttcagagcatgtacgcctcaagccttggcagctttcac atgtgtggaaaagagatgtggggttggagtccccacaaagagtcgctactggggcactgc ctagtggagctgagagaagagggccactgtcctccagaccccagaatgatagatccactg atagcttgcactgtgtgcctggaaaaatcgcagacactcaacaccatcctgtga >gi568815578r:36898762_37195928|GENSCAN_predicted_peptide_3|1752_aa MAALQCKIVAFLADRVLYTVPMMMHTKIWEPNHMGKGEGIQGFNLRRSLGKAGSLLFNWW KDSPPSSLQASKAQALSYYSHLKDVWEGDPAWPALERRLVTSRGRQPSPQSHAQLLTRRR HSSEQVPPESEPRADFRSGKWLQEPATGDARDSRQALRARMSSKHRICSQEEVVIPCAYD SDSESVDLELSNLEIIKKGSSSIELTDLDIPDIPGLHCEPLSHSPRHLTQQDPLSEAIVE KLIQSIQKVFNVPDSSRNCLGNLGYKDKEDKIPIYAAKQGKRNPLEAAETQKVLVQEERP HSLSSSMRQEVFVTIADLSYQDVHLLLGSEDRAELFSLTIKSIITLPSVRTLTQIQEIMP NGTCNTECLYRQTFQAFSEMLQSLVVKDPHLENLDTIIKHLVPWLQSVKDHERERATASM AQVLKCLSKHLNLKLPLRFQRLGHLVALMALLCGDPQEKVAEEAAEGIHSLLHITLRLKY ITHDKKDQQNLKRALTKCREFLELHSSAAKCFYNCPFRIAQVFEGFLDSNELCQFIMTTF DTLKTLKHPCIQRSAGELLLTLAKNTESQFEKVPEIMGVICAQLSIISQPRVRQQIINTV SLFISRPKYTDIVLSFLLCHPVPYNRHLAEVWRMLSVELPSTTWILWRLLRKLQKCHNEP AQEKMAYVAVAATDALYEVFLGNRLRAATFRLFPQLLMTLLIQIHHSIGLTMSDVDIPSG LYTEQEVPSEVTPLCALLERNQLLAQKVMYLLVPLLNRGNDKHKLTSAGFFVEREDIKSL LPYIVDSLRETDEKIVLSAIQILLQLVRTMDFTTLAAMMRTLFSLFGDVRSDVHRFSVTL FGAAIKSVKNPDKKSIENQVLDSLVPLLLYSQDENDAVAEESRQVLTICAQFLKWKLPQE VYSKDPWHIKPTEAGTICRFFVCLLSKYMDHNELRRMGTDWIEDDLRDLLCDPEPSLCII ASQTLLLVQMARAEPKPKQRVNWLQKLMGRFSRALAQVVVGSAPGREKEVGGRGAQPAGP EGMFEDKPHAEGAAVVAAAGEALQALCQELNLDEGSAAEALDDFTAIRGNYSLEVSGSRV ELPVAVQQFDQSQCAHEADLSPTFPPPSEHRPDALHQCLALNKTNEIDPALRIHHFFTSL HQQLQIQNLKQMYLIKDIRLYAFILASSKADLFSGNFRMIGDDLVNSYHLLLCCLDLIFA NAIMCPNRQDLLNPSFKGLPSDFHTADFTASEEPPCIIAVLCELHDGLLVEAKGIKEHYF KPYISKLFDRKILKGECLLDLSSFTDNSKAVNKEYEEYVLTVGDFDERIFLGADAEEEIG TPRKFTRDTPLGKLTAQANVEYNLQQHFEKKRSFAPSTPLTGRRYLREKEAVITPVASAT QSVLLEQDIFHRSLMACCLEIVLFAYSSPRTFPWIIEVLNLQPFYFYKVIEVVIRSEEGL SRDMVKHLNSIEEQILESLAWSHDSALWEALQVSANKVPTCEEVIFPNNFETGNGGNVQG HLPLMPMSPLMHPRVKEVRTDSGSLRRDMQPLSPISVHERYSSPTAGSAKRRLFGEDPPK EMLMDKIITEGTKLKIAPSSSITAENVSILPGQTLLTMATAPVTGTTGHKVTIPLHDLED ATKTPDCSSGPVKEERGDLIKFYNTIYVGRVKSFALKYDLANQDHMHSIYISPHKNGSGL TPRSALLYKFNGSPSKSLKDINNMIRQGEQRTKKRVIAIDSDAESPAKRVCQENDDVLLK RLQDVVSERANH >gi568815578r:36898762_37195928|GENSCAN_predicted_CDS_3|5259_bp atggcagcactccagtgcaagattgtggctttcttggcagacagggtgctatacacagtc ccaatgatgatgcacaccaagatctgggaaccaaatcacatggggaagggcgagggaatc caggggtttaacctgaggagaagccttgggaaggcagggagcctgctcttcaactggtgg aaggacagtcctcccagcagcctccaagcatctaaagcacaggccttatcctactattcc catctgaaagatgtctgggaaggggatcccgcctggccggctctcgagcggcgactagta acctcccgcgggcgacagccctctccccaaagccacgcgcaactcctcacccggcggcgc cattcctccgagcaggtcccgccggagtccgagccgcgggctgacttccgctcgggaaag tggctccaagagcccgccacaggggacgccagggactcgcggcaggctctgcgggccagg atgagttctaagcacaggatctgtagtcaggaagaagtagtgatcccctgtgcctatgac agtgattcagaaagtgtggatttggagctgagcaacttagagattattaaaaaaggctca agtagcattgaactgacagacttggacatccctgacatccctggactccattgtgagccc ctgtcacatagccccagacacctgacccaacaggacccgctcagtgaggccattgttgag aaactgatccagtccatccagaaggttttcaatgtgcctgacagttccaggaactgtctt gggaatttgggctacaaagacaaagaagacaaaatccctatttatgcagccaagcaaggt aagagaaatcctctagaagcagctgaaacacaaaaggtactggtacaagaggaacgcccg cattctctgtccagttccatgcgccaggaggtctttgtcaccatcgctgatctcagttac caagatgtccatttgctgttgggctctgaagatcgagctgagttgttcagtcttaccatc aagagtataatcactctgccctctgtaaggacccttacccagatacaggaaatcatgccc aatgggacctgcaacacagagtgtctttacaggcagacgtttcaggcattctctgagatg ctccagagtttggtggtaaaagacccacatttggaaaatcttgacaccattattaagcac ttggtcccctggttacagtcagtcaaagaccatgagcgggaacgggccacggccagcatg gctcaagttctgaagtgcctatccaaacatctcaacttgaagcttccactgcgattccaa agacttggacacctagtggctctgatggcactgctctgtggggacccacaggaaaaggtg gctgaggaggctgcagagggcattcactccctgctgcatatcaccctgaggctgaagtat atcactcatgacaagaaagatcagcaaaacttgaaaagagcattgacaaaatgtcgagaa ttcctggagctccacagctctgccgctaaatgcttctacaactgtcccttcagaattgcc caggtctttgaaggttttcttgattcaaatgagctctgccagtttataatgactacattt gataccctgaaaaccctgaaacatccctgcatccagcgatcagcaggagaattactgcta actttggcaaaaaatacagagtcccaatttgagaaggtgccagaaattatgggagttatc tgtgcccagttatccataatcagccagcctagagtccgccaacaaatcataaataccgtg agtttatttatatccagacccaagtacacagatatagtgctcagcttccttctgtgtcat ccagtgccgtataacaggcacctggctgaggtgtggagaatgctgtcggtggagcttccc agcacgacctggattctgtggaggctcctgaggaagctgcagaaatgccataatgagcct gcacaggagaagatggcatatgtggctgtggctgcaacagatgccctttatgaggtgttt ttgggaaacaggcttcgagcagctacgttccgactctttcctcagcttctcatgacactg cttatccagattcatcacagcatcggcctcaccatgtctgatgtcgacatcccaagtggc ctgtacacagaacaggaagtgccttcagaggtcacccctttgtgcgcattgctggaaaga aatcagctccttgcacagaaggtcatgtacttattagtccctcttcttaaccgagggaat gataaacataaactcacatctgcaggcttttttgtggagagagaagacatcaagagcctg ttgccatacattgtagacagcttgcgtgaaaccgatgagaagatcgttctgtcagccatc cagatactcctgcaacttgttagaacaatggatttcactaccctggctgccatgatgagg accctgttctccttatttggtgatgtgagatctgatgttcatcgtttctccgtgactctc tttggagccgccataaagtctgtaaaaaacccagataagaagagtatagagaaccaagtc ctggacagcttggtcccactacttctgtattctcaggatgaaaatgatgcagtagctgag gagagcaggcaagtcctaactatatgtgcccagttcctgaagtggaagctgccccaagaa gtgtactccaaagatccctggcacatcaaacctactgaagcaggaacaatctgcagattc tttgtatgccttttatcgaagtacatggatcacaatgagctcaggaggatgggtactgac tggatagaggacgatctgagagacctgctgtgtgaccctgagccctcgctgtgcatcatc gcttcccagactctgttactagtccagatggcgagggccgaaccaaaacctaagcagaga gtgaactggttgcagaagctcatgggcagattttcgcgcgctttggcgcaggtggttgtg ggtagcgcgcctgggagggagaaagaagtcgggggccgtggcgcgcagcccgcggggcct gaagggatgttcgaggacaagccccacgctgagggggcggcggtggtcgccgcagccggg gaggcgctacaggccctgtgccaggagctgaacctggacgaggggagcgcggccgaagcc ctggacgactttactgccatccgaggcaactacagcctagaggtgagcggcagcagggtg gagctgccggtcgctgtgcagcagtttgatcaaagccaatgtgcacacgaagctgatctc agccctacgtttcctcctccgtcagagcatcgtcctgatgctttgcatcagtgtctggct ctgaataaaacaaatgagattgatcctgctctaagaattcaccacttcttcacttctctg caccagcagcttcagattcaaaatctcaagcagatgtatctgataaaggatatcagatta tatgccttcatcctagcttccagtaaagctgatctgtttagtggtaattttcggatgatt ggggatgacttagtaaactcttatcatttacttctatgctgcttggatctgatttttgcc aatgcgattatgtgcccaaatagacaagacttgctaaatccatcatttaaaggtttacca tctgattttcatactgctgactttacggcttctgaagagccaccctgcatcattgctgta ctgtgtgaactgcatgatggacttctcgtagaagcaaaaggaataaaggagcactacttt aagccatatatttcaaaactctttgacaggaagatattaaaaggagaatgcctcctggac ctttcaagttttactgataatagcaaagcagtgaataaggagtatgaagagtatgttcta actgttggtgattttgatgagaggatctttttgggagcagacgcagaagaggaaattgga acacctcgaaagttcactcgtgacaccccattagggaaactgacagcacaggctaatgtg gagtataaccttcaacagcactttgaaaaaaaaaggtcatttgcaccttctaccccactg accggacggagatatttacgagaaaaagaagcagtcattactcctgttgcatcagccacc caaagtgttcttttagagcaagatatatttcatcgttccttgatggcttgttgtttggaa attgtgctctttgcctatagctcacctcgtacttttccttggattattgaagttctcaac ttgcaaccattttacttttataaggttattgaggtggtgatccgctcagaagaggggctc tcaagggacatggtgaaacacctaaacagcattgaagaacagattttggagagtttagca tggagtcacgattctgcactgtgggaggctctccaggtttctgcaaacaaagttcctacc tgtgaagaagttatattcccaaataactttgaaacaggaaatggaggaaatgtgcaggga catcttcccctgatgccaatgtctcctctaatgcacccaagagtcaaggaagttcgaact gacagtgggagtcttcgaagagatatgcaaccattgtctccaatttctgtccatgaacgc tacagttctcctaccgcagggagtgctaagagaagactctttggagaggaccccccaaag gaaatgcttatggacaagatcataacagaaggaacaaaattgaaaatcgctccttcttca agcattactgctgaaaatgtatcaattttacctggtcaaactcttctaacaatggccaca gccccagtaacaggaacaacaggacataaagttacaattccattacatgacttagaagat gctacaaaaacacctgactgttccagtggaccagtgaaagaggaaagaggtgatcttata aaattttacaatacaatatatgtaggaagagtgaagtcatttgcactgaaatacgacttg gcgaatcaggaccatatgcactccatttatatttccccgcacaagaatgggtcaggcctt acaccaagaagcgctctgctgtacaagttcaatggcagcccttctaagagtttgaaagat atcaacaacatgataaggcaaggtgagcagagaaccaagaagcgagtaatagccatcgat agtgatgcagaatcccctgccaaacgcgtctgtcaagaaaatgatgacgttttactgaaa cgactacaggatgttgtcagtgaaagagcaaatcattaa