GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:21:15

Sequence gi568815586r:120920564_121138971 : 218408 bp : 47.22% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -   2368   2363    6                               1.05
 1.05 Term -  10952  10848  105  2  0   77   43    52 0.067  -1.99
 1.04 Intr -  21379  21193  187  0  1   70   37   153 0.030   8.09
 1.03 Intr -  35001  34877  125  0  2   -2   63   122 0.029   0.08
 1.02 Intr -  48486  48385  102  0  0  103   45    43 0.415   1.87
 1.01 Init -  49098  49096    3  0  0  113   81     0 0.471   1.80
 1.00 Prom -  55754  55715   40                              -7.16

 2.00 Prom +  57412  57451   40                              -3.26
 2.01 Init +  58206  58531  326  2  2   83   64   486 0.830  40.50
 2.02 Intr +  59107  59194   88  1  1   69   55    69 0.612   1.67
 2.03 Intr +  66457  66515   59  0  2  132   48     9 0.282  -1.02
 2.04 Intr +  68270  68469  200  2  2   87   94   346 0.410  34.09
 2.05 Intr +  72957  73143  187  1  1  100   55   166 0.861  13.25
 2.06 Intr +  73601  73842  242  2  2   59  127   248 0.888  22.99
 2.07 Intr +  75699  75850  152  1  2   79  105   137 0.999  14.38
 2.08 Intr +  75978  76179  202  2  1   97  111   187 0.857  20.76
 2.09 Intr +  76911  77102  192  1  0  110  101   245 0.991  27.46
 2.10 Intr +  78705  78826  122  1  2   65  110   269 0.999  27.01
 2.11 Intr +  78920  79064  145  1  1   95   94   126 0.902  13.76
 2.12 Term +  80502  80629  128  1  2   63   41   162 0.992   7.44
 2.13 PlyA +  82143  82148    6                              -0.45

 3.07 PlyA -  82318  82313    6                              -0.45
 3.06 Term -  83929  83590  340  0  1   90   46   443 0.970  34.31
 3.05 Intr -  84530  84440   91  0  1   60   94    52 0.991   2.05
 3.04 Intr -  85831  85758   74  0  2   89  101    72 0.998   7.65
 3.03 Intr -  90363  90265   99  2  0   58   97   190 0.997  16.13
 3.02 Intr -  90583  90541   43  2  1  119   79   -16 0.434  -2.10
 3.01 Init -  95911  95767  145  1  1   89   70   108 0.986   9.38
 3.00 Prom -  99494  99455   40                              -7.76

 4.07 PlyA -  99750  99745    6                               1.05
 4.06 Term - 100495  99998  498  1  0  133   29   657 0.987  58.82
 4.05 Intr - 103574 103427  148  1  1  106   90   197 0.596  21.94
 4.04 Intr - 107254 107013  242  1  2   90   69   268 0.996  21.45
 4.03 Intr - 111054 110879  176  1  2  105   94   216 0.999  23.56
 4.02 Intr - 113180 112898  283  2  1  106   82   278 0.999  25.99
 4.01 Init - 118408 118211  198  1  0   78  116   199 0.932  20.19
 4.00 Prom - 126389 126350   40                              -5.46

 5.06 PlyA - 127253 127248    6                               1.05
 5.05 Term - 128006 127822  185  0  2   54   43   105 0.357   0.31
 5.04 Intr - 128296 128262   35  0  2   87   36    47 0.110  -2.73
 5.03 Intr - 133258 133179   80  1  2   98   52    55 0.502   1.25
 5.02 Intr - 133482 133291  192  0  0   98   49   114 0.078   8.19
 5.01 Init - 147068 146961  108  2  0   92   31   127 0.283   7.62
 5.00 Prom - 152565 152526   40                              -2.86

 6.03 PlyA - 160836 160831    6                               1.05
 6.02 Term - 165952 165680  273  1  0   58   47   154 0.471   3.77
 6.01 Init - 168606 168481  126  0  0   32   65   106 0.244   2.58
 6.00 Prom - 172827 172788   40                              -3.76

 7.00 Prom + 173822 173861   40                              -3.66
 7.01 Init + 175184 175401  218  1  2   61   94    60 0.065   2.06
 7.02 Intr + 178111 178205   95  1  2   58   74    37 0.023  -1.09
 7.03 Term + 186757 186905  149  2  2   34   50   241 0.965  12.96
 7.04 PlyA + 188062 188067    6                               1.05

 8.06 PlyA - 191278 191273    6                               1.05
 8.05 Term - 196428 196333   96  0  0   54   54    80 0.555  -0.83
 8.04 Intr - 201075 200861  215  1  2   51   79    94 0.667   3.23
 8.03 Intr - 205056 204955  102  1  0   24  116    70 0.458   3.55
 8.02 Intr - 206083 205980  104  0  2   93   70    29 0.610   1.42
 8.01 Init - 207682 207525  158  1  2   87   79     8 0.547  -1.36
 8.00 Prom - 207744 207705   40                              -3.16

 9.00 Prom + 211854 211893   40                              -7.36
 9.01 Init + 212408 212532  125  1  2  108  111   148 0.986  18.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 139934 140008 75 1 0 117 55 57 0.821 3.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:120920564_121138971|GENSCAN_predicted_peptide_1|173_aa MNIPLTFSPLCLCASCSLYLETSICQAAAVFKIYLSTASASVKADSPHGHKVAATTHSGT FVSSGEKEELLTLISPEMRAKEFKMVEPYYAKEPNPSVHWKKTAQDFYLPGNICPESTSW KKTAQDFYLPGNICIRLCGWQQLFLAKSLALVAAIQLPLVSANFLILFLLPIL >gi568815586r:120920564_121138971|GENSCAN_predicted_CDS_1|522_bp atgaacatacccctaactttctcacctctgtgcctttgtgcaagctgctccctctacctg gaaacctccatttgtcaagctgctgctgtcttcaagatctacctgtctacagcatcagct tctgtgaaggctgattcccctcatggtcacaaagtggctgccaccactcactcaggcacc tttgtctcctctggggagaaggaggagctgctcaccctcatcagcccagagatgagggca aaagagtttaagatggtggagccttattatgcaaaggaacccaatcccagcgtccactgg aagaagactgcccaggacttctaccttcccgggaacatctgcccagagtccaccagctgg aagaagactgcccaggacttctaccttcctgggaacatctgcatcagactctgtgggtgg cagcagctgtttctggcaaaatccttagcgctggttgcagccattcagctccccttggtt tctgccaatttcctgatcctatttctcttgccaattctctaa >gi568815586r:120920564_121138971|GENSCAN_predicted_peptide_2|680_aa MVSKLSQLQTELLAALLESGLSKEALIQALGEPGPYLLAGEGPLDKGESCGGGRGELAEL PNGLGETRGSEDETDDDGEDFTPPILKELENLSPEEAAHQKAVVETLLQHPTSPAGAIRG CPFYIPIRWRTPRLLENWGDPQQRHDSQVASGPFESLWEDPWRVAKMVKSYLQQHNIPQR EVVDTTGLNQSHLSQHLNKGTPMKTQKRAALYTWYVRKQREVAQQFTHAGQGGLIEEPTG DELPTKKGRRNRFKWGPASQQILFQAYERQKNPSKEERETLVEECNRAECIQRGVSPSQA QGLGSNLVTEVRVYNWFANRRKEEAFRHKLAMDTYSGPPPGPGPGPALPAHSSPGLPPPA LSPSKVHGVRYGQPATSETAEVPSSSGGPLVTVSTPLHQVSPTGLEPSHSLLSTEAKLVS AAGGPLPPVSTLTALHSLEQTSPGLNQQPQNLIMASLPGVMTIGPGEPASLGPTFTNTGA STLVIGLASTQAQSVPVINSMGSSLTTLQPVQFSQPLHPSYQQPLMPPVQSHVTQSPFMA TMAQLQSPHALYSHKPEVAQYTHTGLLPQTMLITDTTNLSALASLTPTKQVFTSDTEASS ESGLHTPASQATTLHVPSQDPASIQHLQPAHRLSASPTVSSSSLVLYQSSDSSNGQSHLL PSNHSVIETFISTQMASSSQ >gi568815586r:120920564_121138971|GENSCAN_predicted_CDS_2|2043_bp atggtttctaaactgagccagctgcagacggagctcctggcggccctgctcgagtcaggg ctgagcaaagaggcactgatccaggcactgggtgagccggggccctacctcctggctgga 
gaaggccccctggacaagggggagtcctgcggcggcggtcgaggggagctggctgagctg cccaatgggctgggggagactcggggctccgaggacgagacggacgacgatggggaagac ttcacgccacccatcctcaaagagctggagaacctcagccctgaggaggcggcccaccag aaagccgtggtggagacccttctgcagcaccccacctcaccagcaggcgccattagaggc tgcccgttctacatccccatccgctggcggactccccgtctcctggagaactggggagac ccacagcagagacatgactcacaggtggcatcaggtccctttgagtctctctgggaggac ccgtggcgtgtggcgaagatggtcaagtcctacctgcagcagcacaacatcccacagcgg gaggtggtcgataccactggcctcaaccagtcccacctgtcccaacacctcaacaagggc actcccatgaagacgcagaagcgggccgccctgtacacctggtacgtccgcaagcagcga gaggtggcgcagcagttcacccatgcagggcagggagggctgattgaagagcccacaggt gatgagctaccaaccaagaaggggcggaggaaccgtttcaagtggggcccagcatcccag cagatcctgttccaggcctatgagaggcagaagaaccctagcaaggaggagcgagagacg ctagtggaggagtgcaatagggcggaatgcatccagagaggggtgtccccatcacaggca caggggctgggctccaacctcgtcacggaggtgcgtgtctacaactggtttgccaaccgg cgcaaagaagaagccttccggcacaagctggccatggacacgtacagcgggcccccccca gggccaggcccgggacctgcgctgcccgctcacagctcccctggcctgcctccacctgcc ctctcccccagtaaggtccacggtgtgcgctatggacagcctgcgaccagtgagactgca gaagtaccctcaagcagcggcggtcccttagtgacagtgtctacacccctccaccaagtg tcccccacgggcctggagcccagccacagcctgctgagtacagaagccaagctggtctca gcagctgggggccccctcccccctgtcagcaccctgacagcactgcacagcttggagcag acatccccaggcctcaaccagcagccccagaacctcatcatggcctcacttcctggggtc atgaccatcgggcctggtgagcctgcctccctgggtcctacgttcaccaacacaggtgcc tccaccctggtcatcggcctggcctccacgcaggcacagagtgtgccggtcatcaacagc atgggcagcagcctgaccaccctgcagcccgtccagttctcccagccgctgcacccctcc taccagcagccgctcatgccacctgtgcagagccatgtgacccagagccccttcatggcc accatggctcagctgcagagcccccacgccctctacagccacaagcccgaggtggcccag tacacccacacgggcctgctcccgcagactatgctcatcaccgacaccaccaacctgagc gccctggccagcctcacgcccaccaagcaggtcttcacctcagacactgaggcctccagt gagtccgggcttcacacgccggcatctcaggccaccaccctccacgtccccagccaggac cctgccagcatccagcacctgcagccggcccaccggctcagcgccagccccacagtgtcc tccagcagcctggtgctgtaccagagctcagactccagcaatggccagagccacctgctg ccatccaaccacagcgtcatcgagaccttcatctccacccagatggcctcttcctcccag taa 
>gi568815586r:120920564_121138971|GENSCAN_predicted_peptide_3|263_aa MAAPSGTVSDSESSNSSSDAEELERCREAAMPAWGLEQRPHVAGKPRAGAANSQLSTSQP SLRHKVNEHEQDGNELQTTPEFRAHVAKKLGALLDSFITISEAAKEPAKAKVQKVALEDD GFRLFFTSVPGGREKEESPQPRRKRQPSSSSSEDSDEEWRRCREAAVSASDILQESAIHS PGTVEKEAKKKRKLKKKAKKVASVDSAVAATTPTSMATVQKQKSGELNGDQVSLGTKKKK KAKKASETSPFPPAKSATAIPAN >gi568815586r:120920564_121138971|GENSCAN_predicted_CDS_3|792_bp atggcggcgcccagtggcacagtgagcgattcggaaagtagtaacagcagtagcgatgcg gaggagctggagcggtgccgcgaggcggcaatgccggcttggggcttggagcaacgcccg cacgtggcagggaagccaagagccggtgctgcaaatagccagttgtcaacctcccaaccg agcctcaggcataaggtgaatgagcatgaacaagatggcaacgagcttcagaccacccct gaattccgagcccacgtagccaagaagctgggagccctgctggacagcttcattaccatc tcagaagcagcaaaggagccagcaaaagctaaggtacagaaagtcgctttggaggatgat ggtttccgccttttcttcacatctgtccctggaggccgtgagaaggaagagtctccccaa ccccgccgaaagcgacagccctccagctccagcagtgaggacagtgacgaggagtggcgg cggtgccgggaggcagctgtgtcggcgtccgacatcctacaggagtcagccatccacagc cctggaacagtggagaaggaggcaaagaagaaaaggaagttgaaaaagaaagccaagaag gtggccagtgtcgactcggctgtcgctgccaccacccccaccagcatggccacagtccag aagcagaagtcaggtgagctcaacggggaccaggtgtcgcttgggaccaaaaagaagaaa aaggcaaagaaggccagcgagacctctccattcccaccagcaaagagtgctacagctata cctgcaaactga >gi568815586r:120920564_121138971|GENSCAN_predicted_peptide_4|514_aa MALMQELYSTPASRLDSFVAQWLQPHREWKEEVLDAVRTVEEFLRQEHFQGKRGLDQDVR VLKVVKVGSFGNGTVLRSTREVELVAFLSCFHSFQEAAKHHKDVLRLIWKTMWQSQDLLD LGLEDLRMEQRVPDALVFTIQTRGTAEPITVTIVPAYRALGPSLPNSQPPPEVYVSLIKA CGGPGNFCPSFSELQRNFVKHRPTKLKSLLRLVKHWYQQYVKARSPRANLPPLYALELLT IYAWEMGTEEDENFMLDEGFTTVMDLLLEYEVICIYWTKYYTLHNAIIEDCVRKQLKKER PIILDPADPTLNVAEGYRWDIVAQRASQCLKQDCCYDNRENPISSWNVKRARDIHLTVEQ RGYPDFNLIVNPYEPIRKVKEKIRRTRGYSGLQRLSFQVPGSERQLLSSRCSLAKYGIFS HTHIYLLETIPSEIQVFVKNPDGGSYAYAINPNSFILGLKQQIEDQQGLPKKQQQLEFQG QVLQDWLGLGIYGIQDSDTLILSKKKGEALFPAS >gi568815586r:120920564_121138971|GENSCAN_predicted_CDS_4|1545_bp atggcactgatgcaggaactgtatagcacaccagcctccaggctggactccttcgtggct cagtggctgcagccccaccgggagtggaaggaagaggtgctagacgctgtgcggaccgtg 
gaggagtttctgaggcaggagcatttccaggggaagcgtgggctggaccaggatgtgcgg gtgctgaaggtagtcaaggtgggctccttcgggaatggcacggttctcaggagcaccaga gaggtggagctggtggcgtttctgagctgtttccacagcttccaggaggcagccaagcat cacaaagatgttctgaggctgatatggaaaaccatgtggcaaagccaggacctgctggac ctcgggctcgaggacctgaggatggagcagagagtccccgatgctctcgtcttcaccatc cagaccagggggactgcggagcccatcacggtcaccattgtgcctgcctacagagccctg gggccttctcttcccaactcccagccaccccctgaggtctatgtgagcctgatcaaggcc tgcggtggtcctggaaatttctgcccatccttcagcgagctgcagagaaatttcgtgaaa catcggccaactaagctgaagagcctcctgcgcctggtgaaacactggtaccagcagtat gtgaaagccaggtcccccagagccaatctgccccctctctatgctcttgaacttctaacc atctatgcctgggaaatgggtactgaagaagacgagaatttcatgttggacgaaggcttc accactgtgatggacctgctcctggagtatgaagtcatctgtatctactggaccaagtac tacacactccacaatgcaatcattgaggattgtgtcagaaaacagctcaaaaaagagagg cccatcatcctggatccggccgaccccaccctcaacgtggcagaagggtacagatgggac atcgttgctcagagggcctcccagtgcctgaaacaggactgttgctatgacaacagggag aaccccatctccagctggaacgtgaagagggcacgagacatccacttgacagtggagcag aggggttacccagatttcaacctcatcgtgaacccttatgagcccataaggaaggttaaa gagaaaatccggaggaccaggggctactctggcctgcagcgtctgtccttccaggttcct ggcagtgagaggcagcttctcagcagcaggtgctccttagccaaatatgggatcttctcc cacactcacatctatctgctggagaccatcccctccgagatccaggtcttcgtgaagaat cctgatggtgggagctacgcctatgccatcaaccccaacagcttcatcctgggtctgaag cagcagattgaagaccagcaggggcttcctaaaaagcagcagcagctggaattccaaggc caagtcctgcaggactggttgggtctggggatctatggcatccaagacagtgacactctc atcctctcgaagaagaaaggagaggctctgtttccagccagttag >gi568815586r:120920564_121138971|GENSCAN_predicted_peptide_5|199_aa MGTQACGSSAEALSNGCTIITQLWDHQSLCKRVPSSSVSTGKGTMLNYSSDLDLILFLSC FSSVQDQAQLRDSIISFIEENWFTVARAWPTISLWSGTGRVQSRKSSGVIWMDKLPAFDA LGKDSDRLSVDWTVPTYIKQHKDYREARSRRGGDACSCKKMPGNRHRNSRSQISKTQQHK GSCLNKFIGESKAAIRTKL >gi568815586r:120920564_121138971|GENSCAN_predicted_CDS_5|600_bp atgggaacccaagcatgtggctccagcgccgaggctctgagcaacggatgcaccatcatc actcagctgtgggaccaccagtcactgtgcaaaagagttccatcatctagcgtctccaca gggaaggggacgatgctgaactacagctctgacctggacctgattctcttcctgagctgc 
ttctccagcgtccaagaccaggcacagctgcgagacagtatcatcagcttcattgaagag aattggttcactgtagcaagagcctggcctacaatatcactgtggtccggcacagggagg gtccagtccaggaagagcagcggagtcatttggatggataagctcccggctttcgatgct ctgggtaaagacagcgacaggctctcagtggattggacggtgcccacctacattaagcag cacaaagactacagggaagctagaagcagacgtggcggggatgcctgcagctgcaagaag atgcctgggaacagacacagaaactctcgctcccagataagcaaaacacagcagcacaaa ggcagctgtttgaataaattcattggagagtctaaggcagcaatccggaccaagctgtaa >gi568815586r:120920564_121138971|GENSCAN_predicted_peptide_6|132_aa MKTPLGWALARVGKGSLLTLFSLLDLLTERGVFPGSDPSQRVGALFSMHFSKAHGSEIQQ EQIMEDSNPDYARKVIWAEENLLAIPRSIPSASGQRQEEPQTSPPASSSTEGVLDLKLSG DWNHCCGLYSRA >gi568815586r:120920564_121138971|GENSCAN_predicted_CDS_6|399_bp atgaaaacacccctgggctgggctctagctcgagtgggcaaaggttccctcctcacactg ttcagcctgttggaccttctcactgaacgaggcgtcttcccaggatctgacccaagtcag agggtgggggctctattttcaatgcatttttccaaagctcatggctcagagatacaacaa gaacagatcatggaagattcaaatccagactatgccagaaaagtgatctgggctgaagaa aacctcctggccattcccagaagcatcccgtcggcatctgggcaaaggcaggaggagcct cagacatcaccacctgccagctcctccactgaaggcgtcctggacctgaagctatcagga gattggaatcattgctgtggtttgtacagcagggcctga >gi568815586r:120920564_121138971|GENSCAN_predicted_peptide_7|153_aa MGSLGEKTCEIKCEAGKTRTCAGGGDLLHRVCTFARTVGASAAVGGEEILDHTSERGGVG GSGRVCKSVGRDSLLRKANIRQQNPKNGVIKAASGVRGQFHARHENKEHVIKALLRAKFK FPGCQKIHISQKWVFTKFSEDEFEDTVAEKWLI >gi568815586r:120920564_121138971|GENSCAN_predicted_CDS_7|462_bp atggggtctctgggtgagaaaacgtgtgagataaagtgcgaggctggaaaaacgcgcacg tgtgcaggaggcggggatctgctgcaccgagtgtgcacatttgcacgaaccgtgggggcg tctgcggctgtgggtggagaggaaatcctggatcacacatctgagagagggggtgtgggt gggagtgggagagtatgtaagagcgtgggtcgtgacagccttctacgaaaggcaaatata agacagcaaaatccaaagaatggagttattaaggctgcaagtggagtccgaggccagttc catgccaggcatgagaacaaggagcatgtgattaaggccctgctcagggccaagttcaag tttcctggctgccagaagatccacatctcacagaagtgggtcttcaccaagttcagtgaa gatgaatttgaagacacggtggctgagaagtggctcatctga >gi568815586r:120920564_121138971|GENSCAN_predicted_peptide_8|224_aa MENGRKGQEWSQGSAQGEGRVTWRGTEDRSLKDTNTKETKLRQSREERKAERQTAPGAES 
PKVSKISYTKWDVQVTHMFVLHTFCMKANGYHIEQQRYTVLTQQYRKLNRTVLQQLQVSK QAKEFCDDQKTDHSIPAEQSQTNIQANIRLLEAACLGAGISLKSSTQWATHAGIFPMVSL PWHLLVIDRSWNKLNLKSTDLHIHSCGLLDHSLIHQLNESHAVL >gi568815586r:120920564_121138971|GENSCAN_predicted_CDS_8|675_bp atggaaaatgggcggaaggggcaagagtggagccaggggtctgcccagggagagggtaga gtgacatggagggggaccgaagacagatccctgaaggacaccaacaccaaggagactaag ctgaggcagagtagggaggagaggaaagccgaaagacagactgccccaggagcagagtct ccaaaagtgagcaaaatcagctacacaaaatgggacgtgcaagttacccatatgtttgtg ttgcacacattttgcatgaaagctaatggctaccatattgaacagcaaagatatacggta cttacccagcagtatagaaagctcaaccggacagtgctacaacagttgcaggtaagcaag caagccaaagaattctgtgatgatcagaagactgaccacagcatccctgcagaacagagc cagacaaacatccaggcaaacatccgcctgctggaggcagcctgcctgggtgctgggatc agtctcaaatctagcacacaatgggccacccacgctggcatctttcctatggtgtccttg ccctggcatctcctggtcattgataggtcctggaataagctcaatctgaaatcaacagac cttcatatccacagctgtggcttattggaccattccctgatccaccagctgaacgagagt catgctgtcctctga >gi568815586r:120920564_121138971|GENSCAN_predicted_peptide_9|42_aa MPACCSCSDVFQYETNKVTRIQSMNYGTIKWFFHVIIFSYVX >gi568815586r:120920564_121138971|GENSCAN_predicted_CDS_9|126_bp atgccggcctgctgcagctgcagtgatgttttccagtatgagacgaacaaagtcactcgg atccagagcatgaattatggcaccattaagtggttcttccacgtgatcatcttttcctac gtttgn