GENSCAN 1.0 Date run: 5-Nov-116 Time: 12:52:24 Sequence gi568815585f:48133361_48361221 : 227861 bp : 39.35% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 17369 17417 49 1 1 65 58 83 0.422 4.16 1.02 Intr + 24829 24897 69 2 0 19 69 120 0.014 1.44 1.03 Intr + 33141 33222 82 1 1 119 86 50 0.455 5.68 1.04 Intr + 49291 49411 121 1 1 11 77 122 0.025 2.98 1.05 Intr + 68601 68680 80 2 2 71 113 66 0.068 4.83 1.06 Term + 68780 68957 178 2 1 -33 48 211 0.610 1.18 1.07 PlyA + 69417 69422 6 1.05 2.00 Prom + 70264 70303 40 -10.75 2.01 Sngl + 70432 70965 534 0 0 47 42 292 0.999 16.32 2.02 PlyA + 71249 71254 6 1.05 3.00 Prom + 73175 73214 40 -3.65 3.01 Sngl + 83279 83734 456 1 0 39 48 287 0.852 15.73 3.02 PlyA + 83963 83968 6 1.05 4.03 PlyA - 84494 84489 6 1.05 4.02 Term - 100678 99839 840 1 0 109 50 419 0.943 32.14 4.01 Init - 109076 109026 51 2 0 66 111 -17 0.236 -0.39 4.00 Prom - 112965 112926 40 -5.85 5.00 Prom + 113319 113358 40 -4.35 5.01 Init + 115440 115466 27 2 0 94 42 40 0.542 -0.35 5.02 Intr + 120448 120576 129 0 0 80 108 61 0.985 7.27 5.03 Intr + 122817 123023 207 2 0 41 105 262 0.997 21.55 5.04 Intr + 124766 124876 111 1 0 97 70 20 0.626 0.76 5.05 Intr + 125437 125587 151 0 1 56 85 97 0.952 5.01 5.06 Term + 127779 127864 86 1 2 104 47 97 0.964 4.04 5.07 PlyA + 128538 128543 6 1.05 6.05 PlyA - 128768 128763 6 1.05 6.04 Term - 136751 136632 120 2 0 56 54 92 0.550 0.09 6.03 Intr - 144193 144050 144 1 0 128 71 40 0.862 5.86 6.02 Intr - 149657 149535 123 2 0 28 98 86 0.756 3.46 6.01 Init - 153229 153125 105 1 0 64 58 70 0.190 1.87 6.00 Prom - 156557 156518 40 -6.95 7.00 Prom + 156626 156665 40 -7.85 7.01 Init + 160954 160978 25 0 1 68 123 8 0.225 2.07 7.02 Intr + 161986 162144 159 2 0 92 39 61 0.147 0.64 7.03 Intr + 170524 170689 166 2 1 63 99 358 0.253 32.50 7.04 Intr + 173920 174046 127 1 1 54 87 87 0.809 4.96 7.05 Term + 179398 179460 63 0 0 85 42 6 0.124 -7.39 7.06 PlyA + 181119 181124 6 1.05 8.06 PlyA - 181489 181484 6 1.05 8.05 Term - 183787 183611 177 1 0 118 50 93 0.514 5.20 8.04 Intr - 185205 184969 237 0 0 27 77 303 0.751 19.99 8.03 Intr - 186380 186010 371 0 2 -11 43 380 0.533 17.50 8.02 Intr - 186762 186617 146 2 2 1 69 237 0.282 12.11 8.01 Init - 187114 186774 341 1 2 35 -21 336 0.126 14.18 8.00 Prom - 193487 193448 40 -5.55 9.03 PlyA - 194475 194470 6 1.05 9.02 Term - 195029 194719 311 0 2 24 36 350 0.984 17.64 9.01 Init - 198443 198377 67 2 1 70 97 -26 0.374 -2.19 9.00 Prom - 201386 201347 40 -4.25 10.02 PlyA - 201799 201794 6 1.05 10.01 Sngl - 204360 203347 1014 0 0 56 36 333 0.774 21.66 10.00 Prom - 204566 204527 40 -11.34 11.00 Prom + 204891 204930 40 -9.55 11.01 Init + 206009 206134 126 1 0 86 58 135 0.615 8.51 11.02 Intr + 209239 209354 116 0 2 105 103 122 0.999 13.73 11.03 Intr + 211720 211839 120 1 0 109 98 81 0.975 9.89 11.04 Intr + 215596 215663 68 1 2 95 115 0 0.741 1.03 11.05 Term + 219856 220124 269 2 2 -50 45 207 0.640 -2.73 11.06 PlyA + 220585 220590 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_1|192_aa MGPQAKGCGQPLEVGKENKNESSDRARNSGKRADRQRAGGDLKIWNNSQPGAYSAATAAA GRTLSLGMSERDLLEALRRMKYQFEKSLYQSNMNSAKIEERANIKLMDHNSSPAREQNWT ENEFDELTEVGFRRITSVENINDLMELKNTAEELCEAYTNFNSRFDQAEERISVIEDQIN EIKREARLEKKE >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_1|579_bp atggggccacaagccaagggatgtggacagcctttagaagttggaaaagaaaataagaat gaaagctctgacagagccaggaacagtggcaagagagctgatcggcaaagagcaggtggt gacttgaaaatttggaataattcacaacctggtgcttattcagcagcaacagcagcagca ggcagaactctgagtttgggcatgtctgaacgggatctccttgaggcactaagaaggatg aaatatcagtttgaaaagagcctctatcagagcaacatgaattctgctaaaattgaagaa agagcaaatatcaaattgatggatcacaactcctcgccagcaagggaacaaaactggaca gagaatgagtttgatgagttgacagaagtaggtttcagaagaataaccagtgtagagaac ataaatgacctgatggagctgaaaaacacagcagaagaactttgtgaagcatacacaaac ttcaatagccgatttgatcaagcggaagaaaggatatcagtgattgaagatcaaattaat gaaataaagcgagaagcaagattagagaaaaaagagtga >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_2|177_aa MFFENNENKDTTYQNLWDTFKVVSKEKFIALNAHKRKQERCKIDTLTSQLKELQKQKQTN SKASRRQEITKIREELKEIESRKTLQRISESRSWFFEKINKIDRPQARLIKKKREKNQID AIKNDKRDITTDPTEIQTTIREYYKHLYVNKLENLEEMDKFLDTYTLPRVTHKKSNL >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_2|534_bp atgttctttgaaaacaatgagaacaaagacacaacgtaccagaatctctgggacacattt aaagtagtgtctaaagagaaatttatagcactaaatgcccacaagagaaagcaggaaaga tgtaaaatcgacaccctaacatcacaattaaaagaactacagaagcaaaagcaaacaaat tcaaaagctagcagaagacaagaaataactaagatcagagaagaactgaaggagatagag tcacgaaaaacccttcaaagaatcagtgaatccaggagctggttttttgaaaagatcaac aaaatagatagaccacaagccagactaataaagaagaaaagagagaagaatcaaatagac gcaataaaaaatgataaaagggatatcaccaccgatcccacagaaatacaaactaccatc agagaatactataaacacctctatgtaaataaactagaaaatctagaagaaatggataaa ttcctggacacatacaccctcccaagagtaactcacaagaagtcgaatctctga >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_3|151_aa MYGNAWISRQNFAAGTEPSWGTSARAVQKGNVGSEPPHRVPTGALASGAVRRGPLSSRPK NGRSTDSLHHAPGKAADIQCQPVKAARSGAVPCKATGEKLLKAMGAHLLHQCDLDVRHGV KEDHFGTLRFNDWLSFNDTFIHETIQIGTQL >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_3|456_bp atgtacggaaatgcctggatatccaggcagaactttgctgcagggacagagccctcatgg ggaacttctgctagggcagtgcagaagggaaatgtggggtcagagcccccacatagagtc ccgactggggcactggccagtggagctgtgagaagagggccactgtcctctagacccaag aatggtagatccaccgacagcttgcaccatgcacctggaaaagctgcagacattcaatgc cagcctgtgaaagcagccaggtcgggggccgtaccctgcaaagccacaggggagaagttg ctcaaggccatgggagcccacctcttgcatcagtgtgacctggatgtgagacatggagtc aaagaagatcattttggaactttaaggtttaatgactggttaagttttaatgacactttt atacatgagacaatacaaataggaactcagctgtga >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_4|296_aa MGKKSSKKKERKKERNLVKFTCHYRRAVEKGGKETLLKLRRLQLLDLVQVLFKPRNESGK QQRFRLVMMSAILGTLFILGRKCQTRLLLLCLSLPVPRSNKPSPGCTHAPRTAQTTGNSA QLANRSLSRPPKPPTPRQAPLPAHALPYKGAADPQETGAPGSLGDLSPRREPRARHLGAH CRCSPRSPPRPGPAPASSTSPGSDLAVHRDGVGGDDERLLAALGLVLLGLLLGQSGVERH LHHGAAWGALEGRRRARGAGCGLPAAIAASTAGLRERLRAAEEVREAPRRQRRLLR >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_4|891_bp atgggcaagaaatcatcaaaaaaaaaagaaagaaagaaagaaagaaacttggtgaagttc acctgccactaccgaagggccgtggaaaaaggaggtaaagaaacgcttttaaaattaaga cgcttacagttgctggacttagtccaagtgctgttcaagcctagaaacgaaagtgggaaa caacagaggtttaggcttgtaatgatgtcagccattttaggtacattatttatactggga aggaagtgccaaactcggttgttgttgttgtgcctgtcgctgccggtccccagatcgaac aagccaagtcctggctgcacgcacgctccccggactgcacaaacgaccgggaattcggca caattagccaatcgttccctctccaggcctcccaagcccccgaccccgcgtcaggcaccg cttccagcacacgcacttccttacaaaggagctgcggacccacaggaaacgggagcgccg ggctctctcggggacttgtccccgcggcgcgagccccgcgcccgccacctcggggcgcac tgccgctgcagtccgcgcagccctccccggccgggcccagcacccgcttcctcgacctcc ccgggctcggaccttgcagtccaccgcgacggcgtcgggggggatgatgagcgcctcctc gccgctcttgggctcgtccttcttggcctccttctgggccagagcggagttgaacgtcac cttcaccatggcgcggcctggggcgccctcgaagggcggcggcgggctcggggcgcgggc tgcgggctcccggctgcgattgcagcctctacggccgggctccgggagcggctccgggcg gctgaagaggttcgggaagctccgcggcggcagaggcggctactgcggtag >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_5|236_aa MGHGLDKLDDPDDVVPVGQRRAWCWCMCFGLAFMLAGVILGGAYLYKYFALQPDDVYYCG IKYIKDDVILNEPSADAPAALYQTIEENIKIFEEEEVEFISVPVPEFADSDPANIVHDFN KKLTAYLDLNLDKCYVIPLNTSIVMPPRNLLELLINIKAGTYLPQSYLIHEHMVITDRIE NIDHLGFFIYRLCHDKETYKLQRRETIKGIQKREASNCFAIRHFENKFAVETLICS >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_5|711_bp atgggccacgggttggacaagcttgatgacccagatgatgtggtaccagttggccaaaga agagcctggtgttggtgcatgtgctttggactagcatttatgcttgcaggtgttattcta ggaggagcatacttgtacaaatattttgcacttcaaccagatgacgtgtactactgtgga ataaagtacatcaaagatgatgtcatcttaaatgagccctctgcagatgccccagctgct ctctaccagacaattgaagaaaatattaaaatctttgaagaagaagaagttgaatttatc agtgtgcctgtcccagagtttgcagatagtgatcctgccaacattgttcatgactttaac aagaaacttacagcctatttagatcttaacctggataagtgctatgtgatccctctgaac acttccattgttatgccacccagaaacctactggagttacttattaacatcaaggctgga acctatttgcctcagtcctatctgattcatgagcacatggttattactgatcgcattgaa aacattgatcacctgggtttctttatttatcgactgtgtcatgacaaggaaacttacaaa ctgcaacgcagagaaactattaaaggtattcagaaacgtgaagccagcaattgtttcgca attcggcattttgaaaacaaatttgccgtggaaactttaatttgttcttga >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_6|163_aa METAGCDPRPMDSCGTLRPQNHPSASRPHCDVWDSVAKKEFELNLFVLLLSTILTLLPPD CEDWLTRSEYEDIGGKNHKNNIHPKYLLTSGLIGQSLRGQCKVSKEEKEGSFKIDQPNRK KTEKQYDRVKGREAVPTVLCFSALVDISFHMGYGNDHLLEKPL >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_6|492_bp atggaaactgcaggctgtgacccaagacccatggactcctgcggcactctcaggccccag aaccatccatctgcttccaggccacactgtgacgtatgggactctgtagccaagaaagaa tttgaactcaatctctttgtgcttttactttcaaccatcctaacactactgcctcctgac tgtgaggattggctcacaagaagtgagtacgaagacattggtgggaagaatcacaaaaat aatatccatccaaaatatttgttaaccagtgggcttattggacaatctttaagggggcag tgtaaagtcagtaaggaagagaaagaaggaagttttaagattgatcaaccaaatagaaaa aaaactgaaaagcaatatgatagggttaaaggaagggaagcagtgcccacggttctgtgc ttctctgccctagtggacatctcattccacatgggctatggaaatgaccaccttctggaa aagcctctgtga >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_7|179_aa MGLYCHTVGTNPHFLHGTWISGQQALMHKHLSRCLLRSYWPNKSQGQAQHQCGTLHKGMN VTRWLPPRKGVMPPKTPRKTAATAAAAAAEPPAPPPPPPPEEDPEQDSGPEDLPLVRLEF EETEEPDFTALCQKLKIPDHVRERAWLTWEKVSSVDGVLKVEAYVIDLSIFYACFSASF >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_7|540_bp atgggtctttattgccacacggtcgggactaacccacacttcttacatggcacctggatt tcaggacagcaagccctaatgcacaagcacttatcaaggtgcttgctgagatcctattgg ccaaacaagtcacaaggccaagcccagcatcagtgtggaactctgcacaaaggcatgaat gtcactcgctggctcccgccgcggaaaggcgtcatgccgcccaaaaccccccgaaaaacg gccgccaccgccgccgctgccgccgcggaacccccggcaccgccgccgccgccccctcct gaggaggacccagagcaggacagcggcccggaggacctgcctctcgtcaggcttgagttt gaagaaacagaagaacctgattttactgcattatgtcagaaattaaagataccagatcat gtcagagagagagcttggttaacttgggagaaagtttcatctgtggatggagtattgaag gtggaagcttatgtcattgacttgagcattttttatgcatgttttagtgcctctttctga >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_8|423_aa MFLRNATPGVAPQSKWEAFGPPGSFRFPGCFSEADEGVESVSVSARVQMLISTLQRSGVA RGTTDERAAQRGHRADAKPAAKPTVHKEQPALPACGLVADFDPMREEETADFGPFNDDSV DRDIAEAIREYLKAKSGAAQPGAGRGQPGAAQPSRAAGSGSRYKKPDPNENSTKSLLKSH EELAAKVAHRQGLKGAHKEFAFRKPPRLAKTNVQPRSLKSRVTTKQENKGSPKPAAPCSP SEAPQNKSGVKRSAGALRRGKQVTSAAQAPEASDSSSNDGTEEAIQGKASEVPGGEGAAK GPGDTRMTQGQGKTDEVRHLDEKESSEDKSSSLDSDEDLDTAIKDLRSKRKLKKRCREPR AACRKGGPGRAQPCLWKPTLAGEEGRRALAKQQGTDRAEFARQEELGLGGKHFRPEVLTK GHR >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_8|1272_bp atgtttctcaggaacgctactccaggggttgctccccagtccaaatgggaggcctttggc ccaccagggagctttaggttccccgggtgcttctcggaggctgacgagggggtggagagc gtgtcggtgagcgcccgggtgcagatgctcatcagcacgctgcagcgcagcggggttgct cggggcaccaccgatgagcgcgctgcacagaggggccacagggcagacgccaagccagct gccaagcccaccgtgcacaaggagcagcccgcattgcctgcctgtggtcttgttgctgac tttgaccccatgagggaggaggaaactgcagactttggcccattcaatgatgattccgtg gaccgggacattgcggaagccatccgggagtacctaaaggcaaagagtggagccgcacag cccggggctggcaggggccagccaggcgcagcccagccttccagggctgcaggcagtggc agtagatataaaaaaccagacccaaacgaaaattcgaccaagtcactcttgaaatcccac gaagagctagccgcaaaggtggcgcatcggcagggtctgaagggcgcccataaagagttc gcctttcgcaaacctccccggttagcaaagacgaacgtgcagcccagaagcctcaagtcc agggtcacgaccaagcaggagaacaagggcagcccgaagccagcagccccctgcagccct tcagaagcaccacagaataaaagcggggtcaagaggagcgctggcgccctgagaagggga aagcaagtcacaagtgcggcgcaggcgcccgaggcgtcagactccagcagcaacgacggc actgaggaggccatccagggcaaagccagcgaggtcccgggaggggagggcgcagccaag gggcccggcgacactcgcatgacgcagggccagggtaagacagacgaggtgaggcacctg gacgagaaggagagctccgaagacaagagcagctccctggacagtgacgaggacctggac acggccatcaaggacttaaggtccaagcgaaagctcaagaagaggtgcagggaacccagg gctgcgtgcaggaagggaggcccaggccgggcccagccctgtctttggaagcccacactt gctggcgaagaaggacggcgggccctggccaagcagcaaggcacagacagggcggagttt gcccgacaggaggaactcgggctcggaggaaagcattttagacctgaggtattgacgaag ggtcatcgatag >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_9|125_aa MNMGVYVSLQHTDFISCGYLPKGSQTTKKASPISIKLGSSKPKETVPTLAPKTLSVAAAF NEDEDSESEERPPEAKMRMKNIARDTPTSAGPNSFNKGKHGFSDNQKLWEWNIKSHLRNV YDQDN >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_9|378_bp atgaacatgggagtgtatgtatctcttcaacatactgacttcatttcctgtggatatcta ccaaaaggtagtcagacgacaaagaaagcatcacccatatccatcaaacttggatcaagt aagcctaaagaaactgttccaactcttgctccaaaaactctttcagtagcagcagctttt aatgaagatgaagatagtgaatcagaggaaaggcctccagaagcaaagatgaggatgaag aatattgcaagggatacaccaacatcagctggaccaaactccttcaataaaggaaagcat gggttttctgataaccagaagctatgggagtggaatataaaatctcatcttcgaaatgtc tatgaccaagacaattaa >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_10|337_aa MTGSNSHITILTLNVNGLNAPIKRHRLANWIKSQDPSVCCIQEIHLTCRDTHRLKIKEWR KIYQANGKQKKAGVAILVSDKTDVKPTKIKRDKEGHYIMVKGSIQQEELTILNIYAPNTG APKFIKQVLRDLQRDLDSHTIIMGDFNTPLSTLDSSMSQNVNKDIQELNSALHQADLIDI YRTLHPKSTEYTFFSAYHTYSKIDHIVGSKALLSKCKRKEIITNCLSDHSAIKLELRIKK LTQNRITTWKLNNLLLNDYWVHNEMKAEIKMFFETNQNKDTTYQNLWDTCKAVCRGKFIA LNAHKRKQETSKIDTLTSQLKELEKSKHIQKLAEGKK >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_10|1014_bp atgacaggatcaaattcacacataacaatattaaccttaaatgtaaatgggctaaatgct ccaattaaaagacacagactggcaaattggataaagagtcaagacccatcagtgtgctgt attcaggagatccatctcacgtgcagagacacacataggctcaaaataaaggaatggagg aagatctaccaagcaaatggaaaacaaaaaaaggcaggggttgcaatcctagtctctgat aaaacagacgttaaaccaacaaagatcaaaagagacaaagaaggccattacataatggta aagggatcaattcaacaagaagaactaactatcctaaatatatatgcacccaatacagga gcacccaaattcataaagcaagtccttagagacctacaaagagacttagactctcacaca ataataatgggagactttaacaccccactgtcaacattagacagttcaatgagtcagaac gttaacaaggatatccaggaattgaactcagctctgcaccaagcggacctaatagacatc tacagaactctccatccaaaatcaacagaatatacattcttctcagcatatcacacttat tccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagaaaagaa attataacaaactgtctctcagatcacagtgcaatcaaactagaactcaggattaagaaa ctcactcaaaaccgcataactacatggaaactgaacaacctgctcctgaatgactactgg gtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaatcagaacaaagac acaacataccagaatctctgggacacatgtaaagcagtgtgtagagggaaatttatagca ctaaatgcccacaagagaaagcaggaaacatccaaaattgacaccctaacatcacaatta aaagaactagagaagagcaaacacattcaaaagctagcagaaggcaagaaataa >gi568815585f:48133361_48361221|GENSCAN_predicted_peptide_11|232_aa MAGAPLLASLLPCSLISDCCASNERGSVGVGPSEPGTGYNLLGGYIQKKKELWGICIFIA AVDLDEMSFTFTELQKNIEISVHKFFNLLKEIDTSTKVDNAMSRLLKKYDVLFALFSKLE RISTEINSALVLKVSWITFLLAKGKKREDTNKIRNEKGDITADTAETKSIISGFYKQLYA NKLENIEEMGKFLDMYNLPRLNQEEIQNLNRPITSNEIEAIIKSPSKEKPGT >gi568815585f:48133361_48361221|GENSCAN_predicted_CDS_11|699_bp atggcgggcgcccctctcctagcctcgctactgccttgcagtttgatctcagactgctgt gctagcaatgagcgaggctccgtgggtgtgggaccctccgagccaggcacaggatataat ctcctgggaggttatattcaaaagaaaaaggaactgtggggaatctgtatctttattgca gcagttgacctagatgagatgtcgttcacttttactgagctacagaaaaacatagaaatc agtgtccataaattctttaacttactaaaagaaattgataccagtaccaaagttgataat gctatgtcaagactgttgaagaagtatgatgtattgtttgcactcttcagcaaattggaa aggatatctactgaaataaattctgcattggtgctaaaagtttcttggatcacattttta ttagctaaaggaaaaaagagagaagatacaaataagatcagaaatgaaaaaggagacatt acagctgatactgcagaaactaaaagtatcattagtggcttctataagcaactatatgcc aataaactggaaaatatagaagaaatgggcaaattcctagacatgtacaacttaccaaga ttgaaccaggaagaaatccaaaacctgaacagaccaataacaagtaatgagattgaagcc ataataaagtctcctagtaaagaaaagcctgggacctga