GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:23:03 Sequence gi568815594r:100315896_100617914 : 302019 bp : 36.22% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5320 5347 28 0 1 79 92 36 0.809 2.91 1.02 Intr + 5466 5651 186 1 0 73 116 104 0.586 10.44 1.03 Term + 15834 16231 398 1 2 51 38 118 0.031 -2.65 1.04 PlyA + 16771 16776 6 1.05 2.00 Prom + 18201 18240 40 -3.65 2.01 Init + 18978 19143 166 2 1 80 87 71 0.807 6.04 2.02 Term + 37018 37118 101 2 2 26 37 202 0.499 6.21 2.03 PlyA + 37157 37162 6 1.05 3.00 Prom + 45484 45523 40 -7.95 3.01 Init + 45612 45706 95 2 2 59 97 42 0.476 2.10 3.02 Intr + 47404 47489 86 1 2 122 80 41 0.702 5.24 3.03 Intr + 61663 61865 203 2 2 -12 29 142 0.006 -3.62 3.04 Intr + 64417 64774 358 0 1 30 88 222 0.201 10.30 3.05 Term + 65135 66789 1655 0 2 -6 47 604 0.895 35.53 3.06 PlyA + 67597 67602 6 1.05 4.05 PlyA - 68995 68990 6 1.05 4.04 Term - 92356 92112 245 2 2 32 41 206 0.449 5.38 4.03 Intr - 100064 100003 62 1 2 80 119 68 0.878 6.56 4.02 Intr - 101246 101218 29 2 2 102 71 30 0.403 -1.30 4.01 Init - 107161 106958 204 1 0 55 89 161 0.778 11.80 4.00 Prom - 124731 124692 40 -4.15 5.09 PlyA - 124759 124754 6 1.05 5.08 Term - 127486 127361 126 1 0 44 48 149 0.819 3.80 5.07 Intr - 129138 129092 47 1 2 63 121 15 0.189 -0.49 5.06 Intr - 131676 131638 39 1 0 96 115 1 0.121 0.88 5.05 Intr - 149644 149528 117 2 0 124 89 50 0.752 8.22 5.04 Intr - 159214 159143 72 2 0 83 113 10 0.620 1.46 5.03 Intr - 164144 164022 123 0 0 112 97 107 0.990 13.64 5.02 Intr - 170885 170719 167 1 2 78 61 50 0.194 -0.02 5.01 Init - 207249 207200 50 0 2 68 87 75 0.212 5.87 5.00 Prom - 211963 211924 40 -3.65 6.02 PlyA - 212247 212242 6 -0.45 6.01 Sngl - 212904 212422 483 0 0 22 38 299 0.620 14.12 6.00 Prom - 221417 221378 40 -5.55 7.00 Prom + 224710 224749 40 -3.65 7.01 Init + 230148 230241 94 2 1 68 16 120 0.036 3.39 7.02 Intr + 243374 243524 151 0 1 82 88 140 0.206 11.70 7.03 Term + 245162 245570 409 2 1 4 38 231 0.330 3.40 7.04 PlyA + 245610 245615 6 1.05 8.02 PlyA - 248703 248698 6 1.05 8.01 Sngl - 265633 265373 261 1 0 58 41 193 0.799 6.51 8.00 Prom - 285425 285386 40 -3.65 9.00 Prom + 288871 288910 40 -5.95 9.01 Init + 291839 291941 103 1 1 60 68 129 0.941 8.55 9.02 Intr + 292099 292263 165 2 0 36 44 165 0.896 5.91 9.03 Intr + 292784 292968 185 0 2 56 35 137 0.918 3.79 9.04 Term + 293272 293427 156 0 0 33 43 184 0.963 5.35 9.05 PlyA + 293691 293696 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:100315896_100617914|GENSCAN_predicted_peptide_1|203_aa MVEQPFEDSVRIHGVQESRGGNGVASLTIIRSDSLAIFLLPVSVTLCSAGLEVLVPKVGM LQPEDNNIELEGDFGAPTGPYLPGTKLPEGGTGCHLCCFVAFTGSEKSEVTRDWSGPQAY CSSPVEKARLLRGCPLPYLLTRQILQPRPARAIEQVSTQQLPRQSSGATKSLSATASAWN CPCHSQTNEGTKTLRALSRPPIS >gi568815594r:100315896_100617914|GENSCAN_predicted_CDS_1|612_bp atggtggaacagccttttgaggactcagtcaggattcatggagtacaagaatcaaggggt ggaaatggagtggcatcactcactattatccgtagtgactcactagcaatttttttgctt cctgtttctgtgaccttatgctctgccggtctagaggtcttagttccaaaggtaggaatg ttgcaaccagaagacaacaatattgaactggaaggtgacttcggggcaccaacaggtcca tatctccctgggacaaagctcccagagggaggaacaggttgccatctttgctgttttgta gccttcacaggttccgaaaaatctgaagtgactagggactggagtgggccccaagcatat tgcagcagccctgtggaaaaggccagactgttacgtgggtgcccactcccatatctcctc accagacagatcctccagccacgtcctgccagagctattgagcaagtatcaactcagcaa ctccctagacagagttcaggggcaaccaaaagcctctcagccactgcctctgcatggaac tgcccttgccactctcagactaatgaaggaacaaagaccctaagagctttatccagacct ccaataagctaa >gi568815594r:100315896_100617914|GENSCAN_predicted_peptide_2|88_aa MLLGHSAIKIEFTTIKIAQNCTITWKLNNILMNYFCVNNEINAEIKKFFENNENKEEEEG GGGGGKEEEEEEEEEEEEEAAKQNFLEN >gi568815594r:100315896_100617914|GENSCAN_predicted_CDS_2|267_bp atgctcttgggccacagtgcaataaaaatagaattcaccacaataaaaattgctcaaaac tgtacaattacctggaaattaaacaacatccttatgaattacttttgtgtaaataatgaa attaatgcagaaatcaagaagttttttgaaaataatgagaacaaagaagaagaagaagga ggaggaggaggagggaaggaggaggaggaggaagaggaggaggaagaggaggaggaggca gcaaaacaaaattttttggagaattga >gi568815594r:100315896_100617914|GENSCAN_predicted_peptide_3|798_aa MGTVGGEIVGGSTSECAFTKFISSASSGSYIRRVVSNNSLAGLLLSWHLLLRGPELTQEE SEQTSTRWVPKPLGPDQLQTKAELSMEILNKWDQDPEAFLRRIITGDETWLTSTIPKTNH NQNNSYQEDSVQTTLSTGPEPGRLAGWLDPVKRQQSLHFSSQEATSIGREGSTTSRKHPV GQKNLNNSLQPQTFPLTQPAQVRRNQKTNSGNTTKQASLTPPQNPTSSSAMNPNQEEIPD LPEKNSGEEENSKSLENISGGIIKENFPSLARDLDIQIEEAQRTPGKFITKRSSPRHIVI RLSKVKTKHINRIKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDK PTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVK LSLFADDMIVYLENPIISAPNLLKLISNFSKVSGYKITVQKSQAFLYTNNRQTESQIMSE LTFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDINKWKNIPCSWAGRINIVKMA ILPKVIYRFNAIPIKLPVTFFTELEKTTLKFIWNQKRARIAKSVLSQKNKAGSIMLPDFK LYYKATVTKTAWYWYQNRDIDQCNRTEPSEIMLHIYNHLIFDKPDKNKQWGKDSLFNKWC WENWLAIGRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMS KTPKAMATEAKIDKWDLIKLKSFCTAKATTIRVNKQPTKWEKIFATYSSDKGLISRIHDE LQQIYKEKTTPSKSGQRI >gi568815594r:100315896_100617914|GENSCAN_predicted_CDS_3|2397_bp atgggcactgtgggtggtgagattgtgggaggctcaacttcagaatgtgcttttacaaaa ttcatatcatcagcaagttctgggtcttacataaggagagttgtctccaataactctctt gcaggtctactactgtcttggcatctgctcctcagaggacctgagctaacacaagaagaa agtgagcaaacttccactcgatgggtgccaaaaccattgggcccagatcagctacagaca aaagcagagctttcaatggaaattttaaacaagtgggatcaagatcctgaagcatttctt cgaagaattataacaggagatgaaacatggctcaccagtacaatcccaaagacaaatcac aatcaaaacaatagttaccaagaggactctgtgcagacaaccctcagtaccggtccagag ccaggtagacttgctgggtggctagacccagtaaagagacaacaatcactgcacttcagc tcacaggaagccacatccataggaagagaggggagtactacatcaaggaaacaccctgtg ggacaaaagaatctgaacaacagccttcagccccagaccttccctctgacacagcctgcc caagtgagaaggaaccagaaaaccaactctggtaatacaacaaaacaagcttctttaaca ccgccacagaatcccactagttcctcagcaatgaatccaaaccaagaagaaatccctgat ttacctgaaaagaattcaggagaagaagagaattctaaaagcttggaaaacatatctggg ggaataatcaaggaaaactttcccagccttgctagagacctagacatccaaattgaagaa gcacaaagaacacctgggaaattcatcacaaaaagatcatcgcctaggcacattgtcatc aggttatccaaagttaagacaaagcatataaacagaatcaaagacaaaaaccacatgatt atctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaaact ctcaataaattaggtattgatgggacgtatctcaaaataataagagctatctatgacaaa cccacagccaatatcatactgaatgggcaaaaactggaagcattccctctgaaaactggc acaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctggcc agggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaa ttgtccctgtttgcagatgacatgattgtatatctagaaaaccccatcatctcagcccca aatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcactgtgcaa aaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaa ctcacattcacaattgcttcaaagagaataaagtacctaggaatccaacttacaagggac gtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggatata aacaaatggaagaatattccatgctcatgggcaggaagaatcaatatcgtgaaaatggcc atactgcccaaggtaatttatagattcaatgccatccccatcaagctaccagtcactttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatc gccaagtcagtcctaagccaaaagaacaaagctggaagcatcatgctacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatata gatcaatgcaacagaacagagccctcagaaataatgctgcatatctacaaccatctgatc tttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaatggtgc tgggaaaactggctagccataggtagaaagctgaaactggatcccttccttacaccttat acaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccataaaaacc ctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatgtct aaaacaccaaaagcaatggcaacagaagccaaaattgacaaatgggatctaattaaacta aagagcttctgcacagcaaaagcaactaccatcagagtgaacaagcaacctacaaaatgg gagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatccatgatgaa ctccaacaaatttacaaggaaaaaacaacaccatcaaaaagtgggcaaaggatatga >gi568815594r:100315896_100617914|GENSCAN_predicted_peptide_4|179_aa MQALQQPAGLIPVSNSNAFIHGSRYAECLNVTSKGEESDYGQPFLNFTEPAIPLQSHLTA ACSENFPQAHQKMEMISKPQSDKESVKLLTVKTISHESVILNQNKMDEITEIEFRIWMAM KITENQKIETHSEESEEFNKTIQGMKYEIAILRKNKTDLTELKNSPQEFHNTLEVLTAE >gi568815594r:100315896_100617914|GENSCAN_predicted_CDS_4|540_bp atgcaagcacttcagcaaccagccggtcttattccagtgagtaacagcaatgccttcatt catggcagtagatatgcagaatgtttgaatgtgaccagtaaaggagaggaatcagattat gggcagccttttctcaactttactgagcctgcaatcccattacaatcccatctaactgct gcctgctctgaaaatttcccacaagcacaccagaaaatggaaatgatcagtaaacctcag tctgataaagagagcgtgaagcttcttaccgttaagacaatttctcatgagtctgtgatt cttaaccagaataaaatggatgaaattacagaaatagaattcagaatctggatggcaatg aagatcactgaaaatcagaagattgaaacccattccgaggaatctgaggaattcaacaaa accatacaaggaatgaaatatgaaatagccattttaagaaagaacaaaactgatctgaca gagctgaaaaactcaccacaagaatttcataacacattggaagtattaacagcagagtag >gi568815594r:100315896_100617914|GENSCAN_predicted_peptide_5|246_aa MGHRAELLDMKFMAGNRYCCFLFNKLTTGMLSSCSPQSGQVAETPVQSQSLPGLALPPHL TDGWQPEEIPLSGVLEAANNSLVVTTTKPSITTPNTESLQKNVVTPTTGTTPKGTITNEL LKMSLMSTATFLTSKDEGLKATTTDVRKNDSIISNVTVTSVTLPNAVSTLQSSKPKTETQ SSIKTTEIPVGINSKPEETPFCRKKRRPAQLAASDPYATESLPLLEFEKGNYITKCADIN AGTQGT >gi568815594r:100315896_100617914|GENSCAN_predicted_CDS_5|741_bp atgggacatcgagcagaactcttggacatgaagttcatggctggcaatagatattgttgt tttctgtttaacaaactcaccactggaatgctgtcttcctgttctccccagagcggccag gtggctgagacacctgtccaatctcagagcctgccaggcttggcactccctccccacctc acagatggctggcaacctgaggagattcctctttcaggtgttttagaggcagctaataat tcacttgttgttactacaacaaaaccatctataacaacaccaaacacagaatcattacag aaaaatgttgtcacaccaacaactggaacaactcctaaaggaacaatcaccaatgaatta cttaaaatgtctctgatgtcaacagctacttttttaacaagtaaagatgaaggattgaaa gccacaaccactgatgtcaggaagaatgactccatcatttcaaacgtaacagtaacaagt gttacacttccaaatgctgtttcaacattacaaagttccaaacccaagactgaaactcag agttcaattaaaacaacagaaataccagttgggatcaactcaaaaccagaagagacccct ttctgcaggaaaaagaggcgccctgcccaactggcagcctcagacccatatgcaactgaa agtcttcccctactagagtttgaaaaaggaaactacatcactaaatgtgcagacatcaat gcagggacacaaggaacatga >gi568815594r:100315896_100617914|GENSCAN_predicted_peptide_6|160_aa MRQDVNKDIQDWNSALHQADLIDIYGTLHPKSTEYTFFSAPYCTYSKIDHIVGSKALFSK CKRTKITTNCLSQHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYEVNNKIKTEIKMFFET NENNDTAYQNLWDTFKTVCKGKFIAQMPTGESRKDLKSTP >gi568815594r:100315896_100617914|GENSCAN_predicted_CDS_6|483_bp atgagacaggatgttaacaaggatatccaggactggaactcagctctgcaccaagctgac ctaatagacatctacggaactctccaccccaaatcaacagaatatacattcttctcagca ccatattgcacttattccaaaattgaccacatagttggaagtaaagcactcttcagcaaa tgtaaaagaacaaaaatcacaacaaactgtctctcacagcacagtgcaatcaaattagaa ctcaggattaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctgctc ctgaatgactacgaggtaaataacaaaataaagacagaaataaagatgttctttgaaacc aatgagaacaacgacacagcataccagaatctctgggacacatttaaaacagtgtgtaaa gggaaatttatagcacaaatgcccacaggagaaagcaggaaagatctaaaatcaacaccc taa >gi568815594r:100315896_100617914|GENSCAN_predicted_peptide_7|217_aa MKATSVCARVLQLHTQSIMDQQLQHHLGAVRKPTQMRRNQKTNTGNMKKQGSSTPPKIHT SSPAMDPNQEEISDLSEKEFRSTISDHSGIKLEINSKKNFQNHSNTWKLNKQLLSEHWVK NKIKVEIKKFFELNDNNDTTYQNIWDTANAVLRGNLIASNDYIEKSEREQTDNLRSHLME LEKKEQTKPKPSRRKEITKITAELNEIEITTTTTTKI >gi568815594r:100315896_100617914|GENSCAN_predicted_CDS_7|654_bp atgaaggcaacatcagtgtgcgccagagttttgcaactccacacgcagtccatcatggac caacaactgcagcatcacctaggagctgttagaaagcctacccaaatgagaaggaaccag aaaaccaacactggtaatatgaaaaaacaaggctcttcaacacccccaaaaattcacact agttcaccggcaatggatccaaatcaagaagaaatctctgatttatctgaaaaggaattc aggagcactatctcagaccacagtggaataaaactggaaatcaactccaaaaagaacttt caaaaccactcaaatacatggaaattaaataaacagctcttgagtgagcattgggtcaag aacaaaatcaaggtggaaattaaaaaattctttgaactgaatgacaataatgacacaacc tatcaaaacatctgggatacagcaaatgccgtgctaagaggaaatctcatagcctcaaat gactacatcgaaaagtctgaaagagaacaaacagacaatctaaggtcacacctcatggaa ctagagaaaaaagaacaaaccaaaccgaaacccagcagaagaaaggaaataaccaagatc acagcagaactaaatgaaattgaaataacaacaacaacaacaacaaaaatataa >gi568815594r:100315896_100617914|GENSCAN_predicted_peptide_8|86_aa MTVHESSRLDRAKTGSDRAPGNQQYSRGEFTEANRASVSGTLPHPDSLECAGVGSLECSM RRGKAKLNLGSISGKLPKETLKKKQI >gi568815594r:100315896_100617914|GENSCAN_predicted_CDS_8|261_bp atgactgtgcatgaatcatccagactagatagggccaaaacaggttctgacagagcccca ggcaatcagcagtacagcagaggtgaatttactgaagctaatagagcttcagtttcaggg actcttcctcacccagattccttggagtgtgcgggagttgggagtttagagtgttctatg agaagagggaaagccaaactgaatttagggagcatttctggcaaactgcctaaagagact ctcaaaaagaaacagatctga >gi568815594r:100315896_100617914|GENSCAN_predicted_peptide_9|202_aa MKLRTLTVSVIALKVTLLESVPSDVQMCSEFLPSGVKLQTFAVSVMALKVARPELFVLPG GLVVLLGSRVKLQVFAVSVTARKSSVDPKNSGTQLASPSGSRTGAAGGAACQSRALCLHS SALGWWMGLDSMEQGVVLVGEAPAAQEPMEWLREWAPALASPERGSDSAAAKGSSSTAKV GAQAEEAASEGCEDCQHAVTSQ >gi568815594r:100315896_100617914|GENSCAN_predicted_CDS_9|609_bp atgaagctgcggaccctcacggtgagtgttatagcccttaaggtgacgcttctggagtct gtcccttctgatgttcagatgtgttcggagtttcttccttctggagtgaagctgcagacc ttcgcggtgagtgttatggctcttaaggtagcgcgtccggagttgttcgttcttcccggt gggctcgtggtcttgctgggctcaagagtgaagctgcaggtcttcgcggtgagtgttaca gctcgtaaaagcagcgtggacccaaagaactcaggaacccagctggcttcacctagtgga tcccgcaccggggctgcaggtggagctgcctgccagtccagggccctgtgcttgcattct tcagcccttgggtggtggatgggactggactccatggagcagggggtggtgctcgtcggg gaggccccggccgcacaggagcccatggagtggctgagggagtgggctccagccttggcc agcccagaaaggggctccgacagtgcagcagcgaagggctcctcaagtaccgccaaagtg ggagcccaggcagaggaggcagcgagcgagggctgtgaggactgccagcacgctgtcacc tctcaatag