GENSCAN 1.0	Date run: 3-Nov-116	Time: 15:26:01

Sequence gi568815593r:65491772_65724274 : 232503 bp : 37.79% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  37474  37570   97  2  1   56   94    67 0.020   2.25
 1.02 Term +  53750  53954  205  2  1   28   32   184 0.202   2.66
 1.03 PlyA +  54097  54102    6                               1.05

 2.03 PlyA -  54260  54255    6                              -0.45
 2.02 Term -  58161  57640  522  0  0   67   39   238 0.950  10.09
 2.01 Init -  63136  63026  111  1  0   26  100   161 0.996  11.36
 2.00 Prom -  68851  68812   40                              -7.35

 3.00 Prom +  69346  69385   40                              -8.05
 3.01 Init +  71540  71735  196  1  1  100   65   306 0.992  28.64
 3.02 Intr +  75742  75833   92  2  2   86   56    57 0.607   1.09
 3.03 Intr +  78079  78227  149  0  2   63   92   102 0.635   6.21
 3.04 Intr +  80068  80515  448  1  1   56   94   316 0.977  21.42
 3.05 Intr +  85108  85298  191  0  2   54  119   190 0.975  16.16
 3.06 Intr +  87653  87842  190  2  1   62   80   140 0.644   9.27
 3.07 Intr +  91267  91448  182  0  2   27   27   265 0.261  12.04
 3.08 Intr +  93243  93324   82  0  1   68  115   -17 0.205  -2.28
 3.09 Intr +  94228  94410  183  0  0   39  127   109 0.200   8.96
 3.10 Term +  95482  95625  144  0  0   55   38   146 0.978   3.23
 3.11 PlyA +  95653  95658    6                               1.05

 4.11 PlyA -  95756  95751    6                               1.05
 4.10 Term - 100177  99998  180  1  0   70   34   201 0.999   9.53
 4.09 Intr - 102874 102750  125  2  2   62   95    54 0.677   2.88
 4.08 Intr - 105409 105280  130  1  1   17  111   131 0.621   7.65
 4.07 Intr - 113274 113140  135  0  0   61   90   152 0.961  12.54
 4.06 Intr - 117687 117472  216  0  0   88   96   198 0.967  18.38
 4.05 Intr - 119272 119090  183  1  0   30   22   215 0.996   8.16
 4.04 Intr - 120110 119832  279  2  0   60   86   126 0.913   6.35
 4.03 Intr - 122448 122327  122  1  2   72   57   121 0.993   6.69
 4.02 Intr - 126484 126322  163  1  1   35   63   154 0.966   6.23
 4.01 Init - 130263 130261    3  0  0  108   81     0 0.552   1.35
 4.00 Prom - 130732 130693   40                              -8.15

 5.02 PlyA - 130881 130876    6                              -0.45
 5.01 Sngl - 132503 132171  333  2  0   88   43   240 0.904  15.49
 5.00 Prom - 134656 134617   40                              -7.15

 6.00 Prom + 135942 135981   40                              -6.65
 6.01 Init + 137817 138168  352  2  1   61   32   168 0.543   5.87
 6.02 Intr + 143530 143598   69  2  0   89   60    52 0.508   0.74
 6.03 Intr + 144173 144272  100  0  1   63   86   101 0.986   5.65
 6.04 Intr + 145925 146009   85  2  1   98  106    33 0.997   5.00
 6.05 Intr + 155284 155411  128  0  2   69   75   131 0.856   8.46
 6.06 Intr + 168928 169126  199  1  1   74  101    95 0.813   7.83
 6.07 Term + 170279 170383  105  1  0   79   38    85 0.539   0.03
 6.08 PlyA + 171741 171746    6                               1.05

 7.10 PlyA - 172359 172354    6                               1.05
 7.09 Term - 178586 178475  112  1  1   87   34   166 0.960   8.15
 7.08 Intr - 180227 180144   84  1  0   79   94    59 0.481   3.62
 7.07 Intr - 180510 180473   38  0  2   71   87    27 0.477  -2.96
 7.06 Intr - 189023 188885  139  1  1  107   95   122 0.943  14.35
 7.05 Intr - 193701 193597  105  2  0  113   98    92 0.997  11.11
 7.04 Intr - 212607 212508  100  1  1   53   97    41 0.678  -0.25
 7.03 Intr - 213025 212916  110  0  2   58  100    89 0.222   6.11
 7.02 Intr - 227408 227321   88  0  1   46   64    46 0.127  -3.89
 7.01 Init - 229036 228937  100  1  1   34   91   136 0.497   8.97

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:65491772_65724274|GENSCAN_predicted_peptide_1|100_aa
XLFLNDSQIYTVVQDTSIQTSVVKKPSRKHLKRNEREIKKFSDKEKLRQFVTSRPTLKER
LKEVLETETIKGILEDQEGRTIERTKTWTKATINPTVTRK

>gi568815593r:65491772_65724274|GENSCAN_predicted_CDS_1|303_bp
nttttatttctgaacgatagtcaaatttacacagtggtccaggatacttccattcagaca
tcagtagtgaaaaagcctagcagaaagcatttgaaaaggaatgaaagagaaatcaagaaa
ttctcagataaagaaaaactacgacaatttgtcaccagcagaccaactctaaaagaacgg
ctcaaggaagttcttgaaacagaaacaataaaaggaattttagaagatcaggaaggaaga
acaatagaaagaacaaaaacatggactaaagctaccataaatcctacagtcacaaggaaa
taa

>gi568815593r:65491772_65724274|GENSCAN_predicted_peptide_2|210_aa
MNQEDLDPDSTTDVGDVTNTEEELIRECEEMWKDMEESPKVQEKFLKEIKDVALVNTQIM
RKQNSLIAVIEKVLLVYLDDQTSHHSLSQSVIHNKALALFNSTKAERDEEAAEEKCEASR
GWFMRFKERSHLHNIKMQGEAASTEAEASTSYPEDLAKILEGGYDKQQIFNVDRTALYWK
KMPSRTLIVREKLMPGSKASKGQVDSLDKD

>gi568815593r:65491772_65724274|GENSCAN_predicted_CDS_2|633_bp
atgaatcaggaggatctagatccggatagtactacagatgtgggagatgttacaaatact
gaagaagaacttattagagaatgtgaagaaatgtggaaagatatggaagaatctcccaaa
gtgcaggaaaaatttttaaaggaaattaaagatgttgctctagtgaacacacaaattatg
agaaaacagaatagccttattgctgttatagagaaagttttactagtctacttagatgat
caaaccagccatcattccctaagccaaagcgtaatccacaacaaagccctagctcttttc
aattctacaaaggctgagagagatgaggaagctgcagaagaaaagtgtgaagctagcaga
ggttggttcatgaggtttaaggaaagaagccatctccataacataaaaatgcaaggggaa
gcagcaagtactgaggcagaagcgtcaacaagttatccagaagatctagctaagatactt
gaaggtggctatgataaacagcaaattttcaatgtagacagaacagccctctattggaag
aagatgccatctaggactcttatagttagagagaagttgatgcctggctccaaagcttca
aaaggacaggttgactctcttgataaggactag

>gi568815593r:65491772_65724274|GENSCAN_predicted_peptide_3|618_aa
MAAESGSDFQQRRRRRRDPEEPEKTELSERELAVAVAVSQENDEENEERWVGPLPVEATL
AKKRKVLEFERVYLDNLPSASMYERSYMHRDVITHVNVLISFIEIGVIESIAVSSEGALF
CSVGDDKAMKVFDVVNFDMINMLKLGYFPGQCEWIYCPGDAISSVAASEKSTGKIFIYDG
RGDNQPLHIFDKLHTSPLTQIRLNPVYKAVVSSDKSGMIEYWTGPPHEYKFPKNVNWEYK
TDTDLYEFAKCKAYPTSVCFSPDGKKIATIGSDRKVRIFRFVTGKLMRVFDESLSMFTEL
QQMRQQLPDMEFGRRMAVERELEKVDAVRLINIVFDETGHFVLYGTMLGIKVINVETNRC
VRILGKQENIRVMQLALFQGIAKKHRAATTIEMKASENPVLQNIQADPTIVCTSFKKNRF
YMFTKREPEDTKSADSDRDVFNEKPSKEEVMAATQAEGPKRVSDSAIIHTSMGDIHTKLF
PVECPKTVENFCVHSRNGYYNGHTFHRIIKGFMIQTGDPTGTGMGGESIWGGEFEDEFHS
TLRHDRPYTLSMANAGSNTNGSQFFITVVPTPWLDNKHTVFGRVTKGMEVVQRISNVKVN
PKTDKPYEDVSIINITVK

>gi568815593r:65491772_65724274|GENSCAN_predicted_CDS_3|1857_bp
atggcggcggaaagtggtagcgattttcagcagagacgtagaaggcgccgggacccggag
gaaccggaaaaaacagaactcagcgaaagagagctggcagtagcagtggcggtgtcccag
gagaacgatgaggagaacgaagagcgctgggttggacctttacctgtggaggcaacactg
gccaagaagaggaaagtcttagagtttgaaagagtctatcttgataatctccccagtgca
tccatgtatgagcgcagttacatgcatagagatgttatcacccatgtgaatgttttaatt
tcatttattgaaataggagttattgagagtattgcagttagctctgagggagcattgttc
tgttctgtgggtgatgataaagcaatgaaggtgtttgatgtagtgaactttgacatgatc
aacatgctgaaacttggctattttcctggacagtgtgagtggatctattgcccaggggat
gcaatttcttcagttgctgcttccgaaaagagtacaggaaaaattttcatttatgatggc
cgaggagataaccagccacttcatatttttgacaaactccatacatcacctcttactcag
atacggctaaacccagtttacaaagcagtagtgtcttctgacaaatctgggatgattgaa
tactggactgggcctcctcatgaatataaattccccaaaaatgtgaactgggaatataaa
actgacactgatttatatgaatttgccaagtgtaaggcttatccaaccagcgtatgtttt
tcaccagatgggaagaaaatagctactattggttctgatagaaaagttagaattttcaga
tttgtaactggaaaactcatgagagtctttgatgaatcactaagcatgtttactgaactg
caacagatgaggcaacagttaccagacatggaatttggccgacgaatggctgtagaacgt
gagttggagaaggttgatgctgtaagattaattaatatagtttttgatgaaactggacac
ttcgtgctgtatggaacaatgctgggcattaaagttataaatgtagagacaaaccggtgt
gtgcggattttaggcaaacaagaaaatattagagtgatgcaattggctttgttccagggg
atagccaaaaagcatcgtgctgcaactactatagaaatgaaagcttctgaaaatcctgtt
cttcagaatattcaagctgacccaacaatagtctgtacatcattcaaaaagaatagattt
tatatgtttaccaaacgagaaccagaagatacgaaaagtgcagattctgatcgagatgtt
tttaatgagaaaccttctaaagaagaagtcatggcagctactcaagctgaaggacctaaa
cgagtttcggacagtgccattatccacaccagcatgggagacattcacaccaaacttttt
cctgttgagtgccctaagacagtggaaaacttctgtgttcacagcagaaatggttattat
aatgggcatacatttcaccgtataattaagggctttatgattcagactggagatccaaca
ggtactggtatgggaggagaaagcatatggggaggagaatttgaagatgaatttcattca
acattacgacatgacaggccatacacactcagcatggctaacgcgggatcaaatactaat
ggatcccagtttttcataacggtagtaccaacgccttggcttgataataagcatacagta
tttggacgagtgactaaaggaatggaagttgtacagaggatctccaacgtcaaagtcaat
cccaaaacagataagccctatgaggatgtcagcatcataaatattactgtcaagtaa

>gi568815593r:65491772_65724274|GENSCAN_predicted_peptide_4|511_aa
MVLECGVCEDVFSLQGDKVPRLLLCGHTVCHDCLTRLPLHGRAIRCPFDRQVTDLGDSGV
WGLKKNFALLELLERLQNGPIGQYGAAEESIGISGESIIRCDEDEAHLASVYCTVCATHL
CSECSQVTHSTKTLAKHRRVPLADKPHEKTMCSQHQVHAIEFVCLEEGCQTSPLMCCVCK
EYGKHQGHKHSVLEPEANQIRASILDMAHCIRTFTEEISDYSRKLVGIVQHIEGGEQIVE
DGIGMAHTEHVPGTAENARSCIRAYFYDLHETLCRQEEMALSVVDAHVREKLIWLRQQQE
DMTILLSEVSAACLHCEKTLQQDDCRVVLAKQEITRLLETLQKQQQQFTEVADHIQLDAS
IPVTFTKDNRVHIGPKMEIRVVTLGLDGAGKTTILFKLKQDEFMQPIPTIAVVFVVDSSH
RDRISEAHSELAKLLTEKELRDALLLIFANKQDVAGALSVEEITELLSLHKLCCGRSWYI
QGCDARSGMGLYEGLDWLSRQLVAAGVLDVA

>gi568815593r:65491772_65724274|GENSCAN_predicted_CDS_4|1536_bp
atggtgctagagtgtggagtttgtgaagatgtcttttctttgcaaggagacaaagttccc
cgtcttttgctttgtggccataccgtctgtcatgactgtctcactcgcctacctcttcat
ggaagagcaatccgttgcccatttgatcgacaagtaacagacctaggtgattcaggtgtc
tggggattgaaaaaaaattttgctttattggagcttttggaacgactgcagaatgggcct
attggtcagtatggagctgcagaagaatccattgggatatctggagagagcatcattcgt
tgtgatgaagatgaagctcaccttgcctctgtatattgcactgtgtgtgcaactcatttg
tgctctgagtgttctcaagttactcattctacaaagacattagcaaagcacaggcgagtt
cctctagctgataaacctcatgagaaaactatgtgctctcagcaccaggtgcatgccatt
gagtttgtttgcttggaagaaggttgtcaaactagcccactcatgtgctgtgtctgcaaa
gaatatggaaaacaccagggtcacaagcattcagtattggaaccagaagctaatcagatc
cgagcatcaattttagatatggctcactgcatacggaccttcacagaggaaatctcagat
tattccagaaaattagttggaattgtgcagcacattgaaggaggagaacaaatcgtggaa
gatggaattggaatggctcacacagaacatgtaccagggactgcagagaatgcccggtca
tgtattcgagcttatttttatgatctacatgaaactctgtgtcgtcaagaagaaatggct
ctaagtgttgttgatgctcatgttcgtgaaaaattgatttggctcaggcagcaacaagaa
gatatgactattttgttgtcagaggtttctgcagcctgcctccactgtgaaaagactttg
cagcaggatgattgtagagttgtcttggcaaaacaggaaattacaaggttactggaaaca
ttgcagaaacagcagcagcagtttacagaagttgcagatcacattcagttggatgccagc
atccctgtcacttttacaaaggataatcgagttcacattggaccaaaaatggaaattcgg
gtcgttacgttaggattggatggtgctggaaaaactactatcttgtttaagttaaaacag
gatgaattcatgcagcccattccaacaattgctgttgtgtttgttgtagatagcagtcat
agagacagaattagtgaagcacacagcgaacttgcaaagttgttaacggaaaaagaactc
cgagatgctctgctcctgatttttgctaacaaacaggatgttgctggagcactgtcagta
gaagaaatcactgaactactcagtctccataaattatgctgtggccgtagctggtatatt
cagggctgtgatgctcgaagtggtatgggactgtatgaagggttggactggctctcacgg
caacttgtagctgctggagtattggatgttgcttga

>gi568815593r:65491772_65724274|GENSCAN_predicted_peptide_5|110_aa
MATLVVNKLGAGVDSGRQGSRGTAVVKVRDLACSILHHRPPFGSSWRPGGIAAAQASIGG
MRGLWAPFGITSLVLSSASFLSYGLKLPGLLLMTCHHRDICEKELAGQWQ

>gi568815593r:65491772_65724274|GENSCAN_predicted_CDS_5|333_bp
atggctaccctggttgtaaacaagctcggagcgggagtagacagtggccggcagggcagc
cgggggacagctgtagtgaaggtgagggaccttgcgtgctctattctccaccatcggcct
cccttcggttcatcctggagacctggcggcatcgccgcggcccaggcttcgataggtggg
atgcgcggcctctgggccccttttggcattacaagtctcgtcctgtcctctgcatccttt
ctctcctatggcctgaaattgcctggtcttctgctcatgacatgtcatcatcgggacatt
tgtgaaaaggagcttgcgggccagtggcagtga

>gi568815593r:65491772_65724274|GENSCAN_predicted_peptide_6|345_aa
MTTEVILHYRPCESDPTQLPKIAEKAIQDFPTRPLSRFIPWFPYDGSKLPLRPKRSPPVI
SEEAAEDVKQYLTISEHDAKSHSYDCTVDLLEFQPSLKKQHLTWSHTLKEQTNSGNLVMR
LTKPTLFTNIPVTCEEKDLPGDLFNQLMRDDPSTVNGAEVLMLGEMLTLPQNFGNIFLGE
TFSSYISVHNDSNQVVKDILVKADLQTSSQRLNLSASNAAVAELKPDCCIDDVIHHEVKE
IGTHIVSTFGSRAYLQPMDTRQYLYCLKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGER
GRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCR

>gi568815593r:65491772_65724274|GENSCAN_predicted_CDS_6|1038_bp
atgactacagaagtaatattacattatcgaccatgtgagagtgatcccacacaactgcca
aaaattgcagaaaaagcaattcaagactttcctactcgtccgctatcaagatttatacct
tggtttccatatgatgggtccaagcttccactcagacctaaaagatcaccacctgtgatt
tctgaagaggcagctgaagatgtgaaacagtacttaaccatttcagaacatgatgctaag
tcacacagttatgattgcacagtagatctattggagtttcaacctagcttgaaaaagcag
catttaacctggtcacacacactgaaggaacagactaattctggaaatctggtgatgcgg
ctgactaagcctactttattcaccaatatcccagtaacatgtgaagagaaagacttacct
ggagatctctttaaccagctgatgagagatgatccttcaaccgttaatggtgcagaagtt
ttaatgttgggagaaatgctgactttaccacagaattttgggaatatatttttgggagag
accttttccagttatatcagcgttcataatgatagcaatcaagttgtaaaagacatatta
gtaaaagctgatcttcagacaagttctcagcgtttaaatctttcagcctccaatgctgca
gtggctgaacttaaaccggattgttgtattgatgatgtcatacatcatgaagtcaaagaa
attggaacacacattgtgtctacgtttgggtcaagagcatatttgcaaccaatggataca
cgccagtacttatactgcctaaagccaaagaatgaatttgcagaaaaagcaggcatcatt
aagggagtaacagtaattggaaaattggatatagtatggaaaacaaatctaggtgaaagg
ggaaggttacagaccagccaacttcaaagaatggctccaggttatggagatgttaggttg
tctttggaggcaataccagataccgtaaaccttgaagaaccttttcatattacctgtaaa
ataacaaactgcaggtaa

>gi568815593r:65491772_65724274|GENSCAN_predicted_peptide_7|291_aa
MSSIKHLVYAVIRFLREQSQMDTYTSDEQESLEGGGDENWAFALNLLAVWLVDVFPGGSE
RSRERVEDQVQIEMVIREGVASGTGWGSSKREERDEDKEGNNHMKEENYAAAVDCYTQAI
ELDPNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEA
VTSYQKALDLDPENDSYKSNLKIAEQKLREVSSPAASLMQNPQVQQLMSGMMTNAIGGPA
AGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHIRSRSFSSSAEEHS

>gi568815593r:65491772_65724274|GENSCAN_predicted_CDS_7|876_bp
atgtcatctatcaagcacctggtttatgcagttattcgtttcttacgggaacaaagtcag
atggacacttacacctcggatgaacaagaaagtttggaaggaggtggagatgaaaattgg
gcatttgctctaaacctccttgcagtctggttagttgatgtctttccaggtggcagtgag
aggagcagggaaagagtggaagaccaggtccagattgagatggtgattagggagggtgtt
gcatctggaacgggctggggcagctctaaaagggaggagagagatgaagataaggaaggc
aataaccacatgaaagaagaaaattatgctgctgcagtggattgttacacacaggcaata
gaattggatcccaataatgcagtttactattgcaacagggctgctgctcagagcaaatta
ggtcactacacagatgcgataaaggattgtgaaaaagcaatagcaattgattcaaagtac
agcaaggcctatgggagaatggggctggccctcactgccttgaataaatttgaagaagca
gttacaagttatcaaaaggcattagatcttgaccctgaaaatgattcctataagtcaaat
ctgaaaatagcagaacagaagttaagagaggtatccagtcctgcggcaagtttaatgcag
aaccctcaagttcaacagctaatgtcaggaatgatgacaaatgccattgggggacctgct
gctggagttgggggcctaactgacctgtcaagcctcatccaagcgggacagcagtttgct
cagcagatacagcaacaaaatcctgaacttatagagcaactgagaaatcacatccggagc
agatcattcagcagcagcgctgaagagcattcctga