GENSCAN 1.0    Date run: 4-Nov-116    Time: 21:25:48

Sequence gi568815588f:101031542_101236910 : 205369 bp : 50.90% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3151   3314  164  0  2  135   60   177 0.996  18.37
 1.02 Intr +   3956   4130  175  2  1   89   22   243 0.892  17.74
 1.03 Intr +   4403   4560  158  1  2   52   78    61 0.628   0.31
 1.04 Intr +   4945   5020   76  1  1  116   98    52 0.964   8.52
 1.05 Intr +   5182   5267   86  0  2   94  105    91 0.956  10.02
 1.06 Intr +   5535   5662  128  0  2  139   82   115 0.992  16.32
 1.07 Intr +   5841   5890   50  1  2   98  102    63 0.995   7.10
 1.08 Intr +   7102   7151   50  0  2  121  119    -8 0.962   3.08
 1.09 Term +   7948   8044   97  1  1   93   55    93 0.969   3.94
 1.10 PlyA +   9479   9484    6                               1.05

 2.00 Prom +  13208  13247   40                              -4.76
 2.01 Init +  15266  15333   68  1  2   67   96    80 0.822   5.40
 2.02 Intr +  16549  16621   73  1  1   94   81    35 0.634   2.81
 2.03 Term +  16709  17074  366  1  0   93   45   112 0.563   2.00
 2.04 PlyA +  18385  18390    6                               1.05

 3.00 Prom +  26730  26769   40                              -2.96
 3.01 Init +  31052  31562  511  1  1   78   75   711 0.748  61.89
 3.02 Intr +  32720  32880  161  0  2   99   74   212 0.886  20.61
 3.03 Intr +  32960  33107  148  1  1   56   95   152 0.934  12.51
 3.04 Term +  33285  33379   95  1  2  112   42   107 0.991   6.49
 3.05 PlyA +  34026  34031    6                               1.05

 4.05 PlyA -  34596  34591    6                               1.05
 4.04 Term -  58582  58553   30  1  0  108   44    51 0.121   0.55
 4.03 Intr -  62129  61994  136  1  1  108   60    23 0.079   2.07
 4.02 Intr -  83699  83578  122  2  2  120   32    58 0.105   2.69
 4.01 Init -  87621  87562   60  0  0   89   69    17 0.096   1.16
 4.00 Prom -  89157  89118   40                              -4.76

 5.00 Prom +  94882  94921   40                              -2.26
 5.01 Init + 100001 100568  568  1  1  109  109   594 0.976  58.83
 5.02 Intr + 102634 102835  202  2  1   68   83   400 0.535  35.84
 5.03 Intr + 105150 105368  219  0  0   55   46   354 0.676  25.32
 5.04 Intr + 107896 108079  184  1  1  105   46    41 0.486   1.39
 5.05 Intr + 108484 108619  136  0  1  118   96    48 0.848   8.74
 5.06 Intr + 110384 110479   96  0  0  132   79    -8 0.779   2.68
 5.07 Intr + 110736 110863  128  1  2   47   72    76 0.891   2.30
 5.08 Intr + 114133 114270  138  0  0   56  105    39 0.434   3.06
 5.09 Intr + 115456 115577  122  0  2   44   61    94 0.302   1.59
 5.10 Intr + 130319 130440  122  2  2   48   76   101 0.123   5.04
 5.11 Intr + 135600 135672   73  1  1   16   92    53 0.002  -3.04
 5.12 Intr + 150097 150204  108  1  0   87   90    36 0.002   3.10
 5.13 Term + 159769 159907  139  1  1   95   35    88 0.076   1.64
 5.14 PlyA + 163129 163134    6                               1.05

 6.00 Prom + 166779 166818   40                              -1.46
 6.01 Init + 173813 173973  161  1  2   67   97    84 0.814   4.51
 6.02 Intr + 179527 179555   29  1  2  124  105    -8 0.608   2.26
 6.03 Intr + 185994 186096  103  1  1   79   14   157 0.849   6.63
 6.04 Intr + 186434 186539  106  2  1   87   97    64 0.308   7.42
 6.05 Term + 195347 195397   51  1  0   90   42    99 0.621   2.93
 6.06 PlyA + 195406 195411    6                               1.05

 7.05 PlyA - 195560 195555    6                               1.05
 7.04 Term - 196249 195729  521  2  2  101   53   991 0.999  91.36
 7.03 Intr - 197389 196950  440  0  2   35  102   635 0.650  52.76
 7.02 Intr - 197662 197391  272  1  2   91   15   166 0.281   5.94
 7.01 Intr - 204802 204648  155  2  2  113   46    41 0.223   2.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:101031542_101236910|GENSCAN_predicted_peptide_1|327_aa
KMGELPLDINIQEPRWDQSTFLGRARHFFTVTDPRNLLLSGAQLEASRNIVQNYRAGVVT
PGITEDQLWRAKYVYDSAFHPDTGEKVVLIGRMSAQVPMNMTITGCMLTFYRQGSKDEGH
CRRGRSECLCSLRKTPTVVFWQWVNQSFNAIVNYSNRSGDTPITVRQLGTAYVSATTGAV
ATALGLKSLTKHLPPLVGRFVPFAAVAAANCINIPLMRQRELQVGIPVADEAGQRLGYSV
TAAKQGIFQVVISRICMAIPAMAIPPLIMDTLEKKDFLKRRPWLGAPLQVGLVGFCSIHI
SNLEPELRAQIHEQNPSVEVVYYNKGL

>gi568815588f:101031542_101236910|GENSCAN_predicted_CDS_1|984_bp
aaaatgggtgaattgcctttagacatcaacatccaggaacctcgctgggaccaaagtact
ttcctgggcagagcccggcactttttcactgttactgatcctcgaaatctgctgctgtcc
ggggcacagctggaagcttctcggaacatcgtgcagaactacagggccggcgtggtgacc
ccagggatcaccgaggaccagctgtggagggccaagtatgtgtatgactccgccttccat
ccggacacaggggagaaggtggtcctgattggccgcatgtcagcccaggtgcccatgaac
atgaccatcactggctgcatgctcacattctacaggcaggggtctaaagatgagggccac
tgtagacggggcagaagtgagtgtctttgttccctcaggaagaccccaaccgtggtgttc
tggcagtgggtgaatcagtccttcaatgccattgttaactactccaaccgcagtggtgac
actcccatcactgtgaggcagctggggacagcctatgtgagtgccaccactggagctgtg
gccacggccctgggactcaaatccctcaccaagcacctgccccccttggtcggcagattt
gtgccctttgcagcagtggcagctgccaactgcatcaacatccccctgatgaggcagaga
gagctgcaggtgggcatcccggtggctgatgaggcaggtcagaggcttggctactcggtg
actgcagccaagcagggaatcttccaggtggtgatttcaagaatctgcatggcgattcct
gccatggccatcccaccactgatcatggacactctggagaagaaagacttcctgaagcgc
cgcccctggctgggggcacccctgcaggtgggactggtgggcttctgctccatacacata
agcaacctggaaccagagctgagagctcagatccatgagcaaaaccccagcgttgaagtg
gtctactacaacaaggggctttga

>gi568815588f:101031542_101236910|GENSCAN_predicted_peptide_2|168_aa
MEAEDRGAPLCVYALYLPPSGTSPGSFIKGLRQLCSSPAAAPLALRKDIRNRQPRLIVAR
LRGREMSPDTAAAALASREPTRHGLGAPPAAAVAPLRLPFFSSYHSPAHLRLSGESSNGI
ASVSVLCCVADLGALEGKSWHFGEEGLALSRRLLQLSCCLDLRTSNLK

>gi568815588f:101031542_101236910|GENSCAN_predicted_CDS_2|507_bp
atggaggctgaggaccgaggggcccctctttgtgtctatgctctgtatctcccgccgagc
ggcaccagcccgggcagcttcattaagggcctccgccagctctgcagctcacctgctgca
gcgcccctcgcgctccgcaaggacattagaaaccgtcaaccaagattaatagttgcccgg
ctccggggcagggagatgagcccagatactgcggcggcggcgctcgcttcgcgcgagcca
acccggcacgggctgggcgcaccacctgccgctgctgtcgctccactccgccttcctttc
ttctcctcctaccactcgcccgcgcacctgaggctctcaggtgagtccagcaatgggatt
gcgagcgtcagtgttctctgctgcgtcgccgacctgggcgcgctcgaaggcaagtcttgg
cacttcggggaagaagggctcgccctctccaggagactcctccagctttcctgctgcctc
gatttaaggacttccaaccttaaatga

>gi568815588f:101031542_101236910|GENSCAN_predicted_peptide_3|304_aa
MLPPPRPAAALALPVLLLLLVVLTPPPTGARPSPGPDYLRRGWMRLLAEGEGCAPCRPEE
CAAPRGCLAGRVRDACGCCWECANLEGQLCDLDPSAHFYGHCGEQLECRLDTGGDLSRGE
VPEPLCACRSQSPLCGSDGHTYSQICRLQEAARARPDANLTVAHPGPCESGPQIVSHPYD
TWNVTGQDVIFGCEVFAYPMASIEWRKDGLDIQLPGDDPHISVQFRGGPQRFEVTGWLQI
QAVRPSDEGTYRCLGRNALGQVEAPASLTVLTPDQLNSTGIPQLRSLNLVPEEEAESEEN
DDYY

>gi568815588f:101031542_101236910|GENSCAN_predicted_CDS_3|915_bp
atgctgccgccgccgcggcccgcagctgccttggcgctgcctgtgctcctgctactgctg
gtggtgctgacgccgcccccgaccggcgcaaggccatccccaggcccagattacctgcgg
cgcggctggatgcggctgctagcggagggcgagggctgcgctccctgccggccagaagag
tgcgccgcgccgcggggctgcctggcgggcagggtgcgcgacgcgtgcggctgctgctgg
gaatgcgccaacctcgagggccagctctgcgacctggaccccagtgctcacttctacggg
cactgcggcgagcagcttgagtgccggctggacacaggcggcgacctgagccgcggagag
gtgccggaacctctgtgtgcctgtcgttcgcagagtccgctctgcgggtccgacggtcac
acctactcccagatctgccgcctgcaggaggcggcccgcgctcggcccgatgccaacctc
actgtggcacacccggggccctgcgaatcggggccccagatcgtgtcacatccatatgac
acttggaatgtgacagggcaggatgtgatctttggctgtgaagtgtttgcctaccccatg
gcctccatcgagtggaggaaggatggcttggacatccagctgccaggggatgacccccac
atctctgtgcagtttaggggtggaccccagaggtttgaggtgactggctggctgcagatc
caggctgtgcgtcccagtgatgagggcacttaccgctgccttggccgcaatgccctgggt
caagtggaggcccctgctagcttgacagtgctcacacctgaccagctgaactctacaggc
atcccccagctgcgatcactaaacctggttcctgaggaggaggctgagagtgaagagaat
gacgattactactag

>gi568815588f:101031542_101236910|GENSCAN_predicted_peptide_4|115_aa
MHLYYPPLKSMHSEMHTTTQVFGWFLKLNPPGFLLHLMQQEKFGQEGAVGADQASSGNTQ
GLFGTPLMSPGHPKTSPELTPLLTLTWTPVIRPAAAEAEKPQGLHGGCVILEATT

>gi568815588f:101031542_101236910|GENSCAN_predicted_CDS_4|348_bp
atgcatctgtactatcccccactgaaaagcatgcactcagagatgcatacaaccacacag
gtctttggctggtttttgaagctgaacccacctggttttctccttcatctgatgcagcaa
gagaaatttggccaggagggggccgttggtgcagaccaagcatccagtgggaacacacag
gggctctttgggactccattaatgagccccggccaccccaagacaagccccgaattaact
ccactgctgaccctgacatggacacctgtaattaggccagcagcagctgaagcagagaag
ccccaggggctccatgggggctgcgtcatcctggaagcaaccacttga

>gi568815588f:101031542_101236910|GENSCAN_predicted_peptide_5|744_aa
MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGGG
GSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPLTGSYNVNMALAGGPGPGGGGGSS
GGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWMESN
RRYTKDRFTGHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQKYLASAERAALAKALKM
TDAQVKTWFQNRRTKWRRQTAEEREAERQQANRILLQLQQEAFQKSLAQPLPADPLCVHN
SSLFALQNLQPWSDDSTKITSVTSVASACEPAECSGWALTSCVERSEQVLAAAPGLLQVS
RIHRLWHPAPGRAKSRNSLQHLRSRLPLPGQPPSSGTSQGSHSLTAESTCRPEPSARQLS
DPAQLDADSCRRGHSAAPTEEDLTPFPAGLGVGVVLGAGFRFQPCSSGEPTFLWLLREEP
KDPKAGGRGRDSAATQKWELDLGLGHRNPERQPDVLACLGVSPPEEAKHKRSQRSTELLS
CQGTPAWRCQAMRWRQPLAASLKDRGKGKATRAQVLLISVEEKHPGVPVTWKFPVAVLVQ
RKEFPQMPEFRAAITLNGHKGELVPCNCVFQRLCRVPQGSLQPDPSPDATRTGWVLSPEL
NCGDNSPHSTNNLANFHQFQVRIGSLDSGPDISAVPHVGLCNNQPGFSPNVGIVLEPRPS
KAPPPANITYLIFLFSTSPTLSKR

>gi568815588f:101031542_101236910|GENSCAN_predicted_CDS_5|2235_bp
atggagcacctgggtccgcaccacctccacccgggtcacgcagagcccattagcttcggc
atcgaccagatcctcaacagcccggaccagggtggctgcatgggacccgcctcgcgcctc
caggacggagaatacggccttggctgcttggtcggaggcgcctacacttacggcggcggg
ggctccgcggccgcgacgggggctggaggagcgggggcctatggtactggaggtcccggc
ggccccggaggcccggcaggcggcggcggcgcctgcagcatgggtcctctgaccggctcc
tacaacgtgaacatggccttggcaggcggccccggtcctggcggcggcggcggcagcagc
ggcggtgccggggcactcagcgctgcgggggtaatccgggtgccggcacacaggccgctc
gccggagccgtggcccacccccagcccctggccaccggcttgcccaccgtgccctctgtg
cctgccatgccgggcgtcaacaacctcactggcctcaccttcccctggatggagagtaac
cgcagatacacaaaggacaggttcacaggtcacccctatcagaaccggacgccccccaag
aagaagaagccgcgcacgtccttcacacgcctgcagatctgcgagctggagaagcgcttc
caccgccagaagtacctggcctcggccgagcgcgccgccctggccaaggcgctcaaaatg
accgatgcgcaggtcaaaacctggttccagaaccggcggacaaagtggagacggcagact
gcggaggaacgggaggccgagaggcagcaagcgaaccgcatcctcctgcagttgcagcag
gaggccttccagaagagcctggcacagccgctgcccgctgaccctctgtgcgtgcacaac
tcgtcgctcttcgccctgcagaatctgcagccgtggtctgacgactcgaccaaaatcact
agcgtcacgtcggtggcgtcggcctgcgagcccgcggagtgctcaggttgggccctgacg
agctgtgtggagcgtagtgagcaggttctggcggcagcgccgggtctgctgcaggtctcc
cgcattcaccggctgtggcacccagcgccaggccgcgcaaaatccagaaatagcctccag
cacctcagaagtcgtctccctctacctgggcagcccccatcttcaggaacatcacagggc
tcacactcactaaccgcggagagcacatgcaggccggagccctcagcccggcagctctcg
gaccctgcccagctcgacgcggactcatgcagaagaggacattccgcagcccctacagag
gaggatctaactccattcccagctggcctgggggtgggggttgtcttaggagctggcttc
aggttccagccctgcagctcgggtgaacccacatttctttggttgctgcgagaagagcca
aaagacccaaaagctggaggccgcggccgtgattcagcagccacccagaagtgggagctg
gacctgggacttggtcacagaaacccagaacgacagcctgatgtcctggcttgtcttggg
gtgtcacctccagaagaagccaagcacaaacgttcacagagatccactgagctcctgagc
tgccaagggacacctgcatggaggtgccaggccatgcgctggagacagcccctcgcagcc
tccctgaaggatcggggcaagggcaaggccacgagagctcaggtgctgttaatttccgtt
gaggagaagcaccctggggtccctgtcacttggaaattccctgtggcagtgcttgttcag
aggaaggaattcccccagatgcctgagttcagggctgctatcaccctgaatggccacaag
ggggagctcgtgccctgtaattgtgtgtttcagcgcctgtgccgtgtgccgcaaggatcc
ctgcagccagatccttctcctgatgccaccaggactggatgggtcctcagtcctgaactg
aactgtggtgacaactccccacattccaccaataatcttgccaatttccatcaatttcag
gtccgtattggaagccttgactctggcccagacatttctgctgtacctcatgtaggtctc
tgcaataaccaacctgggttctctcctaatgtggggattgtcctggagcccaggccttcc
aaggctcctccgcccgccaacatcacctacctcatcttcctcttcagcacatccccaaca
ttgtctaagcggtaa

>gi568815588f:101031542_101236910|GENSCAN_predicted_peptide_6|149_aa
MLKGKAATLKSHMPYLAQLPASGPKESRAQAPQPQRTGSILVSHCQLSSAISLRDQLHIF
GGKVSPARIVRSRPQTLNTERKGFHPGDLETSDARATSASSIQPNGLEHSAPSRGNWEAQ
HPAAAERMAGLVPQTRQPESFENIEISEI

>gi568815588f:101031542_101236910|GENSCAN_predicted_CDS_6|450_bp
atgctgaaaggaaaggctgccaccctgaagtcgcacatgccctaccttgcccagctcccg
gcgagcggcccgaaggagagcagagcacaggctccacagcctcagagaacaggctcgatt
ctggtctcccactgccaactgtcatctgcgatcagcctcagagatcagttacatatcttt
ggtgggaaagtctctccggcccgaattgtccgctccaggccccagacgctgaacacggaa
cgaaaaggcttccatcctggagacctggagacgagtgacgccagagcgacctcagccagt
tccatccagccaaacggcctggagcactcagcgcccagccgggggaactgggaggcccaa
cacccggctgcagccgagcgcatggctggcctggtaccgcaaacccggcagcccgagagc
tttgaaaacatcgaaatctccgagatttaa

>gi568815588f:101031542_101236910|GENSCAN_predicted_peptide_7|462_aa
XPSTFPPGLSSGPMPGLRPAPPDSSETSVKGRELTTINGFWSDFDVSGPFELSGIGALAS
ALPTAAARTRGQEPLPSHLTPLNSTGVGETGAPRSAPRLRPRIAGSPSSSGARVSALFPG
PRAVLPSSAAALIGPAPGPRPAHAASFPPAAACPRRPAPELPAGWVPAARAAPAGTPNKA
EMTSKEDGKAAPGEERRRSPLDHLPPPANSNKPLTPFSIEDILNKPSVRRSYSLCGAAHL
LAAADKHAQGGLPLAGRALLSQTSPLCALEELASKTFKGLEVSVLQAAEGRDGMTIFGQR
QTPKKRRKSRTAFTNHQIYELEKRFLYQKYLSPADRDQIAQQLGLTNAQVITWFQNRRAK
LKRDLEEMKADVESAKKLGPSGQMDIVALAELEQNSEATAGGGGGCGRAKSRPGSPVLPP
GAPKAPGAGALQLSPASPLTDQPASSQDCSEDEEDEEIDVDD

>gi568815588f:101031542_101236910|GENSCAN_predicted_CDS_7|1389_bp
ngtccttccacctttccacctggcctaagctcaggccccatgcccggcctgcgcccggcg
ccccccgacagtagtgagacctctgtgaaagggagggagctgactaccattaacggtttt
tggtcagactttgatgtttctgggccgtttgagttatccgggatcggcgccctcgcctcg
gctcttccaactgcggccgccaggacccggggccaggagccactgccgagccacctgaca
cctttaaatagcaccggggttggcgaaactggagccccgcgcagcgcgccccggctccgg
ccccggattgctggaagccccagcagcagcggcgcccgcgtcagcgccctcttcccgggg
ccccgcgctgttctcccctcctcagccgccgcgctaatcggccccgcgcccggcccgcgc
cctgcccatgcggcctcctttccacccgccgctgcctgcccgcgccgtccggcgcccgag
ctgcccgcgggctgggtccccgcggcccgagccgccccggccgggaccccgaacaaggcc
gagatgacttccaaggaggacggcaaggcggcgccgggggaggagcggcggcgcagcccg
ctggaccacctgcctccgcctgccaactccaacaagccactgacgccgttcagcatcgag
gacatcctcaacaagccgtctgtgcggagaagttactcgctgtgcggggcggcgcacctg
ctggccgccgcggacaagcacgcgcagggcggcttgcccctggcgggccgcgcgctgctc
tcgcagacctcgccgctgtgcgcgctggaggagctcgccagcaagacgtttaaggggctg
gaggtcagcgttctgcaggcagccgaaggccgcgacggtatgaccatctttgggcagcgg
cagacccctaagaagcggcgaaagtcgcgcacggccttcaccaaccaccagatctatgaa
ttggaaaagcgctttctataccagaagtacctgtcccccgccgatcgcgaccaaatcgcg
cagcagctgggcctcaccaacgcgcaagtcatcacctggttccagaatcggcgcgctaag
ctcaagcgggacctggaggagatgaaggccgacgtagagtccgccaagaaactgggcccc
agcgggcagatggacatcgtggcgctggccgaactcgagcagaactcggaggccacagcc
ggcggtggcggcggctgcggcagggccaagtcgaggcccggctctccggtcctcccccca
ggcgccccgaaggccccgggcgctggcgccctgcagctctcgcctgcctctccgctcacg
gaccagccggccagcagccaggactgctcggaggacgaggaagacgaagagatcgacgtg
gacgattga