GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:21:27

Sequence gi568815588r:101127273_101328815 : 201543 bp : 50.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4270   4837  568  0  1  109  109   594 0.982  58.83
 1.02 Intr +   6903   7104  202  1  1   68   83   400 0.535  35.84
 1.03 Intr +   9419   9637  219  2  0   55   46   354 0.676  25.32
 1.04 Intr +  12165  12348  184  0  1  105   46    41 0.486   1.39
 1.05 Intr +  12753  12888  136  2  1  118   96    48 0.848   8.74
 1.06 Intr +  14653  14748   96  2  0  132   79    -8 0.779   2.68
 1.07 Intr +  15005  15132  128  0  2   47   72    76 0.891   2.30
 1.08 Intr +  18402  18539  138  2  0   56  105    39 0.434   3.06
 1.09 Intr +  19725  19846  122  2  2   44   61    94 0.302   1.59
 1.10 Intr +  34588  34709  122  1  2   48   76   101 0.123   5.04
 1.11 Intr +  39869  39941   73  0  1   16   92    53 0.002  -3.04
 1.12 Intr +  54366  54473  108  0  0   87   90    36 0.002   3.10
 1.13 Term +  64038  64176  139  0  1   95   35    88 0.076   1.64
 1.14 PlyA +  67398  67403    6                               1.05

 2.00 Prom +  71048  71087   40                              -1.46
 2.01 Init +  78082  78242  161  0  2   67   97    84 0.814   4.51
 2.02 Intr +  83796  83824   29  0  2  124  105    -8 0.608   2.26
 2.03 Intr +  90263  90365  103  0  1   79   14   157 0.849   6.63
 2.04 Intr +  90703  90808  106  1  1   87   97    64 0.308   7.42
 2.05 Term +  99616  99666   51  0  0   90   42    99 0.621   2.93
 2.06 PlyA +  99675  99680    6                               1.05

 3.07 PlyA -  99829  99824    6                               1.05
 3.06 Term - 100518  99998  521  1  2  101   53   991 0.999  91.36
 3.05 Intr - 101658 101219  440  2  2   35  102   635 0.686  52.76
 3.04 Intr - 101931 101660  272  0  2   91   15   166 0.308   5.94
 3.03 Intr - 109071 108917  155  1  2  113   46    41 0.665   2.19
 3.02 Intr - 109640 109493  148  2  1   68   71   117 0.990   7.81
 3.01 Init - 113708 113628   81  2  0   70   96   119 0.994  10.39
 3.00 Prom - 130543 130504   40                              -6.66

 4.10 PlyA - 130631 130626    6                               1.05
 4.09 Term - 133338 133183  156  0  0   92   48    49 0.600  -0.77
 4.08 Intr - 133947 133787  161  1  2   79   58    99 0.146   5.71
 4.07 Intr - 143963 143822  142  2  1  118  115   -59 0.489  -0.27
 4.06 Intr - 147260 147135  126  2  0  124   21    75 0.659   5.28
 4.05 Intr - 164390 164124  267  2  0   73   27   104 0.074   0.53
 4.04 Intr - 170426 170352   75  2  0   99   52    51 0.062   2.31
 4.03 Intr - 174882 174846   37  2  1   80  105    33 0.009   2.46
 4.02 Intr - 178015 177891  125  1  2  127   48     7 0.006  -0.02
 4.01 Intr - 195731 195703   29  0  2  137   61    16 0.143   1.63

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:101127273_101328815|GENSCAN_predicted_peptide_1|744_aa
MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGGG
GSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPLTGSYNVNMALAGGPGPGGGGGSS
GGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWMESN
RRYTKDRFTGHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQKYLASAERAALAKALKM
TDAQVKTWFQNRRTKWRRQTAEEREAERQQANRILLQLQQEAFQKSLAQPLPADPLCVHN
SSLFALQNLQPWSDDSTKITSVTSVASACEPAECSGWALTSCVERSEQVLAAAPGLLQVS
RIHRLWHPAPGRAKSRNSLQHLRSRLPLPGQPPSSGTSQGSHSLTAESTCRPEPSARQLS
DPAQLDADSCRRGHSAAPTEEDLTPFPAGLGVGVVLGAGFRFQPCSSGEPTFLWLLREEP
KDPKAGGRGRDSAATQKWELDLGLGHRNPERQPDVLACLGVSPPEEAKHKRSQRSTELLS
CQGTPAWRCQAMRWRQPLAASLKDRGKGKATRAQVLLISVEEKHPGVPVTWKFPVAVLVQ
RKEFPQMPEFRAAITLNGHKGELVPCNCVFQRLCRVPQGSLQPDPSPDATRTGWVLSPEL
NCGDNSPHSTNNLANFHQFQVRIGSLDSGPDISAVPHVGLCNNQPGFSPNVGIVLEPRPS
KAPPPANITYLIFLFSTSPTLSKR
>gi568815588r:101127273_101328815|GENSCAN_predicted_CDS_1|2235_bp
atggagcacctgggtccgcaccacctccacccgggtcacgcagagcccattagcttcggc
atcgaccagatcctcaacagcccggaccagggtggctgcatgggacccgcctcgcgcctc
caggacggagaatacggccttggctgcttggtcggaggcgcctacacttacggcggcggg
ggctccgcggccgcgacgggggctggaggagcgggggcctatggtactggaggtcccggc
ggccccggaggcccggcaggcggcggcggcgcctgcagcatgggtcctctgaccggctcc
tacaacgtgaacatggccttggcaggcggccccggtcctggcggcggcggcggcagcagc
ggcggtgccggggcactcagcgctgcgggggtaatccgggtgccggcacacaggccgctc
gccggagccgtggcccacccccagcccctggccaccggcttgcccaccgtgccctctgtg
cctgccatgccgggcgtcaacaacctcactggcctcaccttcccctggatggagagtaac
cgcagatacacaaaggacaggttcacaggtcacccctatcagaaccggacgccccccaag
aagaagaagccgcgcacgtccttcacacgcctgcagatctgcgagctggagaagcgcttc
caccgccagaagtacctggcctcggccgagcgcgccgccctggccaaggcgctcaaaatg
accgatgcgcaggtcaaaacctggttccagaaccggcggacaaagtggagacggcagact
gcggaggaacgggaggccgagaggcagcaagcgaaccgcatcctcctgcagttgcagcag
gaggccttccagaagagcctggcacagccgctgcccgctgaccctctgtgcgtgcacaac
tcgtcgctcttcgccctgcagaatctgcagccgtggtctgacgactcgaccaaaatcact
agcgtcacgtcggtggcgtcggcctgcgagcccgcggagtgctcaggttgggccctgacg
agctgtgtggagcgtagtgagcaggttctggcggcagcgccgggtctgctgcaggtctcc
cgcattcaccggctgtggcacccagcgccaggccgcgcaaaatccagaaatagcctccag
cacctcagaagtcgtctccctctacctgggcagcccccatcttcaggaacatcacagggc
tcacactcactaaccgcggagagcacatgcaggccggagccctcagcccggcagctctcg
gaccctgcccagctcgacgcggactcatgcagaagaggacattccgcagcccctacagag
gaggatctaactccattcccagctggcctgggggtgggggttgtcttaggagctggcttc
aggttccagccctgcagctcgggtgaacccacatttctttggttgctgcgagaagagcca
aaagacccaaaagctggaggccgcggccgtgattcagcagccacccagaagtgggagctg
gacctgggacttggtcacagaaacccagaacgacagcctgatgtcctggcttgtcttggg
gtgtcacctccagaagaagccaagcacaaacgttcacagagatccactgagctcctgagc
tgccaagggacacctgcatggaggtgccaggccatgcgctggagacagcccctcgcagcc
tccctgaaggatcggggcaagggcaaggccacgagagctcaggtgctgttaatttccgtt
gaggagaagcaccctggggtccctgtcacttggaaattccctgtggcagtgcttgttcag
aggaaggaattcccccagatgcctgagttcagggctgctatcaccctgaatggccacaag
ggggagctcgtgccctgtaattgtgtgtttcagcgcctgtgccgtgtgccgcaaggatcc
ctgcagccagatccttctcctgatgccaccaggactggatgggtcctcagtcctgaactg
aactgtggtgacaactccccacattccaccaataatcttgccaatttccatcaatttcag
gtccgtattggaagccttgactctggcccagacatttctgctgtacctcatgtaggtctc
tgcaataaccaacctgggttctctcctaatgtggggattgtcctggagcccaggccttcc
aaggctcctccgcccgccaacatcacctacctcatcttcctcttcagcacatccccaaca
ttgtctaagcggtaa
>gi568815588r:101127273_101328815|GENSCAN_predicted_peptide_2|149_aa
MLKGKAATLKSHMPYLAQLPASGPKESRAQAPQPQRTGSILVSHCQLSSAISLRDQLHIF
GGKVSPARIVRSRPQTLNTERKGFHPGDLETSDARATSASSIQPNGLEHSAPSRGNWEAQ
HPAAAERMAGLVPQTRQPESFENIEISEI
>gi568815588r:101127273_101328815|GENSCAN_predicted_CDS_2|450_bp
atgctgaaaggaaaggctgccaccctgaagtcgcacatgccctaccttgcccagctcccg
gcgagcggcccgaaggagagcagagcacaggctccacagcctcagagaacaggctcgatt
ctggtctcccactgccaactgtcatctgcgatcagcctcagagatcagttacatatcttt
ggtgggaaagtctctccggcccgaattgtccgctccaggccccagacgctgaacacggaa
cgaaaaggcttccatcctggagacctggagacgagtgacgccagagcgacctcagccagt
tccatccagccaaacggcctggagcactcagcgcccagccgggggaactgggaggcccaa
cacccggctgcagccgagcgcatggctggcctggtaccgcaaacccggcagcccgagagc
tttgaaaacatcgaaatctccgagatttaa
>gi568815588r:101127273_101328815|GENSCAN_predicted_peptide_3|538_aa
MTAEVVLVPPVATLVALSVTARKADGQGKESPGQDEAARRDPGSEHRSEPGVFTVLGTTP
PTFPVRLGTADAGFSTGPSTFPPGLSSGPMPGLRPAPPDSSETSVKGRELTTINGFWSDF
DVSGPFELSGIGALASALPTAAARTRGQEPLPSHLTPLNSTGVGETGAPRSAPRLRPRIA
GSPSSSGARVSALFPGPRAVLPSSAAALIGPAPGPRPAHAASFPPAAACPRRPAPELPAG
WVPAARAAPAGTPNKAEMTSKEDGKAAPGEERRRSPLDHLPPPANSNKPLTPFSIEDILN
KPSVRRSYSLCGAAHLLAAADKHAQGGLPLAGRALLSQTSPLCALEELASKTFKGLEVSV
LQAAEGRDGMTIFGQRQTPKKRRKSRTAFTNHQIYELEKRFLYQKYLSPADRDQIAQQLG
LTNAQVITWFQNRRAKLKRDLEEMKADVESAKKLGPSGQMDIVALAELEQNSEATAGGGG
GCGRAKSRPGSPVLPPGAPKAPGAGALQLSPASPLTDQPASSQDCSEDEEDEEIDVDD
>gi568815588r:101127273_101328815|GENSCAN_predicted_CDS_3|1617_bp
atgactgcagaggtggttcttgttcccccagtggccaccctagtggctctgtcggtcaca
gcccggaaagcagatggccagggcaaggaaagtcctggccaggacgaagcagcccggagg
gatcccggctctgagcacagatccgagcctggcgtcttcactgtgcttggaaccacgccg
cccacgttccctgtgcggctgggcactgcagatgctgggttttccacaggtccttccacc
tttccacctggcctaagctcaggccccatgcccggcctgcgcccggcgccccccgacagt
agtgagacctctgtgaaagggagggagctgactaccattaacggtttttggtcagacttt
gatgtttctgggccgtttgagttatccgggatcggcgccctcgcctcggctcttccaact
gcggccgccaggacccggggccaggagccactgccgagccacctgacacctttaaatagc
accggggttggcgaaactggagccccgcgcagcgcgccccggctccggccccggattgct
ggaagccccagcagcagcggcgcccgcgtcagcgccctcttcccggggccccgcgctgtt
ctcccctcctcagccgccgcgctaatcggccccgcgcccggcccgcgccctgcccatgcg
gcctcctttccacccgccgctgcctgcccgcgccgtccggcgcccgagctgcccgcgggc
tgggtccccgcggcccgagccgccccggccgggaccccgaacaaggccgagatgacttcc
aaggaggacggcaaggcggcgccgggggaggagcggcggcgcagcccgctggaccacctg
cctccgcctgccaactccaacaagccactgacgccgttcagcatcgaggacatcctcaac
aagccgtctgtgcggagaagttactcgctgtgcggggcggcgcacctgctggccgccgcg
gacaagcacgcgcagggcggcttgcccctggcgggccgcgcgctgctctcgcagacctcg
ccgctgtgcgcgctggaggagctcgccagcaagacgtttaaggggctggaggtcagcgtt
ctgcaggcagccgaaggccgcgacggtatgaccatctttgggcagcggcagacccctaag
aagcggcgaaagtcgcgcacggccttcaccaaccaccagatctatgaattggaaaagcgc
tttctataccagaagtacctgtcccccgccgatcgcgaccaaatcgcgcagcagctgggc
ctcaccaacgcgcaagtcatcacctggttccagaatcggcgcgctaagctcaagcgggac
ctggaggagatgaaggccgacgtagagtccgccaagaaactgggccccagcgggcagatg
gacatcgtggcgctggccgaactcgagcagaactcggaggccacagccggcggtggcggc
ggctgcggcagggccaagtcgaggcccggctctccggtcctccccccaggcgccccgaag
gccccgggcgctggcgccctgcagctctcgcctgcctctccgctcacggaccagccggcc
agcagccaggactgctcggaggacgaggaagacgaagagatcgacgtggacgattga
>gi568815588r:101127273_101328815|GENSCAN_predicted_peptide_4|372_aa
XASEPRAAVKGSAAGFLTLLLISEILGHHLSPSWRQMNLQTGVSSGEGQPGRARPGPHIQ
EVGLVPSHQQSSDECQAPLALQLPLFIPLGGRTATPPNSCPKGRNLLAFLVLALAPWDPR
APGKRPKARSPAQGWPLSPEDPQGRGRRGMEALGDLGLRSPARERGRVSGTSCAKGPSVI
AENGQFLHLSPDLLSESGHRTPVPAPSPIKIPSSVPSPLLATGLWGMPLSLSEMLFPLLF
THVMPLILSISAQLWLPQNSLALTPKAGQQPLAAAFHSGPFEAFGPQAEEPPPAVGLLES
PAGKQTGGGSNPGGEAPGSPDLNLEPDFRESPLLFIHLFSKLLPNACSVPGNRNTLVDTI
ELGPVFMEINFT
>gi568815588r:101127273_101328815|GENSCAN_predicted_CDS_4|1119_bp
natgcctcagagccccgggccgcagttaagggctctgcagctggatttctgactcttcta
ctgatctctgaaatcctggggcaccacctctcaccttcatggagacagatgaaccttcag
acaggggtgtcttcaggggaggggcagcccggcagggccagacctggtcctcatatccag
gaagtaggcctggttcccagccaccaacagtcctctgatgaatgtcaggccccgctggcc
ctgcaattacctctgttcatccctcttggtgggcgtacggcgactccccccaactcctgc
cccaaaggaaggaaccttctggctttcctggtgttggccctggcgccctgggaccctaga
gccccggggaagcgcccaaaggctaggtcaccggcccagggctggccactgtccccggaa
gacccccaaggccgcggccgacgcggcatggaggcgctgggagatctgggccttcgcagc
ccggctagggagcgtggacgagtctccgggacatcctgcgcgaagggtccttctgttatt
gcagagaacggccagttcctgcacttgtccccagacctcctctctgaaagtgggcaccgg
acaccggtccctgcccccagcccaatcaaaattccctcttctgttccctctccactgctg
gccacagggctttggggcatgccgctttccctgtctgaaatgcttttccctcttctcttc
acccatgtaatgcctcttatcctttcaatatcagctcagctgtggcttcctcagaacagc
cttgccctgactcctaaggcaggccagcagcctctggccgcggcattccactctgggccc
tttgaagcctttgggccacaagcggaggagccgccccctgccgtggggctcctagaatct
ccagctggaaaacaaaccggaggaggaagtaatcctggaggggaagctccagggtcccca
gacctcaacctcgaacctgacttcagagaaagcccactcctcttcattcatctgttcagc
aaactcttaccaaacgcttgttctgtgccaggcaacaggaatacattggtggacacgata
gagctaggtcctgtcttcatggagatcaactttacttga
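
A quick consistency check on a report like this one is to sum the .Len column of the coding exons (Init, Intr, Term) for each gene and compare the total with the length in the matching GENSCAN_predicted_CDS header. The sketch below is illustrative only and is not part of GENSCAN: the names `ROW`, `parse_exon_rows`, and `coding_length_by_gene` are hypothetical, and it assumes the exon rows are available one per line. The sample rows are gene 2 from the table above.

```python
import re
from collections import defaultdict

# Exon types that contribute to the CDS; Prom and PlyA rows are signals only.
# "Sngl" (single-exon gene) is included as an assumption; it does not occur here.
CODING_TYPES = {"Init", "Intr", "Term", "Sngl"}

# Matches the leading columns of a row: Gn.Ex, Type, S, .Begin, ...End, .Len
ROW = re.compile(
    r"^\s*(?P<gene>\d+)\.(?P<exon>\d+)\s+(?P<type>\w+)\s+(?P<strand>[+-])"
    r"\s+(?P<begin>\d+)\s+(?P<end>\d+)\s+(?P<length>\d+)"
)

def parse_exon_rows(text):
    """Return one dict per table row; lines that are not rows are skipped."""
    rows = []
    for line in text.splitlines():
        m = ROW.match(line)
        if m:
            rows.append({
                "gene": int(m["gene"]),
                "exon": int(m["exon"]),
                "type": m["type"],
                "strand": m["strand"],
                "begin": int(m["begin"]),
                "end": int(m["end"]),
                "length": int(m["length"]),
            })
    return rows

def coding_length_by_gene(rows):
    """Sum the .Len column over coding exons, keyed by gene number."""
    totals = defaultdict(int)
    for row in rows:
        if row["type"] in CODING_TYPES:
            totals[row["gene"]] += row["length"]
    return dict(totals)

sample = """\
 2.00 Prom +  71048  71087   40                              -1.46
 2.01 Init +  78082  78242  161  0  2   67   97    84 0.814   4.51
 2.02 Intr +  83796  83824   29  0  2  124  105    -8 0.608   2.26
 2.03 Intr +  90263  90365  103  0  1   79   14   157 0.849   6.63
 2.04 Intr +  90703  90808  106  1  1   87   97    64 0.308   7.42
 2.05 Term +  99616  99666   51  0  0   90   42    99 0.621   2.93
 2.06 PlyA +  99675  99680    6                               1.05
"""

print(coding_length_by_gene(parse_exon_rows(sample)))  # {2: 450}
```

The total of 450 agrees with the 450_bp in the GENSCAN_predicted_CDS_2 header. For gene 4, which has no Init exon, the exon lengths sum to 1118 while the CDS header reports 1119_bp; the difference is the single leading `n` in that sequence.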