GENSCAN 1.0 Date run: 8-Nov-116 Time: 06:19:33 Sequence gi568815592r:41698680_41906983 : 208304 bp : 46.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3903 3973 71 2 2 108 100 57 0.843 6.88 1.02 Intr + 6730 6912 183 1 0 95 46 97 0.329 5.10 1.03 Intr + 12637 12719 83 1 2 99 68 60 0.330 4.38 1.04 Intr + 18161 18289 129 0 0 46 60 101 0.097 3.77 1.05 Term + 20539 20552 14 2 2 124 40 18 0.139 -1.04 1.06 PlyA + 20646 20651 6 1.05 2.10 PlyA - 20966 20961 6 1.05 2.09 Term - 38325 38173 153 0 0 60 38 225 0.934 12.62 2.08 Intr - 39149 39051 99 2 0 89 84 83 0.995 8.31 2.07 Intr - 41267 41120 148 1 1 106 64 205 0.990 20.14 2.06 Intr - 41931 41812 120 2 0 133 70 111 0.999 13.41 2.05 Intr - 43810 43611 200 1 2 104 63 341 0.999 31.35 2.04 Intr - 44710 44592 119 2 2 108 74 230 0.999 23.78 2.03 Intr - 45835 45718 118 1 1 63 73 139 0.999 9.94 2.02 Intr - 46129 45979 151 0 1 34 76 302 0.997 23.76 2.01 Init - 48655 48597 59 1 2 86 89 104 0.990 9.08 2.00 Prom - 48752 48713 40 -3.96 3.12 PlyA - 49117 49112 6 -3.64 3.11 Term - 49462 49369 94 0 1 68 43 118 0.234 2.60 3.10 Intr - 49636 49581 56 1 2 78 65 60 0.174 0.48 3.09 Intr - 55306 55219 88 0 1 66 95 57 0.183 4.17 3.08 Intr - 57336 57229 108 2 0 61 32 136 0.181 4.70 3.07 Intr - 57774 57589 186 2 0 92 49 57 0.182 1.00 3.06 Intr - 70040 69891 150 1 0 54 -7 143 0.009 0.68 3.05 Intr - 72854 71944 911 2 2 90 6 761 0.019 58.18 3.04 Intr - 73285 73137 149 2 2 75 99 101 0.961 9.85 3.03 Intr - 74280 74119 162 1 0 117 77 111 0.999 12.85 3.02 Intr - 76926 76740 187 0 1 162 96 289 0.999 36.36 3.01 Init - 78308 78243 66 2 0 107 75 72 0.979 8.87 3.00 Prom - 78735 78696 40 -11.53 4.00 Prom + 79318 79357 40 -6.36 4.01 Init + 80727 80785 59 2 2 64 64 59 0.905 1.88 4.02 Intr + 82745 82841 97 2 1 48 94 79 0.818 4.51 4.03 Intr + 84783 84926 144 2 0 26 67 114 0.835 3.58 4.04 Intr + 86256 86393 138 2 0 47 95 109 0.814 8.16 4.05 Intr + 86658 86861 204 2 0 74 92 188 0.997 17.20 4.06 Intr + 87449 87653 205 1 1 114 87 101 0.996 11.37 4.07 Intr + 88710 88927 218 1 2 95 26 70 0.270 -0.28 4.08 Intr + 88932 89146 215 2 2 -7 76 286 0.223 15.41 4.09 Term + 90889 90976 88 1 1 79 45 62 0.546 -1.87 4.10 PlyA + 91194 91199 6 1.05 5.07 PlyA - 91227 91222 6 1.05 5.06 Term - 93487 93446 42 1 0 102 47 49 0.235 -0.64 5.05 Intr - 98044 97902 143 2 2 102 61 50 0.349 3.77 5.04 Intr - 100250 100045 206 1 2 62 92 214 0.997 18.04 5.03 Intr - 101259 101151 109 1 1 37 99 45 0.792 -0.16 5.02 Intr - 105331 105127 205 1 1 96 79 48 0.950 3.47 5.01 Init - 108304 106949 1356 1 0 70 91 1734 0.982 162.76 5.00 Prom - 112427 112388 40 -5.86 6.07 PlyA - 112693 112688 6 1.05 6.06 Term - 113173 113120 54 1 0 120 42 39 0.359 0.06 6.05 Intr - 115081 114991 91 0 1 88 92 6 0.182 1.00 6.04 Intr - 115533 115432 102 2 0 64 95 36 0.144 1.19 6.03 Intr - 126300 126156 145 1 1 87 95 84 0.285 8.34 6.02 Intr - 139821 139705 117 1 0 72 46 70 0.015 1.64 6.01 Intr - 205744 205637 108 2 0 43 95 43 0.079 0.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 72854 71940 915 2 0 90 50 757 0.963 64.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:41698680_41906983|GENSCAN_predicted_peptide_1|159_aa ERSAKVFMDATSAYVVNEVPPQGRLLGGNQQSGALTSGFPERQPYRKACAHSHVHSHGTA VKSTLFPTTQLLKGQGPSSTLSMKRLQVEDTRTWPSGLNVPTAVALGEGVLTAWPTPWPI TQSPPQLAEDLEHFEPNCCVDGGIGMPACTLHHKPDGTV >gi568815592r:41698680_41906983|GENSCAN_predicted_CDS_1|480_bp gaaagatcagctaaagttttcatggatgctacgtcagcctatgtggtcaatgaggtccct ccccagggaaggttgctggggggtaatcagcagagtggagctctgacttcagggttccct gagaggcagccataccgcaaggcctgtgcccactcccatgtacactcacacgggacagct gtaaagtccaccctcttcccaacgacacagcttctgaagggccagggaccctcatccacc ctgtccatgaagaggctccaggtagaggacactcggacatggccctctgggctcaacgtc cccaccgctgtggccttaggtgaaggtgtgctgactgcctggcctaccccctggcccatc actcagagtcccccacagctagcagaggacctggagcatttcgaacctaactgctgtgtt gatggaggcatcggcatgcccgcgtgcaccctacatcacaagccagatgggaccgtctag >gi568815592r:41698680_41906983|GENSCAN_predicted_peptide_2|388_aa MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSES STYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIM GLAYPALSVDEATTAMQGMVQEGALTSPVFSVYLSNQQGSSGGAVVFGGVDSSLYTGQIY WAPVTQELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQE DEYGQFLVNCNSIQNLPSLTFIINGVEFPLPPSSYILSNNGYCTVGVEPTYLSSQNGQPL WILGDVFLRSYYSVYDLGNNRVGFATAA >gi568815592r:41698680_41906983|GENSCAN_predicted_CDS_2|1167_bp atgaagtggatggtggtggtcttggtctgcctccagctcttggaggcagcagtggtcaaa gtgcccctgaagaaatttaagtctatccgtgagaccatgaaggagaagggcttgctgggg gagttcctgaggacccacaagtatgatcctgcttggaagtaccgctttggtgacctcagc gtgacctacgagcccatggcctacatggatgctgcctactttggtgagatcagcatcggg actccaccccagaacttcctggtcctttttgacaccggctcctccaacttgtgggtgccc tctgtctactgccagagccaggcctgcaccagtcactcccgcttcaaccccagcgagtcg tccacctactccaccaatgggcagaccttctccctgcagtatggcagtggcagcctcacc ggcttctttggctatgacaccctgactgtccagagcatccaggtccccaaccaggagttc ggcttgagtgagaatgagcctggtaccaacttcgtctatgcgcagtttgatggcatcatg ggcctggcctaccctgctctgtccgtggatgaggccaccacagctatgcagggcatggtg caggagggcgccctcaccagccccgtcttcagcgtctacctcagcaaccagcagggctcc agcgggggagcggttgtctttgggggtgtggatagcagcctgtacacggggcagatctac tgggcgcctgtcacccaggaactctactggcagattggcattgaagagttcctcatcggc ggccaggcctccggctggtgttctgagggttgccaggccatcgtggacacaggcacctct ctgctcactgtgccccagcagtacatgagtgctcttctgcaggccacaggggcccaggag gatgagtatggacagtttctcgtgaactgtaacagcattcagaatctgcccagcttgacc ttcatcatcaatggtgtggagttccctctgccaccttcctcctatatcctcagtaacaac ggctactgcaccgtgggagtcgagcccacctacctgtcctcccagaacggccagcccctg tggatcctcggggatgtcttcctcaggtcctactattccgtctacgacttgggcaacaac agagtaggctttgccactgccgcctag >gi568815592r:41698680_41906983|GENSCAN_predicted_peptide_3|718_aa MGSCCSCLNRDSVPDNHPTKFKVTNVDDEGVELGSGVMELTQSELVLHLHRREAVRWPYL CLRRYGYDSNLFSFESGRRCQTGQGIFAFKCSRAEEIFNLLQDLMQCNSINVMEEPVIIT RNSHPAELDLPRAPQPPNALGYTVSSFSNGCPGEGPRFSAPRRLSTSSLRHPSLGEESTH ALIAPDEQSHTYVNTPASEDDHRRGRHCLQPLPEGQAPFLPQARGPDQRDPQVFLQPGQV KFVLGPTPARRHMVKCQGLCPSLHDPPHHNNNNEAPSECPAQPKCTYENVTGGLWRGAGW RLSPEEPGWNGLAHRRAALLHYENLPPLPPVWESQAQQLGGEAGDDGDSRDGLTPSSNGF PDGEEDETPLQKPTSTRAAIRSHGSFPVPLTRRRGSPRVFNFDFRRPGPEPPRQLNYIQV ELKGWGGDRPKGPQNPSSPQAPMPTTHPARSSDSYAVIDLKKTVAMSNLQRALPRDDGTA RKTRHNSTDLPLEMGSDDWPQQDVKRPFEWGPAELKRAAATTPELAPNPSPPYILLISGP EGGADPGFGHHQPGVRPKPEGAGTTFESIPFDVIIGLSYPSVSVLGATAVMDCLVKWNLI PGPCQQDEVTSGEVTFGNMNSQLCTGKADWSPVNARVIGSWAALRNVPSSIQYTSFPPTQ KGSSHLQAKASDLGRYKRHNQKLVWLTLPTTSTRTPDTAISTTLTITIIVTSILATSY >gi568815592r:41698680_41906983|GENSCAN_predicted_CDS_3|2157_bp atggggagctgctgcagctgcctgaacagagacagcgttccagacaaccaccccaccaag ttcaaggtgacaaatgtggatgatgagggggtggagctgggctctggggtgatggagctg acgcagagtgagctggtgctgcacctgcatcggcgtgaggccgtccgctggccttatctc tgcttgcggcgctatggctacgactccaacctcttctcctttgagagtggccgccgatgt cagacaggccagggaatatttgcatttaagtgttcccgggctgaggaaatcttcaacctc cttcaggatctgatgcagtgcaacagcatcaatgtgatggaagagcctgtcatcatcacc cgcaatagccaccccgctgagcttgacctccctcgagccccccagccacccaatgctcta ggctacactgtctccagcttttccaatggctgccctggagagggcccacgattctcagct ccccggcggctctcgacaagcagcctgcggcacccctcgcttggggaagagtccacccat gccctcattgctcctgatgagcagtcccacacctatgtcaacacaccggccagtgaagat gaccaccgcaggggccgccactgcctgcagcccctgcctgagggtcaggcacccttcctc ccgcaggcccggggacctgaccaacgggacccacaggtgttcttgcagccaggccaggtg aagtttgtgttgggcccgacccctgctcggcggcacatggtgaagtgccagggcctctgt cccagcctgcatgaccccccacaccacaataataacaatgaggccccttctgagtgtcca gcccagcccaagtgcacctacgagaacgtcaccggggggctgtggcgaggggctggctgg agactgagcccagaggagccgggctggaatggccttgcccaccgccgggccgccctgctg cactatgagaacctgcccccactgccccctgtgtgggaaagccaagcccagcagctggga ggggaggctggggatgatggggactcgagggatgggctcacaccctcttccaatggcttc cctgatggtgaggaggacgagaccccactgcagaagcccaccagcacccgggccgccatc cgcagccacggcagctttcctgtgccactgacccgccgccgcggctccccaagggtcttc aactttgatttccgccggccggggcccgagcccccaaggcagcttaactacatccaggtg gagctaaagggctggggtggagaccgccctaaggggccccagaacccctcgagcccccaa gcccccatgcccaccacccaccctgcccgaagctcagactcctacgccgtgattgacctc aaaaagaccgtggccatgtccaacctgcagagagctctgccccgagacgatggcaccgcc aggaaaacccggcacaacagcaccgacctgcctcttgaaatgggcagtgacgactggcca caacaggatgtgaaaaggcccttcgagtggggcccagcagagctcaagagggcagctgcc acaaccccagagctggcaccaaacccttctcccccttatatccttttgatctctggacca gaggggggagcagacccagggtttggccatcaccaaccgggagtacggcctaagccagag ggagctggcaccaccttcgaaagcattccctttgatgtgataatagggctgtcctatcca tctgtctctgtacttggggccactgcagtcatggactgccttgttaagtggaatctgatt ccagggccgtgccaacaggatgaagttacttcgggagaagtcacctttgggaacatgaac agccagctgtgcacaggtaaggccgactggagccctgtgaatgccagagttattggcagc tgggctgccctgcgaaatgtgccttcatccatccagtacacctcctttccaccaacccag aaaggaagcagtcatctgcaagcaaaggcctctgatctaggtcggtacaagaggcacaac cagaagctggtgtggcttactctccccaccaccagcaccagaactcccgacactgccatc agcaccaccctcaccatcaccatcattgtaaccagcatcctagctactagctactga >gi568815592r:41698680_41906983|GENSCAN_predicted_peptide_4|455_aa MRDPRNFRAVEAFGGIQEKWGSAVSGYSWCEQPILATFHRKPDLKDRRGRKQALPQMSVQ NSGWPHQEDSPKPQDPGPPANSDSDSGHLPGEDPEDTHAQERYCLALGEEERAELQLFCA RRKQEALGQGVARLVLPKLEGHTCEKCRELLKPGEYGVFAARAGEQRCWHQPCFACQACG QALINLIYFYHDGQLYCGRHHAELLRPRCPACDQLIFSWRCTEAEGQRWHENHFCCQDCA GPLGGGRYALPGGSPCCPSCFENRYSDAGSSWAGALEGQAFLGWDSPVPTILLPSQLNDH STRDSYRLESRRWDREQGTVEPGRAPPLLEFVEAQQRAVPLPDVGLSEQRPHCRPQPEVP DRRCVIHRRRRYGSSTEAHAKLSTMASSTVPVSAAGSANETPEIPDNVGDWLRGVYRFAT DRNDFRRQMESCAEPESSKKQPTICDHHKMCPDGS >gi568815592r:41698680_41906983|GENSCAN_predicted_CDS_4|1368_bp atgagggatcctaggaatttccgggcagtggaagcatttgggggcatccaagagaaatgg ggcagtgctgttagtggctactcctggtgtgaacagcccatcctggccaccttccacagg aaacctgacctgaaggacagaagaggaaggaagcaggctttgccacaaatgtcagtgcag aactctggctggccccaccaagaagacagccccaagccccaggatccaggtccaccagcc aactcagacagtgactcaggccacctgccgggggaggaccctgaggatacccatgctcag gagcgctactgcctggcccttggggaggaggagcgggccgagctgcagctcttctgtgcc aggcggaagcaggaagccctgggacagggggtagcccgcctggtacttcccaagcttgaa ggacacacctgtgagaagtgtagggagctgctgaagccaggggagtacggagtgtttgca gcccgggcaggggaacagcgctgctggcaccagccttgctttgcctgccaggcctgtggc caggccctgataaacctcatctacttctaccatgatggacaactctactgcggccgtcat catgcagagttgctgcgcccgcgctgcccggcttgtgaccagctgatcttctcctggcgc tgcaccgaggcggagggacagcgctggcatgagaaccacttctgttgccaggactgcgcc gggcctctgggcgggggacgttatgccctgcctgggggaagcccctgctgccccagctgc ttcgagaaccgctactcggatgcaggctcgagctgggccggggcactggaagggcaggca ttccttggctgggacagccccgtccccaccatcctcctcccaagccaattaaatgatcac agcacgcgtgacagttaccggctggagagccggaggtgggaccgggagcaggggaccgta gaaccgggccgcgctcctcccctcctagagttcgtggaggcgcagcagagggccgtccct cttccggatgtcggactaagcgaacagcgcccccactgccggccgcagccggaagtgcca gaccggaggtgcgtcattcaccggcgacgccgatacggttcctccaccgaggcccatgcg aagctttccactatggcttccagcactgtcccggtgagcgctgctggctcggctaatgaa actcccgaaataccggacaacgtgggagattggcttcggggcgtctaccgctttgccact gataggaatgacttccggagacaaatggaatcctgtgctgaacccgaatcttccaaaaaa cagcctacaatctgtgaccaccacaagatgtgccctgatggcagctga >gi568815592r:41698680_41906983|GENSCAN_predicted_peptide_5|686_aa MDRCKHVGRLRLAQDHSILNPQKWCCLECATTESVWACLKCSHVACGRYIEDHALKHFEE TGHPLAMEVRDLYVFCYLCKDYVLNDNPEGDLKLLRSSLLAVRGQKQDTPVRRGRTLRSM ASGEDVVLPQRAPQGQPQMLTALWYRRQRLLARTLRLWFEKSSRGQAKLEQRRQEEALER KKEEARRRRREVKRRLLEELASTPPRKSARLLLHTPRDAGPAASRPAALPTSRRVPAATL KLRRQPAMAPGVTGLRNLGNTCYMNSILQVLSHLQKFRECFLNLDPSKTEHLFPKATNGK TQLSGKPTNSSATELSLRNDRAEACEREGFCWNGRASISRSLELIQNKEPSSKHISLCRE LHTLFRVMWSGKWALVSPFAMLHSVWSLIPAFRGYDQQDAQEFLCELLHKVQQELESEGT TRRILIPFSQRKLTKQVLKVVNTIFHGQLLSQVTCISCNYKSNTIEPFWDLSLEFPERYH CIEKGFVPLNQTECLLTEMLAKFTETEALEGRIYACDQCNSKRRKSNPKPLVLSEARKQL MIYRLPQVLRLHLKRFRWSGRNHREKIGVHVVFDQVLTMEPYCCRDMLSSLDKETFAYDL SAVVMHHGKGFGSGHYTAYCYNTEGGFWVHCNDSKLNVCSVEEVCKTQAYILFYTQRTVQ GNARISETHLQAQALTEMALSECGRC >gi568815592r:41698680_41906983|GENSCAN_predicted_CDS_5|2061_bp atggatagatgcaaacatgtagggcggttacggctcgcccaggaccactccatcctgaac cctcagaagtggtgctgcttagagtgtgccaccaccgagtccgtgtgggcctgcctcaag tgctcccacgtggcctgcggccgctatattgaggaccacgccctgaaacactttgaggag acgggacacccgctagccatggaagtccgggatctctacgtgttctgttacctgtgcaag gactacgtgctcaatgataacccagagggggacctgaagctgctaagaagctccctcctg gcggtccggggccagaaacaggacacgccggtgagacgtgggcggacgctgcggtccatg gcttcgggtgaggacgtggtcctgccgcagcgcgctcctcagggacagccgcagatgctc acggctctgtggtaccggcgtcagcgcctgctggccaggacgctgcggctgtggttcgag aagagctcccggggccaggcgaagctggagcagcggcggcaggaggaggccctggagcgc aagaaggaggaggcgcggaggcggcggcgcgaggtgaaacggcggctgctggaggagctg gccagcacccctccgcgcaagagtgcacggctgctcctgcacacgccccgcgacgcgggc ccggctgcctcgcgccccgccgccctccctacctcacgcagagtgcccgccgccacactc aagctgcgtcgccagccggccatggccccaggcgtcacgggcctgcgcaacctgggcaac acctgctacatgaactccatcctccaggtgctcagccacctccagaagttccgagaatgt ttcctcaaccttgacccttccaaaacggaacatctgtttcccaaagccaccaacgggaag actcagctttctggcaagccaaccaacagctcggccacggagctgtccttgagaaatgac agggccgaggcatgcgagcgggagggcttctgctggaacggcagggcctccattagtcgg agtctggagctcatccagaacaaggagccgagttcaaagcacatttccctctgccgtgaa ctgcacaccctcttccgagtcatgtggtccgggaagtgggccctagtgtcgcccttcgcc atgctccactcagtgtggagcctgatccctgccttccgcggctacgaccaacaggacgcg caggaatttctctgcgagctgctgcacaaggtgcagcaggaactcgagtctgagggcacc acacgccggatcctcatccccttctcccagaggaagctcaccaaacaggtcttaaaggtg gtgaataccatatttcatgggcagctgctcagtcaggtcacatgtatatcatgcaattac aaatccaataccattgagcccttttgggacctatccctggaattccctgaacgctatcac tgcatagaaaaggggtttgtccctttgaatcaaacagagtgcttgctcactgagatgctg gccaaattcacagagacagaggccctggaagggagaatctacgcttgtgaccagtgtaac agcaaacgacgaaaatccaatcccaaaccccttgttctgagtgaagctagaaagcagtta atgatctacagactacctcaggttctccggctgcaccttaaaagattcaggtggtctggc cgtaatcatcgagagaagattggggtccatgtcgtctttgaccaggtattaaccatggaa ccttactgctgcagggacatgctctcctctcttgacaaagagacctttgcctatgatctc tccgcagtggtcatgcatcacgggaaagggtttggctcaggacactacacagcctattgc tacaacacagagggaggtttttgggtccactgcaatgactcaaagctgaatgtatgcagt gtcgaggaagtgtgcaaaacccaggcctacatccttttttacactcaaagaacagtgcag ggcaatgcaagaatctcagaaacccatctccaagctcaggcgctgactgagatggcgctg agtgaatgtggaaggtgctaa >gi568815592r:41698680_41906983|GENSCAN_predicted_peptide_6|205_aa XHIRPALSLENTRFKNCHLGTFSFVSIKGNVSLKEAATQWSYQALGWFWGISAKSPVIDS SSGLPAMDNSTCSGGVEFPEVRLLYSCTRDEGSCTVHVFEMPSMPYLKEGPLYIFAMQHI LLDLSTKGVQDIYLKLLIYLKLADVEVFRHIMFFPNYLEAGLPSLNQYGEVQSSPVFFLN GFTLPIWQELVVSINVYPICTLSWY >gi568815592r:41698680_41906983|GENSCAN_predicted_CDS_6|618_bp ncacacattcgaccagctttaagtctagagaacaccagatttaagaactgccacttgggt actttttcctttgtgtcaattaaaggaaatgtgtcactaaaagaagcagctacccagtgg agctaccaggccctgggctggttctggggaatatctgcaaagagtcctgtgatagattcc tcttcaggtctcccagccatggataatagcacctgctctggtggagttgagttcccagaa gtacgtctgctttattcttgtacaagagatgagggttcatgcacagtccacgtctttgaa atgccttctatgccttatttgaaagaaggccccctttatatttttgccatgcagcacatt ctcttggacctatcgaccaagggtgtgcaggacatttatttaaagctgctgatctattta aaacttgctgatgtagaagtattcagacacatcatgttctttcccaattacctagaagct ggtctgcctagcttaaatcagtatggtgaggtgcaaagtagccctgtgtttttcctgaat ggatttactctgcccatatggcaggaattagttgtaagcatcaatgtgtacccaatttgt acccttagttggtactaa